Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about --nbr_fracs #64

Open
anchormok opened this issue Sep 25, 2023 · 5 comments
Open

Question about --nbr_fracs #64

anchormok opened this issue Sep 25, 2023 · 5 comments

Comments

@anchormok
Copy link

Hi everyone,
A wonderful package to link scRNA and scTCR!!
Recently I test the run_conga.py script and found that CoNGA scores in TCR2D was easily influenced by argument 'nbr_fracs'. When I use the default option (0.01)(above), it seems that less association between gex and tcr. If I set to 0.5 (below), obvious association was present. And you emphasized the smalllish nbr_fracs in your script. So is there any suggestions in selecting the suitable value for nbr_fracs?

by the way, is the "length" in TCR.csv necessary for analyses? I have a TCR.csv file without "length" (It is not standard output from 10x). I want to include this file in CoNGA input.

Looking forward to your reply. Thanks!

merge_graph_vs_graph_logos
test1_graph_vs_graph_logos

@phbradley
Copy link
Owner

phbradley commented Sep 28, 2023 via email

@anchormok
Copy link
Author

Thanks for your sugguestions!
After confirming the special cells cluster identified by CoNGA score, I used the Seurat to analyze their transcriptional characteristics and met some questions. For example, total 1400 interested cells with unique TCR sequence were identified by CoNGA (because CoNGA choose the representative cell for clone), but in my seurat object, 1585 interested cells with the share TCR sequence were found. Should I used the 1400 cells or 1585 cells for analyses? If the answer is 1585 cells, does it mean that the similar TCR sequence present the similar transcriptional characteristics?

Thanks again for your reply

@anchormok
Copy link
Author

by the way, I found the Top 9 DE genes logo in "graph_vs_graph_logos.png" were too crowded to present. Is there some parameters to solve this item??
Thanks a lot!
Snipaste_2023-10-08_16-50-40

@phbradley
Copy link
Owner

Re: 1485 vs 1500, it probably depends on your dataset, and there may not be much difference. Generally speaking, cells with the same clonotype do tend to have similar GEX profiles, which supports conga's reduction to clonotypes. But in a dynamic setting there can be substantial heterogeneity.

Regarding the logos, the genes themselves are probably getting written out somewhere to a TSV file. But the TCR logos also don't look good, so it might be worth fixing that and then seeing if the DEGs are more interpretable. I think the problem is with the conversion from SVG to PNG. If you attach the log file, maybe we could figure out which tool is being used and swap it out for something that performs better on your system. I find Inkscape to be the most reliable across linux and mac.

@anchormok
Copy link
Author

An excellent suggestion!!
The question about DEG and TCR logo presentation have been solved since I employed the "Inkscape" for the conversion from SVG to PNG.
Thanks for your reply!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants