Gene Information |
Locus tag | CNA01270 |
Symbol | |
ID | 3253767 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 343821 |
End bp | 345760 |
Gene Length | 1940 bp |
Protein Length | 429 aa |
Translation table | |
GC content | 51% |
IMG OID | 638252460 |
Product | U1 snRNP 70K protein (short form), putative |
Protein accession | XP_566577 |
Protein GI | 58258329 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCAAAAACA TCCTCAATAA AGCCCCACGA AGGGCCGATA ACCAACATGT CACACCTCCT TCCTCCTAAC CTCCTGAAGC TTTTCGCCCC TCGACCTCAG CCGCCCTTCT TAAAACCCCT GACGAGGGAC GAACGCATCC GAGGTCCAAA CAACCTTGGG GGTGTTGCCG GCCTTGCTAA GCGTATAAAA GAAGAAGCCG AAGATGCTGA GGTCAAGCAA GGGATGGGCA TGAATCCCGA AAAAGCTCTC GATGAACAAA AGGAGGAAAA TGTAAAGATG GAATTGAAGC AAGATGGCGA AGATGGAGAG GTCGCGGAGG ATTCTGGCAA GGAGAAGAAG AAAAAGACAG CTAGAGATAA GATTGCGGAG ATGGGTATCA TTGGAGAGGA GGCTGTCAAA ATGAGGAAGG AATTGAGGAA GAAGAGGCAA GAGGAGTACA AGAAGAATGC CGAGAGCAAC TGTGGGTATT ATCTAACCGT CTATGTAAGG TCACTAACAT TTCTGCAGAT AAACCTCAGG ATGACGCTCA AGCCATCGGT GACCCTTATA AAACCCTTTT CATCTCAAGG CTAGTAAGTA GATGTTGTAA ACCTATCTGA CCGTTCTCAC TGACACGATG TTCCACAGTC TAAGAAAGCG AACGAGACAG ACCTTCGCCG AGAGTTTGAA ATGTATGGCC CTATCGAGCG GATACGTATC GTTCGAAATC GAAAGGGCAA GAGTAACGGC TACGCCTTTA TTGTTTATGA GCGAGAACGA GATATGAAAG GTGGGCCTTC AATTGCTGTT CAGAAGGTTA ATAACTTACG ATTGGCTAGC CGCATACAAG GACGCTGAAG GCATTCCTAT TCACCACAAG AAGATTCTTG TCGATGTCGA GCGTGGACGC ACTGTTAAAG GATGGAAACC TCAACGTCTC GGTGGAGGTC TTGGTGGCCG TCCCAAACCC GTCGCCCCAG AAGCCTCCCC TGCACCGTAT GTCGCTCCCA GCAACCTCCG AGGTGGCCGA GGAGGGTTCC GTGGAGGAGG ACGAGGCGGC GGTGCTGGTT TCCGAGGTGG TTTTCAAGGG AACAGGGGAG GGTTTGGCGG AGGTAACAGA GGAGGATTTG GCGGAGGGGA TGACAGAGGA GGTTTCGCTG GACGAGGCGG TTTCCGAGGG GGTTTCCAAA ATCGAGGAGA TCGTGGTCCT GGTGGATTTG GCCAACAACA GGGAGGGTTT GGCGGTCCAG GAGGCCCAGG TGGATACGGT GGACAGGGAG GAGGGTACGG TGGCGGTGGT GGCGGCTTCG GGTAAGTTCT CATCCGATAC GCACTGTCCG TCTGTGGTAC TAACAAGTCA GCAGCGGTCC CGGCCAGCAA AATGGAGTTC AAGGCGGACA AGGTGGAGGA GGCTTTGGGG GGGGTGGAGG TTACAAACGC GATTACGACA ACGCCGGTGG ACCTGGTGGT TATGGTGGAG GAGGCGGCGG TGGTGGTGGT GGGTTCGGGG GAGGCGGTGG AGGTTACCAG GACAGGGATC CCAAGAGGAT GCGGTATTGA GATGTTCTAT TTATTTGATT CATTTACTCA ATTTTCAGTG CGGTCCGATG AGGAGATATT TGCGGAAGCC GTACGGAGCA TTGTGGGTGC ATTTGCACAG GTCTAAGTAT GTAAAACAAG GTATAGCATT TTCTCGGGTT CTGTCTGTTC GTGTGCTTCC TGTATCGACC TACCATCACC ACGTTCTTGG CACTTGAGGA AGCCTAATGA TAGGAACAAC ATTATTTAAT AAATGGTGGG TATCGGTAGG TGCCTGTACT 
CGATCAGCAA GTCACCCCCT TCTTTTATCA CATTATCGAC AACTGAATTG AGGCTAGATA AAGTGTTGGG AGACTGACCT CAGAGAAGTA GGCGTACTGT ATTTGGATAG GTAGCAGGCA TATACTTCGC TTCGAAGCAG
|
Protein sequence | MSHLLPPNLL KLFAPRPQPP FLKPLTRDER IRGPNNLGGV AGLAKRIKEE AEDAEVKQGM GMNPEKALDE QKEENVKMEL KQDGEDGEVA EDSGKEKKKK TARDKIAEMG IIGEEAVKMR KELRKKRQEE YKKNAESNYK PQDDAQAIGD PYKTLFISRL SKKANETDLR REFEMYGPIE RIRIVRNRKG KSNGYAFIVY ERERDMKAAY KDAEGIPIHH KKILVDVERG RTVKGWKPQR LGGGLGGRPK PVAPEASPAP YVAPSNLRGG RGGFRGGGRG GGAGFRGGFQ GNRGGFGGGN RGGFGGGDDR GGFAGRGGFR GGFQNRGDRG PGGFGQQQGG FGGPGGPGGY GGQGGGYGGG GGGFGGPGQQ NGVQGGQGGG GFGGGGGYKR DYDNAGGPGG YGGGGGGGGG GFGGGGGGYQ DRDPKRMRY