Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI00920 |
Symbol | |
ID | 3259395 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | - |
Start bp | 228034 |
End bp | 229558 |
Gene Length | 1525 bp |
Protein Length | 401 aa |
Translation table | |
GC content | 52% |
IMG OID | 638258577 |
Product | small nuclear ribonucleoprotein, putative |
Protein accession | XP_572846 |
Protein GI | 58271380 |
COG category | [A] RNA processing and modification |
COG ID | [COG5200] U1 snRNP component, mediates U1 snRNP association with cap-binding complex |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTTGAGACA TCCAACATGG GTCGATTAGC CGAAGCACAG AGAAGGCTCC TCGAGGTGAG CAAGTTACCG GAGCATATGC GAAATCCTTT CTTACCTTCT GCAGCAAATG ATGGGCCCCG AAGCAATGGG TATCCAGGCG GTCAATCTCG ACTGGTGGAA CGAAAAAGTC TGTAGGAACT TCCTCTTCGG AACATGTCTG CACACCTTGT TTGGTAACAC CGTAAGTTAC AATGGCTTAG GTTAAAGATG TACCGGTCTT ATAGAAGCTG TCATAGAAAA TGGATCTTGG ACCATGCCCC AAGGTTCATT CTGACCGTAT CCTCAAACAA TTTCGAGAGC ATGCCGAAGC GAACCCCAAC GATCCCAGGC TCTCTGCTTT CAGACAAGAG CACGAAAACA GCCTGTACTC ATTTGTGGAG GATTGTGACA GAAGGATAAG GGCGAGTCAG AGAAAGTTGG AAAAGACACC CGAAGAGAAC AGAAAGACCG TCGATCTCGT GAGTGGGGTC TGCTGGCCAG CTGAGTTGAA ATACTGATAA GGTGTAGATG AGGGAAATTG GAGAAATTGA GCTTTCTATC CAAGGTGGTA CAGAGGAAAT TGAGGCGCTT GGTGAGGCAG GAAAGGTTGA GGAGTCTATG GAGAAACTTG CCGCCGTCGA CGCTCTCAAG GCTATGAAGG CCGAAAAAGA AAAGGATCTT CAGCACCTCA ACGAAAACGC TGGTGCAAGT GGTCACCAGA AGCTTCGTGT CTGCGAGACT TGCGTATGTT TGATCCCTTT TCCCCATTGG CTCTCAACTG ACAAATGTAT AAAGGGTGCC ATGTTGAGTG TTCTTGACTC TGACAAACGT CTTGCCGACC ACTTCGGTGG TAAGCTCCAT TTGGGTTACC ACGAGCTCCG TAAAATCCTC AGTGCCTTCT CCGAAGCCCG CATGACCGGT CGACCCATAC CCATCATCCC TCCCAAATCC CCTCGAGCTG ACGGGGATGA GCCTTTACCG TTCTCCGCCC CCGCCATCCC TGCCACCGCA CCTGCTGCCC CCACTGGTCC TCGTTCTGGG CTAAACCCCC CTATGGGCCC TTCAGGCAAC GAGCCGCACA CGCCCCACCA CGCTCGTGTT CCCCCTGTTG AAGAGATGCC CGTTGTGGGA CATGGCGACA AGGTGAAGCG CGAGGCTGGC GAGCTGGTGG AGGATCTGAA GGAAGAGAGG GCGATGGAAG AAAGGGATAA GGAGAGGGAG AGGTATAGGG AACGAGAGAG AGATAGGGAT GATAGACATG ACAGAAATAG GTATGACGAT AGGGATAGGG ACAGGTATAG GGACCGAGAC AGGGAGAGGG ACAGCAGGTA TGGAGGAGAC AGAGATAGGA AGAAGAGCTA CGAGTAAGTG CTGCATCTGA TTGAAGACGG GTTGTGCTAA TGGTGGCAAC AGCCGCGAGA GATCGAGGTC TCCTGTCAAG CGACGCGTTC TGTAACAAAC TCTGGGCCTT GTTGAAGTGG TAAATATGCA GTTTGATCGA CTATC
|
Protein sequence | MGRLAEAQRR LLEQMMGPEA MGIQAVNLDW WNEKVCRNFL FGTCLHTLFG NTKMDLGPCP KVHSDRILKQ FREHAEANPN DPRLSAFRQE HENSLYSFVE DCDRRIRASQ RKLEKTPEEN RKTVDLMREI GEIELSIQGG TEEIEALGEA GKVEESMEKL AAVDALKAMK AEKEKDLQHL NENAGASGHQ KLRVCETCGA MLSVLDSDKR LADHFGGKLH LGYHELRKIL SAFSEARMTG RPIPIIPPKS PRADGDEPLP FSAPAIPATA PAAPTGPRSG LNPPMGPSGN EPHTPHHARV PPVEEMPVVG HGDKVKREAG ELVEDLKEER AMEERDKERE RYRERERDRD DRHDRNRYDD RDRDRYRDRD RERDSRYGGD RDRKKSYDRE RSRSPVKRRV L
|
| |