Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI01130 |
Symbol | |
ID | 3259645 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 278522 |
End bp | 282287 |
Gene Length | 3766 bp |
Protein Length | 1154 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258600 |
Product | small nuclear ribonucleoprotein, putative |
Protein accession | XP_572829 |
Protein GI | 58271346 |
COG category | [A] RNA processing and modification |
COG ID | [COG5181] U2 snRNP spliceosome subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.500955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCGAGGAAA CATTCTGTCC ACCAGCTCAC CACGAACCAT GTCTGACGAA GAGTACCCAC AGCTCCAGTC GTACACGGCT CCCCAGGACA TTCTCGATGA GCACGCCGAG CTCGCAGACG AGGACGAGAC CCCGGATGCC TTCCAGGATA AGGCCAAGTC AAAGCAAGTA GCTGCTAGGC AAAGCGACTA CCACTTGCGT AGGTTCAACA GGACAGATGG CCAGGGTGAA GGCGAGGATG AAAGCTATGA AGAACGTATG AGGAGAATCA ATTTGGAGAA GGAGGAGGAG AAGGTTCGAA GATACAAGGA AAAGATGGAG AAGGAGGAAA AGGAGAGAGG AGAAATGCGA GTGGACGACA AGACCCCGCC GAGAGAGCTC ACAGGTGGTG ATACCCCACC GCGTAAGGCT CTTATTGGGG ACGAGACGCC CCCTAGAGCT CAGGCTGGTG ATGTTACTCC TCCCCGCAAG AAGCGAAGAT GGGATACTGA GCCCGAAGTT AAGCAAGAAG TCAAGGAAGA GGTCAAGGAG GAGGTCAAGG AAGAAGAGCC CAAAAAACGT CGCTCTCGAT GGGATCAGAC TCCAGCTGAG GCACCACCCG AGAAAAAGCG TTCCCGATGG GATCAAACTC CTGCGCAGAC TGCGTCCTCT AGTGTTCTCG CTGCTCCTAC CAACCTCGCC AAACCTTCCG GCATCGTCCT GGTCGAAGAC AAGCGATACC GGCGCATGAC TGACGAAGAG CTCGACTCAT TGCTTCCTGG ATCTGAAGAA GGTTACGTTG TGGTACCCGT ACCAGACGAC TATCACCCCG CCCCTTCAGT TCGCAAGATG GTTCCCCAGC AGGCTGAAGC TGGATTTATG ATGCAGGATG AAACAGATGC TGCACGAGCG CGAGCAGCTG CTGGTGGTTT ACAGGGTACA ACTGAGCAAA CAGAGATTGA TGGAATTGGG ACTCTCCAAT TCCTCAAGCC TGAAGATACC CAGTACTTTG CCAAGGTTCT CGGCGAAGGC GGTGGTGAAG AAAACGACGC AGAATACACT GTTGAAGAGC TCAAAGAACG CAAAATCATG CGACTTTTGC TCAAAATCAA GAACGGTACC CCTCCCGTCC GTAAAACCGC TTTGCGTCAA ATCACCGATC GTGCGCGAGA GTTCGGTGCG GGTCCATTAT TCGACAAAAT CCTTCCTCTT CTTATGGAGC GTACGCTCGA AGACCAAGAA CGACATTTGC TTGTCAAGGT CATCGACCGT GTTCTCTATA AGCTTGATGA CCTTGTTCGT CCATATGTCC ACAAGATCCT TGTCGTCATC GAGCCTCTTC TTATCGATGA GGATTACTAC GCTCGAGTCG AAGGTCGGGA AATCATTTCC AACCTCGCCA AAGCAGCCGG TCTTGCCCAC ATGATCTCCA CCATGCGTCC CGATATCGAC CACGTTGATG AGTACGTCCG TAACACCACT GCCAGAGCCT TTTCGGTTGT AGCCTCCGCC CTTGGCATCC CTGCTCTTCT TCCTTTCCTT CGTGCTGTCT GTCGATCAAA AAAGTCTTGG CAGGCGAGAC ATACCGGTAT CCGTATTATC CAACAAATTG CCATTATGTC AGGCTGCGCA GTCCTTCCCC ATCTTCGTAA CCTCGTCGAC GCCATTGCCG ATGGTTTGAA AGATGAGCAA CAAAAGGTTC GAACTATGAC CGCTCTCGCC CTTGCTGGTC TTGCCGAAGC TGCCGCGCCT TACGGTATCG AGTCATTCGA CAACGTTCTC AAACCTTTGT GGCTCGGTAT CAGACAGCAC CGAGGTAAGA CTCTCGCCGC CTTCCTCAAA GCCATCGGTT ACATTATCCC TCTTATGGAC CCCGAGTACG CCGGATACTA TGTGCGAGAG TGTATGCCTA TTTTGATCCG AGAATTCCAG ACTTCTGATG AAGAAATGAG AAGGATTGTC TTGCAAGTCA TCAAGCAGTG TGCCTCCACC GAGGGTGTTA CTCCTTCATA CATCAAAGAA GAGGTTCTCC CCGAGTTCTT CAAGGCTTTC TGGGTCAGGC GTATGGCGTT AGACAAAAGG AATTACAAGC AGCTTGTAGA GACCACTGTC GAGCTGGCCA ACAAGGCTGG TGTCGCAGAG ATCGTCGGTA GGACAGTCAA TGATTTGAAG GACGAAAGTG AACCTTTCCG AAAGATGGTT ATGGAGACCA TCACCAAGGT GGTTTCCAAC ATTGGTGCTG CCGATGTCGA CGAACGTCTA GAAGTCTTGT TAATTGACGG TATTATCTAT TCCTTCCAGG AGCAGACTTT TGAAGACACT GTCATGCTTG ACGGTTTCGC CACTGTTGTC GCCTCTCTCG GGCCTCGTGT CAAGCCTTAC CTTCCACAAA TCGTTTCCAT GATCCTTTGG AGATTGACCA ACAAGTCGGC CAAGGTGCGA ATGTTGGCAG CTGACTTGAC CACCAAGTTG GCACCGATTA TCAAGAGCTG TAAGGAGGAT GTACTGTTGA GCAAGTTGGG TGTTGTTATC TTCGAGCAGT TGGGTGAAGA GTATCCGGAC GCTTTGGGAA GGTCAGTCTA TTTACCAAAC ATGAAATGAG CATAAGCTGA CTCTTTACTA GTCTTATCGC CGCGGAGGGG GCTATCGCGA ACGTTGTTGG TATGACAGAG ATGAACCCGC CCGTCAAGGA TCTTCGTGAG TGACATTGTT GATGTGACGA ATGAGCCAAT TGTTAATACA TTTTGGCAGT CCCCAGAATG ACCCCCATTC TTCGAAACCG ACACGAAAAG GTTCAAGAAG CGACCATCAA CTTGATTGGT CGTATTGCCG ATCGAGGTGC CGAATATGTC CCTGCTAAGG AATGGATGCG AATCTGTTTC GAGCTTCTTG ACCTGCTCAA GGCCCATAAG CGAGCGATTC GACGAGCGGC TGTCAACTCT TTCGGTTATA TCGCCAAGGC TATTGGTCCT CAGGATGTGT TGAGTGTCCT CTTGACAAAT TTGAAGGTGC AGGAGCGACA GAGTAGGGTG TGTAGTACCG TCGCTATCGG TAAGCATCGA TTTTCAAATG ACTCTGCACA GATACTGATA GCAACTAGCC ATTGTCGCCG AAACCTGTGG TCCTTTCACT TGTATCCCCG CTATCCTTAA TGAATATCGA ACTCCTGAAC TTAACGTTCG AAACGGTTGT CTCAAGGCTC TTGCTTTCGT ATTCGAATAC GTCGGCGAAA TGTCCAAAGA TTATATCCAT TCAGTCGTTG GGTTGCTCGA AGATGCTCTC ACTGATCGAG ATCACGTCCA CCGACAAACG GCTTGTGCCA TTGTCAAGCA CCTGGCGATT GGTGTCGCCG GCTTGGGTTA CGAAGAGGCG CTCACACATT TGTTGAACCT TGTCTGGCCA AACATTTTTG AAACAAGTCC ACACGTTATT GGTGGTGTGA TGGATGCTAT CGAAGCGATG AGACTTGGTA TTGGATCTGG TGTCGTGTTG AGTTATGTGT TGCAAGGGTT GTTCCACCCT GCCAGGCGAG TAAGGGAAGT TTACTGGAGG ATGTACAGTG AGTATCTGTT TTTTATTGCA ATGGGGAGCA AGGCTAATTA CGTCTCCGTA GACACTTTGA TTCTTGGATC ATCAGACGCG ATGGTACCCT TCTACCCTGC TTTGGGTTCA GAGTCCGATT TGGCATCTGG ACAGGATTAC ACAAGGCATC AGCTCATGAT GTGGATCTAG AAAGGGGAAT GGGATGGAGT AATAGTGCGA AATTGAAATG CATGTTGTAT AATGTT
|
Protein sequence | MSDEEYPQLQ SYTAPQDILD EHAELADEDE TPDAFQDKAK SKQVAARQSD YHLRRFNRTD GQGEGEDESY EERMRRINLE KEEEKVRRYK EKMEKEEKER GEMRVDDKTP PRELTGGDTP PRKALIGDET PPRAQAGDVT PPRKKRRWDT EPEVKQEVKE EVKEEVKEEE PKKRRSRWDQ TPAEAPPEKK RSRWDQTPAQ TASSSVLAAP TNLAKPSGIV LVEDKRYRRM TDEELDSLLP GSEEGYVVVP VPDDYHPAPS VRKMVPQQAE AGFMMQDETD AARARAAAGG LQGTTEQTEI DGIGTLQFLK PEDTQYFAKV LGEGGGEEND AEYTVEELKE RKIMRLLLKI KNGTPPVRKT ALRQITDRAR EFGAGPLFDK ILPLLMERTL EDQERHLLVK VIDRVLYKLD DLVRPYVHKI LVVIEPLLID EDYYARVEGR EIISNLAKAA GLAHMISTMR PDIDHVDEYV RNTTARAFSV VASALGIPAL LPFLRAVCRS KKSWQARHTG IRIIQQIAIM SGCAVLPHLR NLVDAIADGL KDEQQKVRTM TALALAGLAE AAAPYGIESF DNVLKPLWLG IRQHRGKTLA AFLKAIGYII PLMDPEYAGY YVRECMPILI REFQTSDEEM RRIVLQVIKQ CASTEGVTPS YIKEEVLPEF FKAFWVRRMA LDKRNYKQLV ETTVELANKA GVAEIVGRTV NDLKDESEPF RKMVMETITK VVSNIGAADV DERLEVLLID GIIYSFQEQT FEDTVMLDGF ATVVASLGPR VKPYLPQIVS MILWRLTNKS AKVRMLAADL TTKLAPIIKS CKEDVLLSKL GVVIFEQLGE EYPDALGSLI AAEGAIANVV GMTEMNPPVK DLLPRMTPIL RNRHEKVQEA TINLIGRIAD RGAEYVPAKE WMRICFELLD LLKAHKRAIR RAAVNSFGYI AKAIGPQDVL SVLLTNLKVQ ERQSRVCSTV AIAIVAETCG PFTCIPAILN EYRTPELNVR NGCLKALAFV FEYVGEMSKD YIHSVVGLLE DALTDRDHVH RQTACAIVKH LAIGVAGLGY EEALTHLLNL VWPNIFETSP HVIGGVMDAI EAMRLGIGSG VVLSYVLQGL FHPARRVREV YWRMYNTLIL GSSDAMVPFY PALGSESDLA SGQDYTRHQL MMWI
|
| |