Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG02640 |
Symbol | |
ID | 3258607 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 740445 |
End bp | 743839 |
Gene Length | 3395 bp |
Protein Length | 868 aa |
Translation table | |
GC content | 55% |
IMG OID | 638257886 |
Product | conserved hypothetical protein |
Protein accession | XP_572021 |
Protein GI | 58269730 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACGTTTCT CCTGCGAGCA GTATCCAGCC CAGTACGTGT ACGACAGACC GAGTGCGCAC CGCCTCGCCT CCACATTCCC GCTTCCCACC GTTCCCTCGT AGCAACAATA TTTCTCACCT TGCTCATCCC CACCGCCGAG CGCATCTCCC ATGTCCCCCG CCCCTGTAGC ACCCATCCGC CAGCCGCGCG GCCCGTCCAC AAACTCCTTC TCACAGGCCG CCTCCCCCCC GTCATCCACG TCGGCCGGCA ACTCGGGCAC TGCAGCAGCC ACAGGTCCGC CCGCCCCCGA CACAGACTCG TTGCGGCGGG CGAACACCGT CTCCAACCCT CGCCACCAGC CGTCCGCATC CATCTCTGCC AGCTCCGCCG CCTCGTCCGC TTTCCATGGC GTCCCGCCAT CCGCCGCCAG CACATCGGCC AGCTCCAGCA GGGGCGTCAG AGGCGTCAGC CGTTTCAGGA GTGGATCGCT GTCCAACGCA TTGCCAGACC CGGGCCTAGT AAGAAGGGCC AGTGGGAGAG AGGTAGCGAA GGAAGTAGTG CTGGAGAATG AAGGCGAGGG CGAAGGCATC GAGGCGGCGA GCTGGGGCAA GGGTCTGGGC AGGCAGAGCA GCCTGCCCTC CAGGCGAGGT ACATTCTGAG CAGTCCGGCA GTCGTTCCAG GAGCTGACAT CCAGTTAGGT CTCAGCCTTG CTCAGGCTGC CAACCAGGGC GCTCCTGAAC CAGTACCGCC GCCCCGCCCG CCACGCCGCA TCGCTATAAA CACCACCACT GTATCCGGCC CCACACCGCC CGTTTCTGCC CATTCGGCAT CGCACTCTCT GTCCTCTCTC TCCATGCTCC ACCGTCCGAC CTACTCTTCC CAACTGTCCC AGGACGAACC TATCCCTTCA GCAACAAGCG GTGGCGTATC GCGCACGCAG AGCCTGAGGG CACAGGCCAA ACATAGTGAT GTCGGCGGAC TGGGAAGAAG CATTAGTCTC AAGGCTGTAG GCGAGGTGTG TTGATTGTCT GCCGACAGTG AGCAAAGGCT AACCTGCATA GCATACGTAC CGCCAGTCAA CGAGCCAGCA GGCGAGTAGC GGCCAGTCTT CAGGTGTCAT TTCCTCTCCA TCTTCTGTTG CATTCTCTCC TCCCGCCCCT CCGCCCGTTT CTTCCACAGC AGCGCTCCCC CCGTTCCAGC TTCCACTTCC TGACTCCTCC TATGTTTCGA CAACGACGGG CGGCGCGGAC GTCAAGCGCC ACCAAAGTCT TACTCAGGGG TACGGCTCAT CGTCCCGCGT CCGTGATCGA TTGGAGCGCA GTCCCGCCAT CACGCTTGAG CCCCGTGAAC CTCGCGATAC GTCGGGCGTG AGGGTGAGGA TCGATACCAG GTTTGAAGAC GAACCGCCTA CGAGCCCCAT TGGTCCCAGC GTGTGGTCCA ACACCCTCTC CAGGACGGAT GACGGTTGGG GCTCTGCGCC TCTGAACCAA AGCGCCAGGA ATGCTTCGCA GCAGCTGCAA GACGCCTTTG AAGCAATGAC TCTTGGACGG AATATGATGG CTCCCGACGG TAACGCAATG CTTGCGAATG ATTTCGATCC CAGAGAAAGG CTCAGCCTGG AGACAGACAT CATGCGTCAC CCACAACCAT CCTCCAATAA TACTTTACCG TCCGGTGGGC AAGGTCTCCA GAGGATCAAC TCTGGCGCCG AAGAACCTAG CTGGGTGAAC AAGCTCGTGG GAGGTGATTC AACGCCCATG GTCGGCAACG TGATGCCTCG CTCCTCCAGT GCGCTCGGTT GGAGCGAACG TGCGCGCCAG AATGAGACTC AAAGGTTCCC GTACGGGGCT TACAACATGC CCTTTGGCAA TGGATTTAAT ATGGGTCTAA AGCCTCAAAT GGGTCAAATG GGCATGCCTC AAATGGGTCC GGTGGGTATG AATGGTATGC CTATGGGAAT GCCCATGGGT ATGGGCCCCA TGAACATGGG CTTCCCGATG GGGATGAACA TGGGTTTCGG GCAGATGGGC GCAAACGGAT TTGGTCAGTT TGCCCCTGGC CATCCCGGTC AAGCTCAAAA CCAGCATCCC GGTCAAGGAT ACCCCCCACC ATCTAGTTCG GCTGCACCCA CTGGCCACGG CATGGGCGCG CAGGATAGGG AAGTGATCGA GTTGGCCAGA AAGAAGGGAT TAAACCCGGC CACATTCAAC TGCAAACCTC AAAATGCCAG ATTCTTTGTT ATCAAATCTT ACACTGTAAG TGTTGTTGCC ATAACGAAAT TGCAGAATAC GCAGCTGACG GATTTTGAGC AGGAGGAAGA CGTTCAGAAA TCGCTCAAAC ACGAGATTTG GTCTTCTACC GTCCTCGGCA ATAAACGTCT GGACGCTGCG TACAGAGAAA CGGCAAACAA GGGTCCGATA TACTTGTTCT TCAGCGTCAA TGGGTCCAGG CATTTCTGCG GTGTGGCCGA AATGACAACG CCGTGAGTCT AGAGTCCCTG ACCAGGGAAC CATCTCGTCG AAAAGTCCAT GTTATAGTAC TGACCGCCTA GTAGTGTGGA TGAGACAAAG ACGTCCAAAG TGTGGGCACA GGACAAGTGG AAAGGTATCT TTGAAGTCAA ATGGATCTTT GTCCGTGATG TTCCCTCAGC GTACGTCCTA TTGCTCTAGT GTTATGCTGC TTCTTTCCGC TTATACCCAT TGTTAGCGCC CTTCGACATA TCCGACTGAC CAACACTCCC GAGTGCAAAC CCATCACCAA CTCGCGCGAC ACGCAAGAAT TGCCCTATGA AGCTGGTACA GAGGTCCTAC AGATCTTTTT GGACCACCAA ACCAAGAGCA AGACTAGTCT GTTGCAGGAT TTTGCTTATT ACGAGGTTTG TCTTTTTGTT TTCTTTGACT TCTGTATATA ATACTGACGG ATGTTCATAG CAACTTTCTG TAAATCGCCA AAACCAAAAC TCTCCTCAGA ATGGCAACAG CCAGAACCAG CAGCCATCCT CGTATCAGCA GTTCCAACAG AAGCCAGGGA ACGCTATCGC TCCTCCGATG AGCAATAGCC ATTCTGGTCC CACTATTCCT CCTGTTCCCG CCATCCCTGA TAAATTCCGA TAAATCCAAA GTATACTTTA TCAGTCTTGG TTTTTTTTTT TACTACCGAT TGGAACGACC TTGCAATGAA CAATATGAAC GTTCGGGAAA CCAAATCTGG GTAATAATAT TCTGTGCAAA CTTTTACCGA ACGACCGCAG TTAATGGATA AAGAAAAGGA ACGTTATATG TGAAAGCGAA AATCGTATTA ATTCAAAGTA ACATGTGTGG AATTGGAGAT GAATAGTACA TGGGTCTCTT ATTATAGAGC TGGATATTTA TGTGTGTAGC TGTCGGATTA TGGTGAGATG TATAAATTTA TCTTG
|
Protein sequence | MSPAPVAPIR QPRGPSTNSF SQAASPPSST SAGNSGTAAA TGPPAPDTDS LRRANTVSNP RHQPSASISA SSAASSAFHG VPPSAASTSA SSSRGVRGVS RFRSGSLSNA LPDPGLVRRA SGREVAKEVV LENEGEGEGI EAASWGKGLG RQSSLPSRRG LSLAQAANQG APEPVPPPRP PRRIAINTTT VSGPTPPVSA HSASHSLSSL SMLHRPTYSS QLSQDEPIPS ATSGGVSRTQ SLRAQAKHSD VGGLGRSISL KAVGEHTYRQ STSQQASSGQ SSGVISSPSS VAFSPPAPPP VSSTAALPPF QLPLPDSSYV STTTGGADVK RHQSLTQGYG SSSRVRDRLE RSPAITLEPR EPRDTSGVRV RIDTRFEDEP PTSPIGPSVW SNTLSRTDDG WGSAPLNQSA RNASQQLQDA FEAMTLGRNM MAPDGNAMLA NDFDPRERLS LETDIMRHPQ PSSNNTLPSG GQGLQRINSG AEEPSWVNKL VGGDSTPMVG NVMPRSSSAL GWSERARQNE TQRFPYGAYN MPFGNGFNMG LKPQMGQMGM PQMGPVGMNG MPMGMPMGMG PMNMGFPMGM NMGFGQMGAN GFGQFAPGHP GQAQNQHPGQ GYPPPSSSAA PTGHGMGAQD REVIELARKK GLNPATFNCK PQNARFFVIK SYTEEDVQKS LKHEIWSSTV LGNKRLDAAY RETANKGPIY LFFSVNGSRH FCGVAEMTTP VDETKTSKVW AQDKWKGIFE VKWIFVRDVP SAALRHIRLT NTPECKPITN SRDTQELPYE AGTEVLQIFL DHQTKSKTSL LQDFAYYEQL SVNRQNQNSP QNGNSQNQQP SSYQQFQQKP GNAIAPPMSN SHSGPTIPPV PAIPDKFR
|
| |