Gene Information |
Locus tag | CNI03210 |
Symbol | |
ID | 3259533 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | - |
Start bp | 871322 |
End bp | 874555 |
Gene Length | 3234 bp |
Protein Length | 991 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258813 |
Product | hypothetical protein |
Protein accession | XP_572927 |
Protein GI | 58271542 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.353593 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTTATTCA ATCGTCTAGA CCCAGACACG ATGTCAGTGC CCTACGAGGC CCATAATTCT CCACCACCGC CGCCCCCGAA GCATGCCTCA TCTTCGTCTT CCGGCAAAAA GGCAAAGCAA AAACGACCAT CTTCCTCGAA TGGCACTCCC GATCAAAGCT CGGCAACGGC GGTAAGGGCT TTCTCGGCTT GCCGGAACTG CCGGAACAAG AAGGTGAAAT GCCTTCCTGG ACCACCCCTT CCCAGTTCGA GTATCACTTT GCCAGCATCT CCGACTTCTC TGGAAAATCC CGGGGCGAAC CTAGGACCTT GTCAACAATG TCTTCAATCT GGTGCCGAGT GCATTTATCC CCCGACTAGA GATAGAGCTG CGTACAGCAG ACAATATGTG GCGAATTTGG AAACAAGGGT ACAAAGTCTT GAGATGGTGC AGGCGAGATT GATGCCTTTA CTCGAAACTT TCGAGGCGAG TACGCATAGC GGAAAGCCGA TGCCTATGCC GCTTCCTCCG GTACCGGCCA GAAGGACTGA AACACCAGCT CAAGAGATGA ATGAAGACGT CGAAGAAGGA GAGGATGAGA TTCCCGAGGA TAGCGCGATG CAACCTGCTT CAGACAGTGA AGATGCTGGT CAGATCACAC AAGATGATAG GGGAAACTAT CGATGGATAG GTTCGTCGAA TACACTTTCT CTTCTTGACT CCTTCTCTGG TCGGCAATTA GCAAGTGGTC GACCATTGCC TTCTCGTACG CAATCATCTA CAACCCAAAT GGATGCAGCC ATTTCCAGGG ATCGCACACC GTCAAACATC GCGTCTACTC CTACTCGAGA ATCAAATCCT TATTTTGGTC CAGTGGCTGG TTCAGGTGTC GTCAAAGCCT TACCTCCTGT TGATGAGGTG CAGTATCCTT CTGCAGAGAA GTCACTGGAA ATGGTCGATG CTTTCTTTCA AGAGGTGCAT CCTTGCTTAC CTGTCCTTTT GGAACACGAA TTTAGAAGAG ATTTCCGGGC GCTGATGGAG GCAAGGGCCA GAGGTAATCT CTCGTGGGGC GGTGGAGTGA GTCATCCGCT TGGAGCCAAA CAGTCGCCAC GTTGACGGTG TGTCTAGTTT ATTTCAGTCG TGTTTGCCAT ATTTGCCCTG GGTGAAAGAG TAATTGTCAC ATCAAGAGCG TGGAGGAGAG AAATGGCTAA GGCTGAAGGT GATGATGATG ATCATGAGAC TGTCTTGCCT GGTGAGGCAG AGGCCGGTGT AATCTGGTAT GAGAGGTAAG TGCTGCTAAG CTTGCGGAGA GATGCGGGAT AATAATGCTT CTGCTTCAGA GCTCAAATCT TACATTACAC CACTTTAAAA GACGTCAACA TCCACCAGGT CCAATGCCTT ACTCTTCTCG CTGCATTCCA AGCAAGTGTA AATGCTATGC CCATGTCATG GCTTCTTGCC GGACAGGCTA TCCGTGTAGC TCAAGACTTG GGTTTACATC GATCAACCGC CCGGCTTCCC TTATCGTTTG CAGAAAAACA GCTGCGTTCA CGATGTTGGT GGGCCATCTA CGGTTTGGAG AGGATGATGT CAATTTCTCT CGGCCGGCCG CTGGGTGTGG ATGATCTTGA TGTGGATGTA GCATACCCGC TGGAGGTTGA CGATGCCGTG CTGGAGAAGA TGGCGATGGA GAACCTGCAA GCTTTACCGC CTGAATTCGA GAAAGAGCCC GAGGCTTCGA CAATGAGTGG GTTCATCGCG CTCACAAAGC TCTGCAAGAT TGCTGGGCGA GTTGTGCATC TGCTCTATCG GCCTTCAAAT 
GGAAGGTCGG TGAGTGATCC TTCGTGGGCG GTACAGCAGC AGAATGCGAT CAATAAATTA GACAAGTTAC TTAGAGATTG GTTAGCAAAC GACGTGGTAA GTTGTCACGT GCACTGTATT CATGGGTAGC TGACTAGTAA TAGCCTTCAA AATACAAAGA TCCTTCAGAA ACGCATTCGG TATCCCTTCT TTCCGCCATT TTATCCAACT CTTACTTTAC TGTTCTCGTC ACCCTTCACC GAAACTTCTT GCCCTCATCA CCCGATTATC CTCGACCTAA ACCTCCTCCC TCTTCTCAGT CGCTCGCTCA TTGCGTCGAC GCTGCCAGAT CTGTCATCCA CATCGCCTCT CAATCTCGCA CCCTCGTACC ACCCTCTCAT CATCTTGCAA TGTATTGTCA ATACTTGTGG TCGTCAGCAG TCATCCTGCT GTTATGTGAG ATCCAAGCGA GAGATGAAGT GGTCATCGAA GCGGTCGGTT CGCAGGTGGA GGCTTGTAGG AAGTGCTTAC AGGCTTTGGA ACCTGTTTGG CCTGGGTCAA AAAAGTTGAA GGAATTGTTA AACGATGTGG CCAGTCGTGC GAAAGAAGTG ATGGTTTCAA AGTCATCAGA CAAAAAGCGC AAGTCATCTG CGCATAAGGA CAAGGACAGA GAGAGGCAGA TGCTACATCC TTCCCAAAAC CATCAGTCTC GACCGTCAAC CGACTCGCCT ATCCCGCAGC ACGCTGAGAA TCAGTGGCCG TCTCACTCTG TTTCTCCACC AGAGAAGAGA CAACGTGTGT TTGAACTTTC TGACACTCGC ACAGCTTCGA ATGAGGATGA ACCGGCTCAA AACCCCCAAG CATATTATTC AGTGTATCCT ATGACCACGC CGATAACGTC TCAGGCTTCA CTCCAGTTTA CTGAGCCAAT GCCTACTTAT GATATGGTAT TCGACCTTGG AGGAGTCACC TTTGACGGAT TAGAGTTGTT GCAAGGCTTT AGCGGGGGCG CTTCCAATTT TTGGAATAAC TTCAACTTTG GCATGGATGG AGCTGGCAAT GGTAGCGCTT CTGGAGGCTC TGCGCCAGTG GCAAGAACCG GGCAGCAATT CCTGCCCTCT GGTCAATTGA CACCCAACTC TAATGGTGAC GGTTCGAGGC CTTCGTCATC CAGCTGGCAA GGGCAGCTAT CACGGATGGT CAGCCAGAAT GGGCAGCAGT ATGTGCAAGG ACAGGGACAG GATGGATATA ACGGCGCAGG GAGTGGAAGC GGTGGCGTTG GGACGCCTAA TGCGCATCGG GGAGCAACAT TCTGGGAGCA AGTGACTGGG AGTACATTCG ACTGGCAGGC AGACCCAAAT GTGCCTTTCA ACATCTAGCT TCGAATTCAT TTACTGCATC CGTATCTTCA AATCATACTT ATTGTATTTT AGATTTGTAT AATTTGTGAT TCTG
|
Protein sequence | MSVPYEAHNS PPPPPPKHAS SSSSGKKAKQ KRPSSSNGTP DQSSATAVRA FSACRNCRNK KVKCLPGPPL PSSSITLPAS PTSLENPGAN LGPCQQCLQS GAECIYPPTR DRAAYSRQYV ANLETRVQSL EMVQARLMPL LETFEASTHS GKPMPMPLPP VPARRTETPA QEMNEDVEEG EDEIPEDSAM QPASDSEDAG QITQDDRGNY RWIGSSNTLS LLDSFSGRQL ASGRPLPSRT QSSTTQMDAA ISRDRTPSNI ASTPTRESNP YFGPVAGSGV VKALPPVDEV QYPSAEKSLE MVDAFFQEVH PCLPVLLEHE FRRDFRALME ARARGNLSWG GGFISVVFAI FALGERVIVT SRAWRREMAK AEGDDDDHET VLPGEAEAGV IWYERAQILH YTTLKDVNIH QVQCLTLLAA FQASVNAMPM SWLLAGQAIR VAQDLGLHRS TARLPLSFAE KQLRSRCWWA IYGLERMMSI SLGRPLGVDD LDVDVAYPLE VDDAVLEKMA MENLQALPPE FEKEPEASTM SGFIALTKLC KIAGRVVHLL YRPSNGRSVS DPSWAVQQQN AINKLDKLLR DWLANDVPSK YKDPSETHSV SLLSAILSNS YFTVLVTLHR NFLPSSPDYP RPKPPPSSQS LAHCVDAARS VIHIASQSRT LVPPSHHLAM YCQYLWSSAV ILLLCEIQAR DEVVIEAVGS QVEACRKCLQ ALEPVWPGSK KLKELLNDVA SRAKEVMVSK SSDKKRKSSA HKDKDRERQM LHPSQNHQSR PSTDSPIPQH AENQWPSHSV SPPEKRQRVF ELSDTRTASN EDEPAQNPQA YYSVYPMTTP ITSQASLQFT EPMPTYDMVF DLGGVTFDGL ELLQGFSGGA SNFWNNFNFG MDGAGNGSAS GGSAPVARTG QQFLPSGQLT PNSNGDGSRP SSSSWQGQLS RMVSQNGQQY VQGQGQDGYN GAGSGSGGVG TPNAHRGATF WEQVTGSTFD WQADPNVPFN I
|
| |
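The Start bp, End bp, and Gene Length fields in the Gene Information table are consistent with each other if the coordinates are read as 1-based and inclusive (874555 - 871322 + 1 = 3234). The sketch below checks that relationship and shows one way a GC percentage could be computed from a space-separated sequence string like the one in the Sequence table. The function names `gene_length` and `gc_percent` are hypothetical helpers introduced here for illustration, and the inclusive-coordinate convention is an assumption inferred from the values in this record, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(seq.split()).upper()  # drop the 10-base block spacing
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information table of this record.
    print(gene_length(871322, 874555))
    # Toy input only; paste the full Gene sequence field to check the real value.
    print(gc_percent("ATGC ATGC"))
```

Strand does not affect either calculation: the length depends only on the span of the coordinates, and GC content is the same on both strands.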