Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF03620 |
Symbol | |
ID | 3258257 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 1062994 |
End bp | 1065385 |
Gene Length | 2392 bp |
Protein Length | 586 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257481 |
Product | Beta-hexosaminidase precursor, putative |
Protein accession | XP_571630 |
Protein GI | 58268948 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCTTCA GTGGCCTGCT CGAAGTTCTC ACCTCGTCTC TACCGTTCCT TGCTCCTTCG TCTCCTCTTT CGGCCAAACC CGATATCAAT GTCGTCCCTT TGCCTAGGCA TTACATCATC GGGGATGGGT CGACGCCTGT TTGTCTTTCT ACCAATTTCA GCATACAAGC AGCCCCCTCA TCTTTAGCCA CTTTCCCGAC TGACCTTCAA GATGCCATCA CATCAACTCA ACACCGCTTG AAGAACACCC AGGTAACGTA TCTTTCGCCC AACGAAGGAT CAGAGTTTTT CACCGGCGGC TCTGGTGCTA TTAGATCTTG CGCATACTAT CTCGACACCT TGCACATTGA TTTCACTGCC TATAATGGTA CCGATATCCT CTCGGAAACC GTTGCACCCG TCGAAGAACG CGCCGAGCTC GAGGCATATA CGCTTGATCT CTCTCTCAAG GGGAAAGCGA CGATCAGCTC TCGAGGGGCT TTGGGTGCGT TCAGAGGTCT CAGCACCTTC GAAGGCCTCT TCTACAGCCT TGAGGCTGGA GTTCAGGGAT CGGACAGAGT GTATGCTCCA CTCGCTCCTT ATCATATTGA AGACAAGCCA AGTTTTGGCT GGCGTGCAGT ATTATTGGAT ACCTCGAGGC ATTACTTTTC CGTTCCATCC ATCCTGAAGG TGAGTATGCA AGGCTTCAAG CAAAAGGCCT TGCTAATACA GCAGAACAGA TACTGGATAC GATGTCCATG GTTAAGCTCA ACGTCTTCCA CTGGCACGTC ACGGACTCCA ATTCATGGCC TTTAGATCTT GACAGCTATC CAGAACTCGC AGCCAAAGGA GCATCCTCTC AGTCTGAGAG GTATAGCCAG AAAGATATGC AAATGATCAT TGACTATGCA GGCCACGTAA GCCTTTCACT CAGAAAATGA GAACTCGGGC TGACCAAAAG CTCGGCAGAG AGGCATTGAC ACCCTTCTCG AGATTGACAC ACCGGGCCAC ACTGCCTCTA TTGCTCCTTC GCATCCCTCC TTCGTGGCAT GCTTCGAGTC AACGCCATTC AAACACTTTG CTCACCAGCC TCCAGCAGGA CAATTGCGAT TTGCCGACGA GAAGGTGACA GAGTGGACGG CTCAGCTCCT GCGGGAGATC GGCAGTCTGT CCAAAGGAGG ATATTTCAGT ACGGGAGGGG ATGAGATCAA TATGAACTGC ATGGTAAGGG ACAATCATGA ATACTTTTCC GATATTCATT ATGTGTGTAG TTGGAAGATA TGCCTACGGC GTCTAAGCTG AAAGCCAAAG GCTGGACGTT GGATGACGCC TTGGATCATT TTACTGAAAA GACACATGCC CCCTTGAGGC AGGCAGGAAA AACTCCAGTG GTCTGGCAAG AGATGGTAGG CCCTTGTTGA TTCTCTTATC ACCCACCGAA ATCTGATCTG GATGGTTCAC AGGCGCTCAA TCACGGGACG ATGTCTTCTC TTACTAATGA CACCATTGTT GATATCTGGG TCAACTCTGC GGACGCTCGC AAGGTTCTGG ACCAAGGATA CCGTATAGTC CACGCTTCGG CAGACTACTT TTACCTGGTG AGTGTGCCAG CTGTTGCTGT TACGTGAATT ATGGCTTAAA ATGAGCTCCT TAGGACTGTG GACAAGGCGG ATGGATAGGG GAAGAGGGAG GTAATAACAG CTGGTGTGAT CCCATGAAGT CTTGGGCAAG AATGTATTCG TGAGTTTATA TTGAAATCCA AAGCATATTT ATTTTATCCT AACCAGTATG TTTTTAGTTT CGACCCTTTC AAAGATGTCA AAGACGAAGA GAGGCACTTG GTTTTGGGTG GTGCGTCAAC GCATTCATAA CGGGGGACAA GCTAATTTGA TTGATAGGTC AAACATCGCT CTGGACAGAG CAGGTAAACT AGCCTCCTGT CCTTGTACAT TAATTGAGAC TGACGTTAAA CACAGACGGA CGAGACGAAC TTGGAGCCTA CTCTCTGGCC TAGAGCAGCA GCTTTAGCGG AAGTTTTCTG GTCAGGGCCA GGACCAGACA GCCGACCTCG CAGTCAGTCC CAATTGCCTC AATGTTTTTC CGTTCATAAC ATGCTGATTT TTCAAGTACA CAGGTTCAAA CAAGGCGCTA CCTAGAATGC ACGATATCCG TTATAGAATG GTGGGAAGAG GTGTGAGAGC TGCACCTTTG CAGCCTCGTT GGTGCGCCCT TCGTCCCGGT AAGTTGAGAG GAGGGTCCCC CCCAAATCCC TAGCAGTAGG CTGATATTCT TATGAATAGG TGCGTGTATT TTAGCTGCTT GAGGTGTTGA TGATTCTGTA TGGCTACGAT AAATTATACA TATAGCTCCA AATGGTCTAC TACGTTTTTT TGATACTTGT TTTTTTGAAT CA
|
Protein sequence | MLFSGLLEVL TSSLPFLAPS SPLSAKPDIN VVPLPRHYII GDGSTPVCLS TNFSIQAAPS SLATFPTDLQ DAITSTQHRL KNTQVTYLSP NEGSEFFTGG SGAIRSCAYY LDTLHIDFTA YNGTDILSET VAPVEERAEL EAYTLDLSLK GKATISSRGA LGAFRGLSTF EGLFYSLEAG VQGSDRVYAP LAPYHIEDKP SFGWRAVLLD TSRHYFSVPS ILKILDTMSM VKLNVFHWHV TDSNSWPLDL DSYPELAAKG ASSQSERYSQ KDMQMIIDYA GHRGIDTLLE IDTPGHTASI APSHPSFVAC FESTPFKHFA HQPPAGQLRF ADEKVTEWTA QLLREIGSLS KGGYFSTGGD EINMNCMLED MPTASKLKAK GWTLDDALDH FTEKTHAPLR QAGKTPVVWQ EMALNHGTMS SLTNDTIVDI WVNSADARKV LDQGYRIVHA SADYFYLDCG QGGWIGEEGG NNSWCDPMKS WARMYSFDPF KDVKDEERHL VLGGQTSLWT EQTDETNLEP TLWPRAAALA EVFWSGPGPD SRPRSSNKAL PRMHDIRYRM VGRGVRAAPL QPRWCALRPG ACILAA
|
| |