Gene CNF03620 details


Gene Information

Locus tag: CNF03620
Symbol:
ID: 3258257
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 1062994
End bp: 1065385
Gene Length: 2392 bp
Protein Length: 586 aa
Translation table:
GC content: 48%
IMG OID: 638257481
Product: Beta-hexosaminidase precursor, putative
Protein accession: XP_571630
Protein GI: 58268948
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
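
The gene length listed above follows from the start and end coordinates if they are read as 1-based and inclusive, which is an assumption about this record's coordinate convention. A minimal sketch of that arithmetic, with hypothetical names (`gene_length`, `coding_bp`) that are not part of the record; it also compares the span with the number of bases needed to encode the listed protein length plus a stop codon:

```python
# Values copied from the record; names are illustrative only.
start_bp = 1062994
end_bp = 1065385
protein_length_aa = 586


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_bp(protein_aa: int) -> int:
    """Bases needed for the protein's codons plus one stop codon."""
    return 3 * (protein_aa + 1)


length_bp = gene_length(start_bp, end_bp)
noncoding_bp = length_bp - coding_bp(protein_length_aa)

print(length_bp)      # 2392
print(noncoding_bp)   # 631
```

The 631 bp not accounted for by codons would be intronic and/or untranslated sequence inside the span, which is in line with the record marking the gene as spliced.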


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCTTCA GTGGCCTGCT CGAAGTTCTC ACCTCGTCTC TACCGTTCCT TGCTCCTTCG 
TCTCCTCTTT CGGCCAAACC CGATATCAAT GTCGTCCCTT TGCCTAGGCA TTACATCATC
GGGGATGGGT CGACGCCTGT TTGTCTTTCT ACCAATTTCA GCATACAAGC AGCCCCCTCA
TCTTTAGCCA CTTTCCCGAC TGACCTTCAA GATGCCATCA CATCAACTCA ACACCGCTTG
AAGAACACCC AGGTAACGTA TCTTTCGCCC AACGAAGGAT CAGAGTTTTT CACCGGCGGC
TCTGGTGCTA TTAGATCTTG CGCATACTAT CTCGACACCT TGCACATTGA TTTCACTGCC
TATAATGGTA CCGATATCCT CTCGGAAACC GTTGCACCCG TCGAAGAACG CGCCGAGCTC
GAGGCATATA CGCTTGATCT CTCTCTCAAG GGGAAAGCGA CGATCAGCTC TCGAGGGGCT
TTGGGTGCGT TCAGAGGTCT CAGCACCTTC GAAGGCCTCT TCTACAGCCT TGAGGCTGGA
GTTCAGGGAT CGGACAGAGT GTATGCTCCA CTCGCTCCTT ATCATATTGA AGACAAGCCA
AGTTTTGGCT GGCGTGCAGT ATTATTGGAT ACCTCGAGGC ATTACTTTTC CGTTCCATCC
ATCCTGAAGG TGAGTATGCA AGGCTTCAAG CAAAAGGCCT TGCTAATACA GCAGAACAGA
TACTGGATAC GATGTCCATG GTTAAGCTCA ACGTCTTCCA CTGGCACGTC ACGGACTCCA
ATTCATGGCC TTTAGATCTT GACAGCTATC CAGAACTCGC AGCCAAAGGA GCATCCTCTC
AGTCTGAGAG GTATAGCCAG AAAGATATGC AAATGATCAT TGACTATGCA GGCCACGTAA
GCCTTTCACT CAGAAAATGA GAACTCGGGC TGACCAAAAG CTCGGCAGAG AGGCATTGAC
ACCCTTCTCG AGATTGACAC ACCGGGCCAC ACTGCCTCTA TTGCTCCTTC GCATCCCTCC
TTCGTGGCAT GCTTCGAGTC AACGCCATTC AAACACTTTG CTCACCAGCC TCCAGCAGGA
CAATTGCGAT TTGCCGACGA GAAGGTGACA GAGTGGACGG CTCAGCTCCT GCGGGAGATC
GGCAGTCTGT CCAAAGGAGG ATATTTCAGT ACGGGAGGGG ATGAGATCAA TATGAACTGC
ATGGTAAGGG ACAATCATGA ATACTTTTCC GATATTCATT ATGTGTGTAG TTGGAAGATA
TGCCTACGGC GTCTAAGCTG AAAGCCAAAG GCTGGACGTT GGATGACGCC TTGGATCATT
TTACTGAAAA GACACATGCC CCCTTGAGGC AGGCAGGAAA AACTCCAGTG GTCTGGCAAG
AGATGGTAGG CCCTTGTTGA TTCTCTTATC ACCCACCGAA ATCTGATCTG GATGGTTCAC
AGGCGCTCAA TCACGGGACG ATGTCTTCTC TTACTAATGA CACCATTGTT GATATCTGGG
TCAACTCTGC GGACGCTCGC AAGGTTCTGG ACCAAGGATA CCGTATAGTC CACGCTTCGG
CAGACTACTT TTACCTGGTG AGTGTGCCAG CTGTTGCTGT TACGTGAATT ATGGCTTAAA
ATGAGCTCCT TAGGACTGTG GACAAGGCGG ATGGATAGGG GAAGAGGGAG GTAATAACAG
CTGGTGTGAT CCCATGAAGT CTTGGGCAAG AATGTATTCG TGAGTTTATA TTGAAATCCA
AAGCATATTT ATTTTATCCT AACCAGTATG TTTTTAGTTT CGACCCTTTC AAAGATGTCA
AAGACGAAGA GAGGCACTTG GTTTTGGGTG GTGCGTCAAC GCATTCATAA CGGGGGACAA
GCTAATTTGA TTGATAGGTC AAACATCGCT CTGGACAGAG CAGGTAAACT AGCCTCCTGT
CCTTGTACAT TAATTGAGAC TGACGTTAAA CACAGACGGA CGAGACGAAC TTGGAGCCTA
CTCTCTGGCC TAGAGCAGCA GCTTTAGCGG AAGTTTTCTG GTCAGGGCCA GGACCAGACA
GCCGACCTCG CAGTCAGTCC CAATTGCCTC AATGTTTTTC CGTTCATAAC ATGCTGATTT
TTCAAGTACA CAGGTTCAAA CAAGGCGCTA CCTAGAATGC ACGATATCCG TTATAGAATG
GTGGGAAGAG GTGTGAGAGC TGCACCTTTG CAGCCTCGTT GGTGCGCCCT TCGTCCCGGT
AAGTTGAGAG GAGGGTCCCC CCCAAATCCC TAGCAGTAGG CTGATATTCT TATGAATAGG
TGCGTGTATT TTAGCTGCTT GAGGTGTTGA TGATTCTGTA TGGCTACGAT AAATTATACA
TATAGCTCCA AATGGTCTAC TACGTTTTTT TGATACTTGT TTTTTTGAAT CA
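
The gene sequence above is printed in space-separated blocks of ten bases, so the spacing has to be removed before the sequence can be measured or compared with the GC content field. A minimal sketch, assuming the sequence has been pasted in as a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the sequence:

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T bases; 0.0 for an empty sequence."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        return 0.0
    gc = sum(1 for base in counted if base in "GC")
    return gc / len(counted)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "ATGCTCTTCA GTGGCCTGCT"
seq = clean_sequence(first_blocks)

print(len(seq), round(gc_fraction(seq), 2))  # 20 0.55
```

Running the same two functions on the full sequence would give a length and GC fraction that can be checked against the Gene Length and GC content fields of the record.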
 
Protein sequence
MLFSGLLEVL TSSLPFLAPS SPLSAKPDIN VVPLPRHYII GDGSTPVCLS TNFSIQAAPS 
SLATFPTDLQ DAITSTQHRL KNTQVTYLSP NEGSEFFTGG SGAIRSCAYY LDTLHIDFTA
YNGTDILSET VAPVEERAEL EAYTLDLSLK GKATISSRGA LGAFRGLSTF EGLFYSLEAG
VQGSDRVYAP LAPYHIEDKP SFGWRAVLLD TSRHYFSVPS ILKILDTMSM VKLNVFHWHV
TDSNSWPLDL DSYPELAAKG ASSQSERYSQ KDMQMIIDYA GHRGIDTLLE IDTPGHTASI
APSHPSFVAC FESTPFKHFA HQPPAGQLRF ADEKVTEWTA QLLREIGSLS KGGYFSTGGD
EINMNCMLED MPTASKLKAK GWTLDDALDH FTEKTHAPLR QAGKTPVVWQ EMALNHGTMS
SLTNDTIVDI WVNSADARKV LDQGYRIVHA SADYFYLDCG QGGWIGEEGG NNSWCDPMKS
WARMYSFDPF KDVKDEERHL VLGGQTSLWT EQTDETNLEP TLWPRAAALA EVFWSGPGPD
SRPRSSNKAL PRMHDIRYRM VGRGVRAAPL QPRWCALRPG ACILAA