Gene Tmel_1226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmel_1226 
Symbol 
ID5298069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermosipho melanesiensis BI429 
KingdomBacteria 
Replicon accessionNC_009616 
Strand
Start bp1238626 
End bp1240551 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content28% 
IMG OID640769502 
Productglycoside hydrolase family protein 
Protein accessionYP_001306464 
Protein GI150021110 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.459632 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTATA ATTATTTTGT TTTTTATAAA TTATATCAAA AATTAAACCC AGAATCGTTA 
ACTTTTTCAA ATTTCTTTAA AATGTTATAT AATGAAAATG AGGTGATAGA TATGATATTA
ATTCCTAGGC CAAAAAAGTT TGAAATAAAA GATGATGTTT TTGAATTCAA ATCTGGAGAG
TATATTTTTT TATCAACAAA GAACTTATTT AAAAGTGCAG AGAAGGTAAA AAGGGTGTTA
AAAGATTTTG GCGTTGAACT TTTTATTTCA TACTATAAAG GAAGTAAAAC TTCTATATAT
GTTGAGGTAG AAAGGGGAAA AATAAAGGAA AAATCAGGGT ATAAATTAAT AGTTTCTTCA
GATGGCGTAA AGGTATATGG TAATAATAAT CTTGGAGTTC ATTATGGACT TATGACTCTT
GTTCAGATAA TAAGGAACTA TGGTAATACA ATTCCTTTTA TGGAAATACA TGATTGGCCG
GATATAGAAA ATAGAGGAGT ATTGATTGAT ATAAGCAGAG ATAAGGTGCC TAAATTAGAA
ACATTATATT ATATTGTTGA TCTTTTGTCG GAATTAAAAT ATAATCAATT CCAGCTATAT
ACCGAGCATA CATTTGCATA TCGTGAACAT GAGAAAGTTT GGAGAGATTA TTCACCATTT
ACTAGTGAAG ATATTATTAA ACTTGATAAA TATTGTAAGG AAAGATTCAT TGAACTTGTT
CCTAATCAAG CTTCATTTGG TCATATGGAA AAATGGTTAA GCCATAATGA ATATAGTTAT
TTAGCTGAAA CGTTTGAATT TAATACTTCA TGTGGGGAAC ATTATAGTCA TCCTTTTACA
TTGTCACCTG CCGTTAGTGA TTCAATTGAG TTTTTAGATT CTTTGTATAG GGAATTACTT
CCGCATTTTT CGAGTAAACA TTTTAATGTG AATGCAGATG AAACGTTTGA TTTATGTCAA
GGTAAATCAA AACCATTATG TGAGAAACAC GGTAAAGGAA AGGTTTATCT GAGTTTTGTT
TTAAAGATAT ATAATTTGGT AAAAAAATAT GGAAAAAAAA TGATGGTGTG GGCGGATATA
TTAAAAAATT ATCCAGAATT ATTTAATGAT TTACCTAAAG ATGTTGTGTA TTTAATTTGG
GGATATGAAA AAGATCATAA TTTTGATGCT GAGTGTGAGT TATTTAGAGG ATTTGAATTT
TATGTATGTC CAGGTACTTC GAGTTGGAAT TCTTTTTTAG GAAGACTGGA AAATGCGCTT
TTAAACATAA AAAATGCAAC AGAAAGTGGA TTGAAATATG GTGCTAAAGG AATTTTAGTT
ACGGATTGGG GGGATAATGG TCATTGGCAA CATTTACCTA TTTCTTTTCC CGGATTTTTT
TATACTTCAG CTGTTTCATG GGGTTTTGAT AAGAATCAAG ATATAGATCT AAAAGAAGCT
TTGTCTTTAT ATTTTGGAGA ACAAGTTTCT GAAATATTAT TGGAAGTTGG TAATGCTTAT
AAATTACTTG AGTTTGAAGT TCCAAATTCT TCTATTTTTG CATTGGTTTT CATAAAGCCT
GAATTTATAA ATAAAAAGTT TATTTCTAAG TTAAAGATAG AAGCTCTTGT AAAGACTAAA
GATTATTTAG AAGAAAGATT ATCAAAGATA AATATCTTGA ATGATAATTT AATAAAATAT
GAATTAAAAA ATGCCATTGA ATTTTCTATC TTAGCAATTG AAATACTTTT ATCAGCAAGA
AAATCAAAAT TAGATAGCTT AGAAGGTATT TCGCCGGTTG CAAGGGAAGA ATTTTCAATA
AGATTAGAAA AATTGATTAA TGAATTTGAA AAAATTTGGC TTAAGAGAAA CAAAATGGGA
GGTTTAAGCT ATAGCATAGA AAAGTTAAGA AAGATCCAAA ATATTTTTAA GGAGTTAAAA
AAATGA
 
Protein sequence
MSYNYFVFYK LYQKLNPESL TFSNFFKMLY NENEVIDMIL IPRPKKFEIK DDVFEFKSGE 
YIFLSTKNLF KSAEKVKRVL KDFGVELFIS YYKGSKTSIY VEVERGKIKE KSGYKLIVSS
DGVKVYGNNN LGVHYGLMTL VQIIRNYGNT IPFMEIHDWP DIENRGVLID ISRDKVPKLE
TLYYIVDLLS ELKYNQFQLY TEHTFAYREH EKVWRDYSPF TSEDIIKLDK YCKERFIELV
PNQASFGHME KWLSHNEYSY LAETFEFNTS CGEHYSHPFT LSPAVSDSIE FLDSLYRELL
PHFSSKHFNV NADETFDLCQ GKSKPLCEKH GKGKVYLSFV LKIYNLVKKY GKKMMVWADI
LKNYPELFND LPKDVVYLIW GYEKDHNFDA ECELFRGFEF YVCPGTSSWN SFLGRLENAL
LNIKNATESG LKYGAKGILV TDWGDNGHWQ HLPISFPGFF YTSAVSWGFD KNQDIDLKEA
LSLYFGEQVS EILLEVGNAY KLLEFEVPNS SIFALVFIKP EFINKKFISK LKIEALVKTK
DYLEERLSKI NILNDNLIKY ELKNAIEFSI LAIEILLSAR KSKLDSLEGI SPVAREEFSI
RLEKLINEFE KIWLKRNKMG GLSYSIEKLR KIQNIFKELK K