Gene Athe_0857 details

Gene Information

Locus tag: Athe_0857
Symbol:
ID: 7407432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 954155
End bp: 956668
Gene Length: 2514 bp
Protein Length: 837 aa
Translation table: 11
GC content: 37%
IMG OID: 643715235
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_002572745
Protein GI: 222528863
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
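
The start, end, and length fields in this record agree with one another, which can be confirmed with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon not counted in the protein length. The record states neither assumption, but together they reproduce the listed values. The function names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 954155
end_bp = 956668

def gene_length(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1

def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1

length = gene_length(start_bp, end_bp)
print(length)                  # 2514
print(protein_length(length))  # 837
```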


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGCGGA CTATAAATAC CCCAGCCATC TTGAGCCGCT GCTTTTCTGA AGCGGCTCAA 
GATGTTCTTG AGTTTCAAAA AATGCAGAAG ATAAATTTGG ACAGGTCATG GGAATATCTT
GAGCTTGGTT TTGCAAATGT ATTAAACTTA ACAAGTTCAG AATATGAGTG GAAAAAGGTG
GATCTGCCAC ATGATGCTGT AATTGAAAAA GAAAGAAGTG AGGCTAATCC CTCAGGTGCC
GGTGAAGGGT ATACTGCGGG ATGCAGTTTG TATTACAAAA AAGAATTGAT TTTAGATGAA
GAGTGGCAAG GCAAAAACTT AATACTTGAG TTTGAAGGAA TCATGGGTAT AGCTGAGGTT
TTTGTAAATG GTAGGTTAGT TGCTAAACAC TTCAATGGAT ACACAAGTTT TCTAATTGAT
ATAACAAAGC ATGTCAAGTT TGATGATAAG AACATTATTA TTGTGCGTGT AGAAAATACT
CATAAGCCAA GCTCAAGATG GTACGCAGGT TGTGGCATAT ACAGACACGT GTGGCTACAC
ATTGGTGGCA AGGTGTATAT AAAACCATGG CATTTGCATG TTCAAACCAG AGCAATTGAA
AATGAAACTG CAAAGTTAGA AGTAAGAGCT GTTGTTGTCA ACAGTGTTAA TGAAAAAGTT
CAAGGAGTAA TAAAATTTGA TGTATTTTCA AAAGATGAGA AACTAATTCT CAGCAGTGAA
GAGAAGTTTT TGATTGGTAA AAATGAAGAA AAGGTAATTG CCAAAACCTT AGAACTAAAA
CCATTTAAAT ATTGGGATAT AGAAGACCCA TATCTTTATA AAATTCAAGC AACTATTATT
TGTGATGAGA GTATTGAAGA TAGTGCTTCT ACATTGTTTG GTATTCGGAC AATTTCAGTT
GATCCCAAGG AAGGATTTAA ATTAAATGGT AAGCCATTAA AATTAAAAGG TGGCTGTATT
CATCATGACA ATGGACCACT TGGAAGTGCG AGTTACGATA GGGCAGAAGA GAGAAAAGTA
GAACTTTTAA AAGCTTCTGG ATTCAATGCG GTAAGACTTG CACATAATCC TTTTGCTCCA
GCTTTTTTGG ATGCATGTGA TAGATTAGGG ATGCTTGTGA TAGAAGAATT TTTTGATGTA
TGGCATGCTG GCAAGGTTAG CTTTGACTAC CATCTATTTT TTGACAAGTA CTGGGAAGAA
GATTTGGAGT CCACCATAAT GAGGGACTAT AATCATCCTT CGATAATAAT GTGGTCAATA
GGAAACGAGA TTACATGGGG AGTTGGAGTT GATGTTGATG ACGATAGTAG CTACTCAATA
TACACCTGGT GTGAACGATT AGCAAAAAAG GTAAAAAGTT TGGATTCATC AAGGCTAATA
ACAGCGGCAC TGTGCGCAAT TCCGGATGAT TATAAAAGGC TTTTTGCTAT AATTGAAGAA
GGTAACTATG TAATTAGAAT GCTAAAACAA GAAGTAGATG TTATTGAAGA TAAATGGGGC
GAGTTTTCAG AGAAGTTTTC AAGATTTCTG GATGTAGTTG GTTATAACTA TAAAGTAGAT
AGATATGGAT TCGATAGGTA TAAATATCCT GACCGAGTAA TCTGTGGCAC AGAAACTTAT
CCATATACTC TTTTTAAAAA CTGGAAACAA ACAATAGAAA ATTCAAATGT TATAGGTGAT
TTTGTGTGGA CGGCTATTGA TTATTTGGGC GAAGCAGGTC TTGGTAGGGT AAGTATTGAA
GCAGATGACC TTAAATCTTT CTGCGGGTCT TATCCATGGT TTTTAGCAAA TTGTGGGGAT
ATTGATATTT GTGGGGAAAA ACGTCCTCAA TCGTATTATA GAGATGTTGT TTGGGGAAAT
AGAAAAGACC CTTATATTGT AATACTTCCA CCGCAAGTTT ATGGCAAAAA ACTATATTTT
AAACCCTGGG CATGGGAACC GGTCGAAAGA AACTACACTT TTCCTGGATA TGAAGGGATG
AAGGTGGCTA TACATGTCTA TGCAGATGCA GATGAGGTTG AGCTGTTTGT AAATGGCAGA
AGTTTGGGAA GAAAAGAAGT AGGGATTAAT ACTCAGTTTA AAATGGTTTA TGACACAATT
TATGAACCAG GAGTGATAGA AGCTGTAGCA TACAAAGATG GTAAAGAGAT TGGCAGAGAC
AAAATAGAAA CAACAGGTGA GCCCGTAGCC TTAAAACTGG TGCCTGATAG AGAAGTTATA
TCCTCTTCTT ATGGTGATTT GTGCTATATA AAAATAATGG CAGTAGATAG AAAGGGAAGA
GAGGTTGTTT TTGCAGATAA CAGAATAGTT GTAGAAGTTG AAGGTGTTGG CGAACTTGTT
GCTTTAGGAA GTAGCAATCC TTTGTCAACA GAACCATTTG TTAGCCGAGA GAGAAAGCTC
TATAAGGGGC GAGCATTGGC AATAGTAAAG AGCATTGGCA AAAAAGGAGA ATTTAAATTG
AAAGCTTGGG CAGAAGGATT GGATGGTGCA CAAGTTTTTG TAAAATGCAT TTAA
 
Protein sequence
MWRTINTPAI LSRCFSEAAQ DVLEFQKMQK INLDRSWEYL ELGFANVLNL TSSEYEWKKV 
DLPHDAVIEK ERSEANPSGA GEGYTAGCSL YYKKELILDE EWQGKNLILE FEGIMGIAEV
FVNGRLVAKH FNGYTSFLID ITKHVKFDDK NIIIVRVENT HKPSSRWYAG CGIYRHVWLH
IGGKVYIKPW HLHVQTRAIE NETAKLEVRA VVVNSVNEKV QGVIKFDVFS KDEKLILSSE
EKFLIGKNEE KVIAKTLELK PFKYWDIEDP YLYKIQATII CDESIEDSAS TLFGIRTISV
DPKEGFKLNG KPLKLKGGCI HHDNGPLGSA SYDRAEERKV ELLKASGFNA VRLAHNPFAP
AFLDACDRLG MLVIEEFFDV WHAGKVSFDY HLFFDKYWEE DLESTIMRDY NHPSIIMWSI
GNEITWGVGV DVDDDSSYSI YTWCERLAKK VKSLDSSRLI TAALCAIPDD YKRLFAIIEE
GNYVIRMLKQ EVDVIEDKWG EFSEKFSRFL DVVGYNYKVD RYGFDRYKYP DRVICGTETY
PYTLFKNWKQ TIENSNVIGD FVWTAIDYLG EAGLGRVSIE ADDLKSFCGS YPWFLANCGD
IDICGEKRPQ SYYRDVVWGN RKDPYIVILP PQVYGKKLYF KPWAWEPVER NYTFPGYEGM
KVAIHVYADA DEVELFVNGR SLGRKEVGIN TQFKMVYDTI YEPGVIEAVA YKDGKEIGRD
KIETTGEPVA LKLVPDREVI SSSYGDLCYI KIMAVDRKGR EVVFADNRIV VEVEGVGELV
ALGSSNPLST EPFVSRERKL YKGRALAIVK SIGKKGEFKL KAWAEGLDGA QVFVKCI
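
The sequences above are printed in blocks of ten characters. The following Python sketch shows one way such a listing might be cleaned up and cross-checked. It strips the whitespace, computes the GC fraction, and translates codons into residues. It uses the amino-acid assignments of translation table 11, which match those of the standard code (the two tables differ only in their permitted start codons). The function names are illustrative and not taken from any database tool. For brevity, only the first 30 bases of the gene sequence are translated here.

```python
from itertools import product

# Codon -> amino acid map in the conventional T, C, A, G ordering.
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First three blocks of the gene sequence listed above.
first_blocks = "ATGTGGCGGA CTATAAATAC CCCAGCCATC"
print(translate(first_blocks))  # MWRTINTPAI
```

The translated prefix matches the first ten residues of the protein sequence in this record. Running the same functions over the full gene sequence would allow the listed protein length and GC content to be checked in the same way.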