Gene Athe_0097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0097 
Symbol 
ID7408459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp119197 
End bp120537 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content38% 
IMG OID643714507 
ProductGlucosylceramidase 
Protein accessionYP_002572030 
Protein GI222528148 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACAAAA CAATTGCATG TTACGTAACA GCAAAAGATC AAGACATTCC GATGCAGCAA 
GTAGACAATA TAAAAGAAAG CACAAAAATA GAAAGTAGTA TTGTTATAAC AATTGAGCCA
TCCACTACAT TTCAAGAGGT AATAGGTTTT GGTGGGGCAC TGACTGAAGC TGCTGCGGTA
AATATACTGT CGCTTTTGCC ACACCAGCAA GAAGAGATTT TAAGAGGGTA CTTTGACCCA
AAAGAGGGGC TTGGCTATAA GCTTTGTAGA ATCCACATGA ACAGCTGTGA TTTTTGTGTT
GATAGCTACA GCTGTGATGA TGTTGAAGGT GACATAGAGC TAAAACACTT TAACATTGAA
CGAGACAAAA AGATGGTAAT TCCTCTTTTA AAAAGGATAA AGGAGTATTG CAAAGACCTA
AAAATTCTTG TTTCGCCATG GAGTCCGCCT GCATGGATGA AGACAAACGG TGATATGTGC
CATGGTGGAA AGCTAAAGGA TGAGTATAAA AAAACATGGG CAAGATTTTT CTGCAAATTC
ATAAAAGCAT ATAAAGAAGA AGGAATTGAG ATATGGGCTG TGACAGTTCA AAATGAGCCT
ATGGCAACTC AAGTGTGGGA GTCGTGCATA TACACAGCTG AAGAAGAAAG AGATTTTGTG
AAGTATTATT TGGGGCCGAC TCTTGCGGAA GAAGGACTGG GGGATGTAAA GATACTCATT
TGGGATCACA ACAAAGACAT CATATATGAC AGGGTAAAAA CAATTTTGAG TGACAAAGAA
GCTGCAAAAT TTGTATGGGG AGTTGCATTC CACTGGTATG GAGGAGACCA TTTTGACCAG
CTCAAAAAAA TAAAAGAAGA ATTTCCTGAT GTCAATTTGG TGTTTACCGA AGGTTGTCAG
GAAGGTGGAG TGAAGCTTGG TTCTTGGGAG CTTGGAGAAA GGTATGCTCA TGAGATAATT
GGTGATTTTA ATAACTACAC AATTGGATTT ATGGATTGGA ATATTGTTCT TGACACAGTT
GGAGGCCCCA ATCATGTAGG AAACTTTTGC GACGCTCCAA TAATAGTTGA TAAAGACCAG
AAAAAGATTT ACTATCAAAA TGCATGTTAT TATATAGGGC ATTTTTCTAA ATTCATAAGG
CCGGGAGCTA AAGTAGTCAA AAGTAGCTGT AGTAGTTCAA AACTTGAAGT TTTGGCAGCG
AAGAGCCAGG ACGATACTTT AGCAGTGGTT GTGTTTAATA AAAACCCAGA GGAAATAGAG
TTTAGTATTG TCATTGGAGA TAAAATATTC AGCGGAAAGT CTCCAGCAAG GTCTATATTG
ACCATTGTTC TGGAAAAGTA A
 
Protein sequence
MHKTIACYVT AKDQDIPMQQ VDNIKESTKI ESSIVITIEP STTFQEVIGF GGALTEAAAV 
NILSLLPHQQ EEILRGYFDP KEGLGYKLCR IHMNSCDFCV DSYSCDDVEG DIELKHFNIE
RDKKMVIPLL KRIKEYCKDL KILVSPWSPP AWMKTNGDMC HGGKLKDEYK KTWARFFCKF
IKAYKEEGIE IWAVTVQNEP MATQVWESCI YTAEEERDFV KYYLGPTLAE EGLGDVKILI
WDHNKDIIYD RVKTILSDKE AAKFVWGVAF HWYGGDHFDQ LKKIKEEFPD VNLVFTEGCQ
EGGVKLGSWE LGERYAHEII GDFNNYTIGF MDWNIVLDTV GGPNHVGNFC DAPIIVDKDQ
KKIYYQNACY YIGHFSKFIR PGAKVVKSSC SSSKLEVLAA KSQDDTLAVV VFNKNPEEIE
FSIVIGDKIF SGKSPARSIL TIVLEK