Gene Athe_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0234 
Symbol 
ID7407225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp283862 
End bp284848 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content37% 
IMG OID643714634 
Productglycoside hydrolase family 5 
Protein accessionYP_002572157 
Protein GI222528275 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAT TACCAAAATA CAAAGGATTC AATCTACTTG GTCTTTTTGT TCCAAATATG 
AGCTATGGAT TTTTTGAAGA TGATTTTAAA TGGATGCAGG AGTGGGGATT TAACTTTGCA
AGAATACCTA TGAACTACAA GAACTGGTAT GTTGAAGGGC GACCGGAAAT AAAAGAGGAA
GTTTTAGAAA TGGTAGACAA GATTGTTGTC TGGGGGCAAA AGTATGGTAT TCATATCTGT
TTGAACATTC ACGGTGCGCC AGGCTATTGC GTGAATGAAA GAACAAAAGA AGGCTATAAC
CTGTGGAAAG ACAGAGAACC TCTTGAGTTA TTTGTATTAT ATTGGCAGAC ATTTGCTAAA
AGGTATAAAG GGATATCTTC AAAACATTTG AGTTTTAACC TTATAAATGA ACCAAGACAG
TATTCAAAAG AGGAGATGAC AAAAGAGGAT TTTATAAGAG TTATGACATA TACAATTGAA
AAGATAAAAG AGATTGATAA AGAGAGGCTT ATTATAATTG ACGGTGTTGA TTATGGCAAT
GAACCTGTTT TTGAGCTCAC AAATTTAGAT GTAGCACAAG CTTGCAGGGC ATACATTCCG
TTTGAGCTTA CCCATTACAA GGCAGAGTGG GTTGAGGAGA GTAACAAATT TGCTGAGCCA
TCGTGGCCGC TTGTACGCGA AAATGGTGAG GTTGTTGACA AGGATTATTT AAGAAGGCAT
TATGAAAAGT GGGCAAAATT GTTTGAATAC GATGTTGGTG TAATTTGTGG TGAAGGTGGA
GCATACAATA AAACACCCCA TCATATTGTT GTGAGATGGC TGGCTGATGT ATTGGATGTT
TTAAAAGAGC TTGGGATTGG GATTGCGCTT TGGAATTTGC GTGGTCCTTT TGGAATTATT
GATTCTGGAA GAGATGATGT CGAATATGAA GACTTTTATG GGCACAAGCT TGATAAAAAG
TTACTCGAAC TTTTGATGAG ATTTTGA
 
Protein sequence
MNKLPKYKGF NLLGLFVPNM SYGFFEDDFK WMQEWGFNFA RIPMNYKNWY VEGRPEIKEE 
VLEMVDKIVV WGQKYGIHIC LNIHGAPGYC VNERTKEGYN LWKDREPLEL FVLYWQTFAK
RYKGISSKHL SFNLINEPRQ YSKEEMTKED FIRVMTYTIE KIKEIDKERL IIIDGVDYGN
EPVFELTNLD VAQACRAYIP FELTHYKAEW VEESNKFAEP SWPLVRENGE VVDKDYLRRH
YEKWAKLFEY DVGVICGEGG AYNKTPHHIV VRWLADVLDV LKELGIGIAL WNLRGPFGII
DSGRDDVEYE DFYGHKLDKK LLELLMRF