Gene Athe_2381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2381 
Symbol 
ID7407800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2532825 
End bp2533946 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content36% 
IMG OID643716744 
Productglycosyl hydrolase family 88 
Protein accessionYP_002574223 
Protein GI222530341 
COG category[R] General function prediction only 
COG ID[COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAT TAAAACAGCT CGCAAAAGAA AGCTATTCTG TCAAAATGGC AGATAGTGCA 
CTTTTTAAGT TTCCAGATCT TATTGATAAA TGGCAATATG ACTATGGTGT TGTGTTCAAA
GGTTTAGAAT ATATTTATGA AAACACCAAG GATGAGAAAT ATTTTGAATA CATAAAGAAA
AATATAGACT ATTTTGTCAA AGAAGATGGA AGCATAAAAA AATATTCTCT TGATGAATAC
AACATAGACC ATATAAACAA TGGTAAAGCT GTATTGTTTT TGTATAGAAA AACAGGTGAA
GAAAAGTACA AAAAAGCAGC TCAGCTTTTA AGAGAACAGC TCAAGACTCA TCCAAGAACG
GTAGAAGGTG GATTTTGGCA CAAAAAGATA TACCCGCACC AGATGTGGCT TGATGGGATA
TATATGGGTT CGCCATTTTA TGCTGAGTAT GCAACTTTGA TAGGAAAAGA TGAAGCAGAA
GAGATATTCG ACGATGTTGT AAGACAGGTC ATTCTTTGTG CAAAGCATAC TAAAGACCCA
GTGACTGGTC TTCATTATCA CGGCTGGGAT GAAAGCAGAC AGCAAAAATG GGCAAATAAA
ATCACAGGCT GCTCTCCAAA TTTCTGGGGA AGAGCACTTG GCTGGTTTGC AATGGCAATA
GTTGATGTTC TCGATTTTCT TCCACAGAAT CACCAATCAA GAGATACAAT TTTAGCTATT
TTCAGGCAGC TTATGGATGC CATTTTAAAG TATCAAGACC CAGAGACAGG TGTGTGGTAT
CAAGTTGTAA ACTATATTGG TAAAAATGGT AACTATCCAG AAGCTTCGGC ATCATGTATG
TTTGTATATG CACTTGCAAA AGGCATACAC AATGGATACC TTTCTTCAAA GTACTTAGAT
GCATTAGAAA GGGCATATGA GGGGATAATT TATAGGTTCT TAGAAAAAGA CCACAATGGA
CATTTGAGCT TAAATGGAGT TTGTATGGTT GCAGGGCTTG GTGGGAATCC GTACAGAGAT
GGTTCATATG AATATTACAT CAGTGAGCCA ATAAAAACAG ATGATTTGAA AGGTGTTGGA
GCGTTTTTGA AAGCTTCCGC ATGGGTTGAA AGACTTTTTT AA
 
Protein sequence
MQKLKQLAKE SYSVKMADSA LFKFPDLIDK WQYDYGVVFK GLEYIYENTK DEKYFEYIKK 
NIDYFVKEDG SIKKYSLDEY NIDHINNGKA VLFLYRKTGE EKYKKAAQLL REQLKTHPRT
VEGGFWHKKI YPHQMWLDGI YMGSPFYAEY ATLIGKDEAE EIFDDVVRQV ILCAKHTKDP
VTGLHYHGWD ESRQQKWANK ITGCSPNFWG RALGWFAMAI VDVLDFLPQN HQSRDTILAI
FRQLMDAILK YQDPETGVWY QVVNYIGKNG NYPEASASCM FVYALAKGIH NGYLSSKYLD
ALERAYEGII YRFLEKDHNG HLSLNGVCMV AGLGGNPYRD GSYEYYISEP IKTDDLKGVG
AFLKASAWVE RLF