Gene Athe_0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0228 
Symbol 
ID7407219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp276624 
End bp278459 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content37% 
IMG OID643714628 
Productglycoside hydrolase 15-related 
Protein accessionYP_002572151 
Protein GI222528269 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID[TIGR01577] oligosaccharide amylase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.159353 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAGC CACATATTAT AGAAGCTATA ATTGGGAACA CAAAGGTTTT AGGGCAGCTT 
GATTCAAATG GCATATTGCA AAGGTTTTAT TGGCCTGCAG TAGATTATTA TCAGCAGCTA
AAACTCTTTT TGGCAGCTGT TTTTTTGGAT GGGCTTGTAT TTTTCGAGGA TGAAAATTTC
AAGATAAAAA GTGGATTTGT GGATGACTTT GTGTACTTTT TTGAATATAA AATTGCAGAC
AAGACAATTT TTCAGCTTGA CTTTGTTGAC TTTGAAACAG ATAGCTTGGT TCGTTTATGG
GAAACTGGCT TCGAAGACTT CTATGTCTTT TTAGAACCCA TGATAAATTC TTCAAGTCTT
TTTAATGCTG CAAAGGTTGA TAAGGAAAAT GAAATAGTCT ATGCATATTT TAAAGGGACA
TATATAGGTC TTGCTTTTGA GAATAAGATA AAAAGCTTTA CAGTTAAAAA CGGAATTGAT
GATGCAAACG ATAATCAACT GGAAGGCTGG AATGAAGCTA CAAATCCTCA GATTGCCGTA
AAACTTAAAA ATACAGGAAA GGTTGTATGT TTTCTTGCTT TTGGGAACTC AAAAGATGAA
ATCTATCAAA AGCTTTCTTA TTTAAAGCAA AAAGGGTATG ACGAAGTTTA CAGGCAAAAC
AAAGCCTTTT GGGAAAAAAA ATTCTCAAAA GTAAAGCTCA TTTGCACACA AGACCCAAAA
GATATGCAGC TTCAGAAAAG AAGTGCATAT GTATTTTATG TACTGCAGAA CTCCAAAACA
GGTGGAATTT TAGCTGCATC AGAGGTTGAC GAGAAGTTTT TCCACTGTGG CGGGTATGGT
TTTGTCTGGG GAAGAGACGC TGCGTTTATA GTATCTGCAA TGGATGAGCT TGGGCTCTCA
AGGGAGGTTG AAAAATTTTT TGGATTCAAA TTTTCTTGTC AGGAAAAGGA AGGATTCTGG
GACCAGAGAT ATTACACAGA TGGCAGCTTA GCTCCAAGTT GGGGAATTCA GATTGATGAG
ACAGCTTCTG TTGTGTGGGG ATTTTTAGAA CATTGCGAGA AGCAAAATTC TCTTCATTTG
ATTGATTTGC ATAAAGAACA GCTCAAAAAA GCACTGCTGT TTTTGATAGC TGCTGTGGAT
AGCGAAAAGG GAGTTATCTT TAGAAGCTTT GACCTGTGGG AAGAAAGAGA AGGAATTCAT
CTTTACTCAA ATGCAAGCAT ATATGCAGCG CTAAAGAAAG CCAAAAAATA TTTTCCTGAG
CTTGAAAGTG AAATTGAAAA GAAGCTAAAG GCAATAAAAA ATCAGATGGC AACAAGATTT
TACAGTCCTA AACTTTCCCG GTATGTAAGG TCAACAGATG TTAGAATTCC ACATGAGGAA
TTTTTAAAGC TTCCTGAAGA GAACAGGTAC ATGCAAAAAG ATGAGAGATA TGAGATAACC
TATTATTTCA AAAAGCAAGA TGAAGTTGTT GACATTTCAA TGCTTGGCAT TTATTATCCT
TTTGAAATGG TAGATAGCAG CGATAAGGCT TTCAAAGCAA CCATTTTGGC TATTGAAAGG
GAGTGTCAAA ATTCAATTGT CGGGGGCTAC AAGAGATACT CTGATGACAG ATACATTGGT
GGAAATCCAT GGATACTGAC AACACTCTGG CTTGCAATTT ACTACAAAAA AACAGGGCAG
ATTGACAGGG CAGAAAAACT TTTTGAGTGG GCAAAAGCGC ACAGTTTGCC AAACGGACTT
TTTCCAGAGC AGGTTGACAG AATAACAGGA AAGCCTGCAT GGGTTGTTCC TTTAGCATGG
TCTCATGCAA TGTATGTGCT GTATCTTTAT GAATAA
 
Protein sequence
MRKPHIIEAI IGNTKVLGQL DSNGILQRFY WPAVDYYQQL KLFLAAVFLD GLVFFEDENF 
KIKSGFVDDF VYFFEYKIAD KTIFQLDFVD FETDSLVRLW ETGFEDFYVF LEPMINSSSL
FNAAKVDKEN EIVYAYFKGT YIGLAFENKI KSFTVKNGID DANDNQLEGW NEATNPQIAV
KLKNTGKVVC FLAFGNSKDE IYQKLSYLKQ KGYDEVYRQN KAFWEKKFSK VKLICTQDPK
DMQLQKRSAY VFYVLQNSKT GGILAASEVD EKFFHCGGYG FVWGRDAAFI VSAMDELGLS
REVEKFFGFK FSCQEKEGFW DQRYYTDGSL APSWGIQIDE TASVVWGFLE HCEKQNSLHL
IDLHKEQLKK ALLFLIAAVD SEKGVIFRSF DLWEEREGIH LYSNASIYAA LKKAKKYFPE
LESEIEKKLK AIKNQMATRF YSPKLSRYVR STDVRIPHEE FLKLPEENRY MQKDERYEIT
YYFKKQDEVV DISMLGIYYP FEMVDSSDKA FKATILAIER ECQNSIVGGY KRYSDDRYIG
GNPWILTTLW LAIYYKKTGQ IDRAEKLFEW AKAHSLPNGL FPEQVDRITG KPAWVVPLAW
SHAMYVLYLY E