Gene Athe_0165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0165 
Symbol 
ID7407156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp203747 
End bp205420 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content37% 
IMG OID643714567 
Productalpha amylase catalytic region 
Protein accessionYP_002572090 
Protein GI222528208 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000167663 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTTGC ACAAAAAGTG GTGGAAGGAA GCTGTTGTAT ATCAAATCTA TCCTCGAAGT 
TTTTATGACT CAAACGGTGA TGGAATTGGA GATTTGCCAG GAATAATAGA AAAACTTGAT
TATTTACAAG AACTTGGAGT GGATGTTATT TGGCTAAATC CTATTTACAA ATCCCCAAAT
GCAGACAATG GTTATGACAT CAGCGATTAC TATGATATCA TGGACGAGTT TGGTACAATG
GAAGACTTTG ATAGGCTCTT GAATGAAGCT CACAAAAGAG GAATTAAAAT TGTAATGGAC
CTTGTTGTGA ACCACACATC TGATGAACAC AAATGGTTTT TGGAGTCCAG AAAATCAAAA
GACAATCCTT ACAGAGATTT TTATTTTTGG CGGCCTGGTA AAAATGGAGG GCCTCCTAAT
AACTGGACAT CCTTTTTCAG CGGCCCTGCT TGGGAATACG ATGAGCTGAC AGGAGAATAT
TACCTTCACC TTTTTGCTGT AAAGCAACCA GACCTCAACT GGAACAATCC TCAGGTTCGT
CAAGAAATTT ACAAAATGAT GAAGTGGTGG CTTGACAAAG GCATAGATGG TTTTAGAATG
GATGTAATAA ACCTCATTTC AAAAGTTGAA GGACTGCCAG ATGACCCAGA TGAAAATCAA
GGAGGTTTGA TTGGTTTTAA ATACTATGCC AACGGTCCAA GGGTGCATGA ATATCTTCAG
GAAATGAACA GAGAAGTACT TTCTAAATAC GATATCATGA CAGTGGGAGA GACTCCATTT
GTAACTCCTG AGATTGCAAA GCTGTATGTA GAGTATGACA GAAATGAGCT TAATATGCTC
TTTCATTTTG AGCACATGGA TATGGACTGC AGCGGCAGCA AGTGGAATAT AAAACCTTGG
AAACTCACTG ATTTGAAAAA GATAATGTAC AAGTGGTATT TGGCTTTAAA AGACAAGGGC
TGGAATTCGC TTTATCTTAA CAACCATGAC CAGCCAAGGA TGGTCTCACG CTTTGGAAAC
GACAAAGAGT ACAGGGTTGA GTCTGCAAAG CTTTTAGCAA CCCTGCTTCA CACATGGCAG
GGAACTCCTT ATATCTACCA AGGTGAAGAG ATTGGTATGA CAAACTGCAA GTTTGAAAGT
ATTGATGAGT TCAGGGACAT TGAAACACTC AACTGGTACA AAGAGATGAA AAAGCTTGGA
AAATCAGATG AAGAGCTTTT AGAAATCTTA AACAAAAGAA GCAGAGACCA TGCACGAACA
CCTATGCAGT GGGATGATTC TGAAAATGCA GGATTTACAA AAGGTACCCC ATGGATAAAG
GTAAATCCAA ATTATAAGGA AATAAATGTT AAAAAAGCTT TAGAAGATAA AAATTCAGTG
TTTTATTACT ATAAGAAATT GATTGAACTT AGAAAAAAGC ATCCTGTCAT TGTATATGGG
GATGTTCAAA TGCTTTATGA GGACGATGAA AAGATCTTTG CATATACAAG AAGCTATGAA
GATGAAAGGC TTCTTGTTGT AATGAACTTT TCTGAAGAAG AAAGCGAGTT TTTAGCACCA
AATGAAATAT TTACCCAAAA ACCTGAACTT CTGATAAGTA ACTATGAAAT TGACGACAAT
ATCCAAGAAA AAATTGTTTT AAAACCTTAT GAATCGAGGG TATACAAAAT ATAA
 
Protein sequence
MDLHKKWWKE AVVYQIYPRS FYDSNGDGIG DLPGIIEKLD YLQELGVDVI WLNPIYKSPN 
ADNGYDISDY YDIMDEFGTM EDFDRLLNEA HKRGIKIVMD LVVNHTSDEH KWFLESRKSK
DNPYRDFYFW RPGKNGGPPN NWTSFFSGPA WEYDELTGEY YLHLFAVKQP DLNWNNPQVR
QEIYKMMKWW LDKGIDGFRM DVINLISKVE GLPDDPDENQ GGLIGFKYYA NGPRVHEYLQ
EMNREVLSKY DIMTVGETPF VTPEIAKLYV EYDRNELNML FHFEHMDMDC SGSKWNIKPW
KLTDLKKIMY KWYLALKDKG WNSLYLNNHD QPRMVSRFGN DKEYRVESAK LLATLLHTWQ
GTPYIYQGEE IGMTNCKFES IDEFRDIETL NWYKEMKKLG KSDEELLEIL NKRSRDHART
PMQWDDSENA GFTKGTPWIK VNPNYKEINV KKALEDKNSV FYYYKKLIEL RKKHPVIVYG
DVQMLYEDDE KIFAYTRSYE DERLLVVMNF SEEESEFLAP NEIFTQKPEL LISNYEIDDN
IQEKIVLKPY ESRVYKI