Gene Athe_0610 details


Gene Information

Locus tag: Athe_0610
Symbol:
ID: 7406951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 691938
End bp: 693851
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 35%
IMG OID: 643714992
Product: glycoside hydrolase starch-binding
Protein accession: YP_002572508
Protein GI: 222528626
COG category:
COG ID:
TIGRFAM ID:
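
The length fields in this record agree with each other if the start and end positions are read as 1-based, inclusive coordinates and the final codon is a stop codon. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(691938, 693851)
print(length)                     # 1914
print(protein_length_aa(length))  # 637
```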


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATGC TATTAAAAAG GTCTCTTGCT CTGCTGGTAA GTATTGTCCT TGTATTTTCG 
TTGTTCTTAA GTGTTTTTCC ACAGCAAGCA AGAGCACAAG ATACCATAAA AATTGTAGGT
AACTGGCAGG ATGCTGGTAA CTGGAATTTT GACAGCTCTA ATATTGTTTT GAGTGAAACC
TCAACTCCGG GACTTTATTA TGGTGAGTAT ACTTTTAAAA CTGGTGGTTC TTATGAGTTC
AAAGCAGTTA TAAATGGTAG TATATGGTGT ACAGGCGCTC CAAAAGTAGC AGATAATAAT
ACTAACATAC CTCTTAATGT TACAGATGGT CAGACAGTTA AGTTTTGGTT CTTCAAAAAT
TCTCAACTTG TAATTGACAG CACTCATTTT CCAAATGGGC CTGATAGTTT AGTAGGGCAA
AATTCTTTTA AATTTGTAGG AGTAAATAAT GAATGGAACC CTAATGATGG TAAGTATCAA
TTTATAAGAG TCCCAGATGC AACATATACC TATTTATATG ATAATAGTTA CAATGATTTT
TCGATACCAT ATGGATTTAA GATTATAATA GGTGGTTTTG GGAACCTATC ATGGGCATGG
AATGGTGCAA AAGAAGGTTC AGTTGTTAAA TTTAAAGAGG GTGGAGATAA TATTGACCTG
AAAGAATTTA AAGAATCTAA TGATAATCTT CTAAAAACAA AATTTTTCCT TGATATCCTA
AACGGCTGGC TCTTTACAGA AAAAGATTTG ACAAATATCC AACCAGTAGA TTTTGCAAAT
AACGGCTCTG TAATTGGTGG AACTTCAGTT CATCTTACAT GGACACCATA TACACCAAAT
GCAAATCCAC TTTCAGCCAA GCTTTACTAC AAAATAATTG ATGTGAGCAA TGATAGCGAG
TTGGTTAGTC TCACCGATTA CAGCACAATC CACAAAATAG ATATCCCAAG AGAGTGGATA
GGAAAGACAA TTAAAATTAT TGCAAATGCC AAGATAGGTG AAATAACAGG ACCTAATGTT
GAGTTTACAT TAAATATTGT TGACTTGCCA CAAAACTTAA TAGTTGAAGC AGCTAACTAT
ATAACATACA ACTCTATAGA TGAGACATTT TTAAATTTGG CTCAAGGTGA TACAGTTCAA
AACATAACTT CTAACTTCGC AGTTGTTACA TCATATGTTT ACAATGTGGT GTACGAGGGA
AATAGTTATC CACTGACATT TAATATTGAC TGGGAGAGCA GTAATTCATC TGTATTAACA
ATAAATGGTA GTACAGTTGT GGTTACAAGG CCAACACAAG GTGACCTTCA GGTTGAGTTG
AGAGCAAGAG CAAGATTTGG TGCTATTTCA GCAGATGGAC AAAAAATATT TACTCTTACA
GTAAAGAAAT TCCTATTAGG AGTAGATGGT GGAATACCTG TTACATTTAA CGTAACAGTT
CCAGATTATA CTCCTGATAA TGACAATATT TACATTGCTG GAGATTTTAA GACTGATAAG
CTTCCAAAAT GGGACCCTGT AGGTATCAAA TTGATAAAGG TCGGAGATAA AAAATATAGT
ATAACAATGT ATCTTCCACC GAATGTAACA ATTGAATATA AATATACCCG CGGAAGTTGG
TCAAAAGTAG AAAAGGATGC ATTTGGTAAT GAAATATCAA ATAGAGTTCT GCAGATAAAT
AATCAGGCTG TGACAAAAAA TGATGTAGTT GAAGCATTTG CGGACCTTGG AGCTGTCAAG
CAAGGTCTTC CAACGGTGAA TTTGGTCATC AATGTTCCTC AACAAACTGT AATTGATACA
GGTGATGGAG GAAGGGTAAT AAAAGCAGAA GGTGGTGCTA TTTCAGTTCC AAATACTAAG
AACACTGTTT TGATAGATAT TCGGAGTGCA TTAAATGCTA TTATATTAAA GTAA
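
The GC content reported in the gene information can be checked against the gene sequence above once the spaces and line breaks between the 10-base blocks are removed. The following is a small sketch of that calculation. The helper names are hypothetical, and it assumes the sequence contains only A, C, G and T.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that separate the 10-base blocks."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of bases that are G or C, between 0.0 and 1.0."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 10-base block of the gene sequence: 3 G and 1 C out of 10 bases.
print(round(gc_content("ATGAGAATGC") * 100))  # 40
```

Passing the whole gene sequence block to `gc_content` gives the value to compare with the GC content field.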
 
Protein sequence
MRMLLKRSLA LLVSIVLVFS LFLSVFPQQA RAQDTIKIVG NWQDAGNWNF DSSNIVLSET 
STPGLYYGEY TFKTGGSYEF KAVINGSIWC TGAPKVADNN TNIPLNVTDG QTVKFWFFKN
SQLVIDSTHF PNGPDSLVGQ NSFKFVGVNN EWNPNDGKYQ FIRVPDATYT YLYDNSYNDF
SIPYGFKIII GGFGNLSWAW NGAKEGSVVK FKEGGDNIDL KEFKESNDNL LKTKFFLDIL
NGWLFTEKDL TNIQPVDFAN NGSVIGGTSV HLTWTPYTPN ANPLSAKLYY KIIDVSNDSE
LVSLTDYSTI HKIDIPREWI GKTIKIIANA KIGEITGPNV EFTLNIVDLP QNLIVEAANY
ITYNSIDETF LNLAQGDTVQ NITSNFAVVT SYVYNVVYEG NSYPLTFNID WESSNSSVLT
INGSTVVVTR PTQGDLQVEL RARARFGAIS ADGQKIFTLT VKKFLLGVDG GIPVTFNVTV
PDYTPDNDNI YIAGDFKTDK LPKWDPVGIK LIKVGDKKYS ITMYLPPNVT IEYKYTRGSW
SKVEKDAFGN EISNRVLQIN NQAVTKNDVV EAFADLGAVK QGLPTVNLVI NVPQQTVIDT
GDGGRVIKAE GGAISVPNTK NTVLIDIRSA LNAIILK
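
The protein sequence above corresponds to the gene sequence translated with translation table 11. Below is a minimal sketch of such a translation, not the database's own method. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; alternative start codons are not handled here, which is harmless for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text: str) -> str:
    """Translate a CDS up to the first stop codon.

    Whitespace between sequence blocks is ignored. Alternative start
    codons (read as M at the first position) are NOT handled, and any
    character other than A, C, G or T raises a KeyError.
    """
    seq = "".join(dna_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGAATGC TATTAAAAAG GTCTCTTGCT"))  # MRMLLKRSLA
```

The output matches the first block of the protein sequence.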