Gene Athe_0578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0578 
Symbol 
ID7406919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp652880 
End bp654358 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content32% 
IMG OID643714961 
ProductPBS lyase HEAT domain protein repeat-containing protein 
Protein accessionYP_002572477 
Protein GI222528595 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.94571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAG CTATGGAAAT AATTGAAATG CTAAATTCAG ATGACTATCT ATTACAAAAA 
GAAGCTATTG AAAATGCACC AAAGTTTAAG GAACCTGAAA TTGTTGATAG ATTAATAGAT
CTTTTTATAA AAACAAACAA TAAAATGATT GAGGAGCACA TTACAGAAGC TTTAAAACAA
ATAGGTGGCA GTTATACTGT TGAAAAATTA TTGAGGCTTT TAGACCATGA AGAAGCAAGG
GTTAGGGTTT TTGCTTTTGA GGTTTTAAGT AAAATAGGTA ATGATAATAT TCACGCAATA
ATCAAAGAGG CGGAAAATCC AGATAAAAAT GTTAGGAAAT TTGTAGTGGA TATTTTAGGT
GCTCTTAAAA ACAAAGAAGC AGTTGATACT TTATTGAAAA GGTTATCAGA TGATGATGTA
AATGTAGTAC AGGGGGCCAT TGAGGCACTT GGCAATATTG GGGATGTTGA AGCTCTCAAA
AAGGTAATTG AGTTTTTACC TTCTGCTCAT CTGTGGGTAC AGTGGACAAT AATTGAGAGC
ATAAAAAAAG TGAACAATAG AGAACTAATT TCAGAGGTTT TAAATCTGCC ATGGGAGATT
GAGGATATCA TCTTTGACAG TATTTTTGAC ATGGTGAAAG AAAATGGTAC CCTTGAGAAT
GTGGAGGATG CAGTAAATCT TTATTTAAAG CTTTCAACAC AGCTGAGGAT AAAGGTTTTA
GATACCATAT ATTCGATTTA TATAAAGTCA GATAAACAAA AAGTTGAAAA AGTTTTATCA
AATACAAGCT TTTTTGATGA AATAAAAACC ATTTTGATAT ATGGTTCTGA TACACAAAAA
TTTGAGATTT TTAACTATAT GGGTAGCATA GAAGATAAGG ATTTTGTAAG GTTTATAAAG
AGTAGGGTGT TTGATGAAAC AGTTATTCTA AGCGCTATAA AACTCTATTA CGCATCGAAT
ACTTTAGAGA AAAGAGAATT AGTTAGAGTG TTTAAATATT TTAATAAGTC AAAACTGGTT
GAATATATGA GAGAGATATT CAAAGGTGAT GATAATATCT TAAAGCTTAG CGGGTTGAAG
ATTATGAGAT ACAATGGTAT CAAGGAGGCA GCAGATGTGT TGCCAGAGAT GATAAAAGAA
GGAGAGCTTT TACCAGAGGT CTTAAAAACT GTGATAGAAC TAAACTTGAA GGAGTTATTT
GATAGAATTT ATGAAGAATA TTTCAATGTC AAAAGCGATG ACTTAAGACT TTTGATGCTT
GAATGTATGG TTGAACTAAG ACCTGACGAT GCAAAGGTTG TGGCTCTAAT TAAAGATGAG
CTTGCAAATG AGTATTTATC TGATGTTCAT ATTCTCAAGC TTTTACAACT TGTAAGAAAA
ATTAATGATA AAGAACCATT TAGATTACAG TTAGAATATT TGGCTGACCA TCCTAATATA
GAGATTTCTA TTGAAGCTCA GGACCTGCTT GGAGGCTAA
 
Protein sequence
MNSAMEIIEM LNSDDYLLQK EAIENAPKFK EPEIVDRLID LFIKTNNKMI EEHITEALKQ 
IGGSYTVEKL LRLLDHEEAR VRVFAFEVLS KIGNDNIHAI IKEAENPDKN VRKFVVDILG
ALKNKEAVDT LLKRLSDDDV NVVQGAIEAL GNIGDVEALK KVIEFLPSAH LWVQWTIIES
IKKVNNRELI SEVLNLPWEI EDIIFDSIFD MVKENGTLEN VEDAVNLYLK LSTQLRIKVL
DTIYSIYIKS DKQKVEKVLS NTSFFDEIKT ILIYGSDTQK FEIFNYMGSI EDKDFVRFIK
SRVFDETVIL SAIKLYYASN TLEKRELVRV FKYFNKSKLV EYMREIFKGD DNILKLSGLK
IMRYNGIKEA ADVLPEMIKE GELLPEVLKT VIELNLKELF DRIYEEYFNV KSDDLRLLML
ECMVELRPDD AKVVALIKDE LANEYLSDVH ILKLLQLVRK INDKEPFRLQ LEYLADHPNI
EISIEAQDLL GG