Gene Athe_0083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0083 
Symbol 
ID7408445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp104646 
End bp105944 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content31% 
IMG OID643714493 
Producthypothetical protein 
Protein accessionYP_002572016 
Protein GI222528134 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTATA GCAAAAAAGA GTTTTTAAAA AGCATAGTGC TGAGTATACT TGTTATACTC 
AGCATTTATT TGTATTATAA AATTTATTTT GACTACAAAA CAGAAGACAT TGTGAAATTA
GCTTTAGCAT ACAAGGAAGA ACAATCTCAG CAGCAGCTTT TGAAAAAGGC AAAAGAAATT
TTGGTTTCGC CAAAGGAAAT GTATTTAAAC ATTTCAAAAA ATATTGCACT CAGGATTTTG
AGCAACCAGA ACGAATATTT ACAAGTTGTC TCAAAGTTTA TAGAAGGGAT TGAGAGAGGA
TTTGTCAAAA GAGAGGTAGA AGTGTCCACT AAAAATTTAA ATATAGATTT TTTTAAGTCA
GGTAGATATT TAATTTTTTG CTATGATTAT CCTGTTAACT TGAATGTTTT CTTGTATGAG
TTAACAAGAA GAACTACTGA CAAAATACCG AAAGGTTTTG AATTTGACAA AATATTTATA
AAAGAAGATT CAAATGCAAC TATTGTGTAT TTTTTTAATT CCCAAAAACA GATTGCGATT
TTGCAGAAAT TTGACCAGTT TGGTTTTTTG CCACTGGAAA GAGTAATTGA GAAAAACTTA
TCCATTATAT ATTCATGGGC AGACGGGCTT GGGTTTACAG AGATAGCCAA AAATGATGTG
TTAATACCTA TTGAATTTTC TGATATTCAG TTTTCTGAGA TAAAGATAAA AGATAGCAGT
TTTAAGAAAG AATGGATTGT TCGCAGACTA TTTCCAGATA CTATTCTTAC CAGAAAGAAT
ATTCTTAAAG ATGGCGATAT TGCAATAACA GACGAAAGAA AGATGTTGGT TTTTAAAAAT
GACCGAGGTT TTGAATTTGA ATACACAGAG AAAACATTTG GTGACAGAGT AGATTCTGTC
TGCGATACTT TAATGTTTTA TCTTAAGACA TTTTATACTG ATGAAGACTT GAGAGTTTTT
AGTCTTAAGA CAGAAAAAGA AGGCAATTTT ACCATAAAGC TTGGTTTGAG AACAAGCGGA
ATAGACATTG CTTCATCTGA TGAAGAGTAC TGTGTAGAAA TTGAAGTAAA AAGCGGAAGG
ATTTATAGAA TCTCGGGATA CATTTTTGAT ATCATTAAAG TCAGGACTTC ACAGATAAAG
GTTGAAGGGA TTGCGGCAAT TGACACATTA AAAGAACGGA AAGGTGATAT TTTCATAGAG
GATATAGACA TTGAATATGT TTTAGGTGGT GCAAGTTCAT ATCCTTACTG GAAAATAAAA
ACTCAAAACG GAGTTGCCTT TGTTGAAACA ATAAAGTAA
 
Protein sequence
MRYSKKEFLK SIVLSILVIL SIYLYYKIYF DYKTEDIVKL ALAYKEEQSQ QQLLKKAKEI 
LVSPKEMYLN ISKNIALRIL SNQNEYLQVV SKFIEGIERG FVKREVEVST KNLNIDFFKS
GRYLIFCYDY PVNLNVFLYE LTRRTTDKIP KGFEFDKIFI KEDSNATIVY FFNSQKQIAI
LQKFDQFGFL PLERVIEKNL SIIYSWADGL GFTEIAKNDV LIPIEFSDIQ FSEIKIKDSS
FKKEWIVRRL FPDTILTRKN ILKDGDIAIT DERKMLVFKN DRGFEFEYTE KTFGDRVDSV
CDTLMFYLKT FYTDEDLRVF SLKTEKEGNF TIKLGLRTSG IDIASSDEEY CVEIEVKSGR
IYRISGYIFD IIKVRTSQIK VEGIAAIDTL KERKGDIFIE DIDIEYVLGG ASSYPYWKIK
TQNGVAFVET IK