Gene Athe_0359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0359 
Symbol 
ID7409289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp411945 
End bp413345 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content39% 
IMG OID643714745 
Producthypothetical protein 
Protein accessionYP_002572268 
Protein GI222528386 
COG category[S] Function unknown 
COG ID[COG3885] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00296] uncharacterized protein, PH0010 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000914729 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGGAT ATTTGCTTCC ACATCCACCA GTGCTGATTC CCGAGATTGG GAGAGGCGAG 
GAGAAAAAGT GCCAGGCGAC CTTAGATGCT TTACAAAAGG TAGCAGATGA GATAGCTGAA
TATAAACCTG AGGTCATAGC CATAATATCA CCCCATGCAC CTGTGTTTAC GGATGCTTTT
TTCTTGAACG ACAAGCCAGA AATTGGTGGA AGCCTTGCAA GATGGGGTGT ATATGGAATT
GAATTTAGGT TCAAAAATAA CCTTGAGATA GTTCAAGACA TAGCAAAAAT GTGCAGCCAG
GAAGGGTTGA CGGTTGGATT TGTGTCAGAC AAAATTCAAA AAAGATATGG CGTTTCGCGA
GAGCTTGACC ATGGCGCGTT AGTTCCGCTT TATTTTATTA CCAGAAAGTA TAAAGAATTT
GAGCTTATAC ACACTTCTTA CTGTATGCTT GATGATATTA AGCTTTATAA ATATGGAATG
ATACTCAGAA GGGCAATTGA AAAGCATGGC AAAAAAGGTT TAATTATAGC TTCAGGCGAC
CTTTCGCACA AACTCTCTTA CGATGGGCCT TACGGGTTTG CAAAAGAAGG ACCTGAGTTT
GACAAACTTC TGGTTGAACT TTTGCAAAGT AGCAATGTAC GAGCACTTTA TGACATAGAT
CCTGTACTTT CAGAGAAGAC GGCAGAATGT GGTTTCAGAT CCATAAAGGT TTTGCTTGGA
GCATTTGAGG GCTATAGTAT AGAATCAAAG GTTTATTCAT ATGAAGGACC TTTTGGCGTT
GGATACTGTG TTGCTGCCTT TTACCAGAAA GAACAGACAA GCTCTTCTTT GTTTGAGGAG
ATAGTGAAAA AAAGAGAAGA GAGACTAAAG AGAATAAGAG AAAATGAAGA TGAATATATA
AGACTTGCAA GAGAAAGCTT AGAATACTAT GTAAGACACC GCAGGTACTT AGATTATATA
CCAGATTATG TCACAGAACG GATGCTAAGA GAAAGAGCAG GAGTTTTTGT GTCAATTAAA
AAGGATGGAA ACTTGAGAGG ATGTATAGGT ACAATTTATC CTACTCAAGA AAACATTGCA
AAAGAGATAA TCAGAAACGC TGTTGCAGCA GGGTTTCACG ACCCCAGGTT TGAAGAGGTA
ACAGAAGATG AGCTTGACAG TCTTGTGTAT GATGTTGATA TTCTAAGCCC ACCTGAGAAG
GTAAACTCGA AAGACCAACT TGATCCTAAA AAATATGGAG TTATTGTGCG AAAAGGTGCA
AGACAAGGGC TTTTGCTTCC TGATTTAGAA GGTGTTGACA CAGTTGAAGA GCAGCTTAAG
ATAGCCTGCA GAAAAGCAGG AATTGATTAT GAAAGTGAAG ATTTTGAGAT AGAAAGGTTT
ACAGTTGAAA GACACAAGTA G
 
Protein sequence
MVGYLLPHPP VLIPEIGRGE EKKCQATLDA LQKVADEIAE YKPEVIAIIS PHAPVFTDAF 
FLNDKPEIGG SLARWGVYGI EFRFKNNLEI VQDIAKMCSQ EGLTVGFVSD KIQKRYGVSR
ELDHGALVPL YFITRKYKEF ELIHTSYCML DDIKLYKYGM ILRRAIEKHG KKGLIIASGD
LSHKLSYDGP YGFAKEGPEF DKLLVELLQS SNVRALYDID PVLSEKTAEC GFRSIKVLLG
AFEGYSIESK VYSYEGPFGV GYCVAAFYQK EQTSSSLFEE IVKKREERLK RIRENEDEYI
RLARESLEYY VRHRRYLDYI PDYVTERMLR ERAGVFVSIK KDGNLRGCIG TIYPTQENIA
KEIIRNAVAA GFHDPRFEEV TEDELDSLVY DVDILSPPEK VNSKDQLDPK KYGVIVRKGA
RQGLLLPDLE GVDTVEEQLK IACRKAGIDY ESEDFEIERF TVERHK