Gene Athe_0344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0344 
Symbol 
ID7409274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp392725 
End bp393768 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content41% 
IMG OID643714730 
Productoxidoreductase domain protein 
Protein accessionYP_002572253 
Protein GI222528371 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAAGC TAAGACTTGC TATAATCGGA TGCGGTTCAA TCACAAAGCA CAGGCATGCG 
CCGGAGGCAA AACAAAATCC CAATGTTGAG CTTGTTGCTG TATGTGACAA GAATTTAGAC
CATGCAAAAG CCATTGCAGA AAAATTTAAA GTTGGAAATG TCTATGATGA TTATGAAAAG
ATGCTAAAAG AAATAAAACC TGATGCAGTA GTGGTTGCAA CGCCAAATTA TCTTCATGCC
GATGCTACAA TAAAAGCGTT AAAAGAGGGG GCTCATGTTC TTTGTGAAAA GCCAATGGCA
ACAACCGAAG ATGAGTGCAG AATGATGGTA GAGACTGCAA AAGAGATGGG TAAGTTTTTG
ATGATTGCTC ACAACCAAAG GTTAAATATA GCCCACAAAA AGGCAAAAGA GGTAATACAA
AGCGGTGAGC TTGGGAAAGT GCTGAGTTTT AAAACAACCT TTGGTCATGG CGGACCTGAG
AGTTGGAGCT CAGACAGGCC CGATACATGG TTTTTTCACA AAGAAGCAGC AAGCTTTGGA
GCTATGGGCG ACCTTGGCGT TCACAAGATT GACCTTATGA GATTTTTGCT TGGTGAGGAG
TTTGTTGAGA CAGCTGCGTT TGTTACAACT CTTTCCAAGA AGTATCCAAA TGGCCAGCCA
ATTGACGTTG ACGACAATGC AGTCTGCATT TTAAAGACAC AGAGCGGCGC GATTGGAACG
CTCACAGCTT CATGGACATA CCCGGGAAGT GAGGATAACT CAACTGTAAT CTACTGTGAG
AAGGGTTCAA TTACACTTTA CGCAGATCCA AAATTTTCGA TGATAATAAG ATATGCAAAC
GGTCAAAAAG CATATTTTGA GCTTGACACA ATGCAGACAA ACGAAAGACA AACAAAATCA
GGTGTGGTAG ACGAATTTAT TGATTGTATC TTGACAAACA CACCACCAAG AATTTCTGGA
GAAGAAGGTT TGAAGACCAT GAAAGTTGTG TTTGCATGTT TTGAGTCAGC AAAAACTGGC
AAGATTGTGA GGATTGATTA TTAA
 
Protein sequence
MKKLRLAIIG CGSITKHRHA PEAKQNPNVE LVAVCDKNLD HAKAIAEKFK VGNVYDDYEK 
MLKEIKPDAV VVATPNYLHA DATIKALKEG AHVLCEKPMA TTEDECRMMV ETAKEMGKFL
MIAHNQRLNI AHKKAKEVIQ SGELGKVLSF KTTFGHGGPE SWSSDRPDTW FFHKEAASFG
AMGDLGVHKI DLMRFLLGEE FVETAAFVTT LSKKYPNGQP IDVDDNAVCI LKTQSGAIGT
LTASWTYPGS EDNSTVIYCE KGSITLYADP KFSMIIRYAN GQKAYFELDT MQTNERQTKS
GVVDEFIDCI LTNTPPRISG EEGLKTMKVV FACFESAKTG KIVRIDY