Gene Athe_1802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1802 
Symbol 
ID7408589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1874514 
End bp1875608 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content37% 
IMG OID643716179 
Producttranscriptional regulator, CdaR 
Protein accessionYP_002573668 
Protein GI222529786 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3835] Sugar diacid utilization regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000701672 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACAC AGAAGGTTAT TGATGTTTTA GAGCAGGCAA AAGACATCAT CGACGATGAA 
TTTGGATATA TTGAAGCTGA TGGTAGAGTT ATCTACAGCT CAAATCCACT TTCTCAAAAC
AGAATCAACA CAGTTGCAAT TGACATGATA AAAACAGACA CTGACCTTGA GATATTTGAG
GGTCGCACAT ACAAGGTTTA CAGAAGCCAG ACAGACACCT ATGTTCTTTA TATAAACAAC
ACAGAACCTC ATGCTGAAAA GCTTTTGGAT ATGTTAAACC TTGTTGTGAT GAAGGCAAAA
GAGCCGGCAT CTGCGTATGA TAAAAAGCTC TTTATAAAAA ATCTGTTGTA TGACAATATT
CTGCCAGGGG AGATTTACAC AAAGGCAAGA GAACTTCACA TTGCAACAGG TGCAACAAGG
GTTGTATTTG CTATCTATAT TCCAAATGCA AAAGAGATTA AAGACCTGAA TATCGGTGAG
ATTTTGACAA GCATATTCCC AAAGAGCACA AAAGATTTTA TTATCCAGCT TGACAACAAT
ATTCTGGTAT TCATAAAAGA GTTAAAACCA GGTTCAAATG ATGAGGATGC ATACAAGGTT
GCAAGGATTA TACTTGACAC GCTCAACTCA GAGCTTTTGC TCAAAGCGTA TATTGGAATT
GGATCTGTTG TTGATGACAT AAAAGAACTT TCGATGTCTT ATAAGGAGGC AGAAGCAGCG
CTCAAAATAG GCTACATCTT TGAAAAGGAC AAGTATATTG TGAGTTATCA CAAGCTCGGC
CTTGGAAGAC TTATATATCA GATGCCGACA AAACTTTGTG AGATGTTCTT GGAAGAGGTC
TTCAAGGATG TAAAACTTTC TGATTTTGAC CCAGAACTCA TACAGACTGT TGAGATGTTC
TTTGAATGCA ACTTGAATGT CTCAGAGACA GCAAGACAGC TTTATATTCA CAGAAATACC
TTGGTTTACA GACTTGACAA GATAGAAAGA ATGATAGGGC TTGACCTTAG AAAGTTCGAA
GATGCTATTA TCTTCAAAAT GGCTATGCTT GTAAATCAGT ATTTAGAGTA TACAAAGGGT
AACATTACAT TTTAA
 
Protein sequence
MMTQKVIDVL EQAKDIIDDE FGYIEADGRV IYSSNPLSQN RINTVAIDMI KTDTDLEIFE 
GRTYKVYRSQ TDTYVLYINN TEPHAEKLLD MLNLVVMKAK EPASAYDKKL FIKNLLYDNI
LPGEIYTKAR ELHIATGATR VVFAIYIPNA KEIKDLNIGE ILTSIFPKST KDFIIQLDNN
ILVFIKELKP GSNDEDAYKV ARIILDTLNS ELLLKAYIGI GSVVDDIKEL SMSYKEAEAA
LKIGYIFEKD KYIVSYHKLG LGRLIYQMPT KLCEMFLEEV FKDVKLSDFD PELIQTVEMF
FECNLNVSET ARQLYIHRNT LVYRLDKIER MIGLDLRKFE DAIIFKMAML VNQYLEYTKG
NITF