Gene Athe_1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1750 
Symbol 
ID7408537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1823676 
End bp1825040 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content33% 
IMG OID643716128 
Producthistidine kinase 
Protein accessionYP_002573617 
Protein GI222529735 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATAA AGGTTAAGAT TTATTTGTCG TACGTGGGTA TAATTTTATT CATGTTTATA 
GTTTTTGTTA TCCTTTTTTA TGCTTGGTTT GAAAAAAACT TGATTGACCA GATGAAAGAG
GATAACCTTC GCTTTGCAAG ACTTATAGAA TTTGTTGTTA CAAGCCAGAA CAAGAACACG
GTTAATTTGA AACCATTTGA GTCGTATTTT GAAAGTTTAG AGTCAAAAAT TTTACAGGAC
TATATTGTGA TTGTAAGCAG TGATGGTGCG ATAGAATACT CAAATAAAGA ACTGGATGTT
GAGTCAAGGG TAAAGATAAT GAAGCTTTTT GATGAGAACA AGGATGCTCA ATATGGATTT
GAGATAGGAA GCATTAGGGA AAAGCCCTGT CTGTTCACCA TATACAGTGG AAAATCAAAA
AGTGTTTTTG TAATTATTAT TACTGATCTT GCAAGGATTT TGAACTTTGA AAGAAGATTT
TTCCTTCTCA TTTTGCAAAC AGCAGGACTC AGCTGTTTAT TTGCTGCAAC TGTGGCCATT
TTTATATCTC GTAGAATGAT TGGAGGACTT TTGCGCCTTA AAGAAGCAAT AGAACAGGCT
TCTCAGATGA GGTTTAACAA AAAGGTTGAA GTAATTTCAA AAGACGAAAT TGGCTTAATT
GCTACCGAAT TTAACAGACT TATCGAAAAG ATTGATGAAT ACAATCAAGC ACAGATAAGA
TTTTTACAAA ATATTTCTCA TGAACTCAAA ACTCCTCTTA CTTCAGTGCG TGGATACGCA
GAGATGCTAA AAGAGGGAAT TTTAGATAAA AGCAAGATGG AATCAGCTGC AGATAAAATA
ATTTGGCATG TTGACAGATT GAAAAGTTTA ATAAACCAGA TTATTGACCT CACAAAGATT
GAATCTATTG AAAACTATTT TAATTTCGAA AAAAGTATGC TTGAAGAGGT CATTTTTGAA
GCTATTTTAG AAAATGAAGG TTATTTGCTT TCAAAGACAG TAGATATTGA ATTTGCTCCG
CAGACAAGGA CGTATGTCCA ATGCGACAAA CACAGACTAA AAGAAGCTTT TTCAAATATT
ATCTCAAACT GCATAAAATA TGCGAACAAT AAGGTTACAA TTGAAATAAA ATCTGAAAAA
GATAAATTTG AGGTAACAAT TGAGGATGAC GGAGAGGGTT TTGGGCAAGG AGAAATTGAT
AAAATCTTTG AAAGATTTTA TAAAGGGAAA AGGGGCGAAA GTGGACTTGG TCTTTCAATT
GCAAAAGCTA TATTCGAGAA ACATGGGTTT ATTATAGAAG CTGAAAACAA AATAACCAAA
GGCGCCCGCT TTAAGATAAA AGGAAAAATC TACAAAGAGC AGTAA
 
Protein sequence
MSIKVKIYLS YVGIILFMFI VFVILFYAWF EKNLIDQMKE DNLRFARLIE FVVTSQNKNT 
VNLKPFESYF ESLESKILQD YIVIVSSDGA IEYSNKELDV ESRVKIMKLF DENKDAQYGF
EIGSIREKPC LFTIYSGKSK SVFVIIITDL ARILNFERRF FLLILQTAGL SCLFAATVAI
FISRRMIGGL LRLKEAIEQA SQMRFNKKVE VISKDEIGLI ATEFNRLIEK IDEYNQAQIR
FLQNISHELK TPLTSVRGYA EMLKEGILDK SKMESAADKI IWHVDRLKSL INQIIDLTKI
ESIENYFNFE KSMLEEVIFE AILENEGYLL SKTVDIEFAP QTRTYVQCDK HRLKEAFSNI
ISNCIKYANN KVTIEIKSEK DKFEVTIEDD GEGFGQGEID KIFERFYKGK RGESGLGLSI
AKAIFEKHGF IIEAENKITK GARFKIKGKI YKEQ