Gene Athe_2372 details

Gene Information

Locus tag: Athe_2372
Symbol:
ID: 7407791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2521508
End bp: 2523295
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 32%
IMG OID: 643716736
Product: histidine kinase internal region
Protein accession: YP_002574215
Protein GI: 222530333
COG category: [T] Signal transduction mechanisms
COG ID: [COG2972] Predicted signal transduction protein with a C-terminal ATPase domain
TIGRFAM ID:
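
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2521508, 2523295)
    print(length)                           # 1788
    print(expected_protein_length(length))  # 595
```

Both results match the Gene Length (1788 bp) and Protein Length (595 aa) fields of this record.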


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAGAA AAAATGGAGT TTATAAAAGC AAGATTTTCA AAAAGAATTT TGTGCAGATA 
GTTATTGTTC CCATTGTGAT AATAACTATT CTTGGGCTTT TCTCATGCAT CATTATAGAA
CAATACGTTA AAAATGAAAT AAACAAAAAT TTAGAGACAA TGCTAATACA AAGCAAAAAC
AATGTCGAGC TTATGCTCGG TGAGATAGAC TATCTTTATA TGGTATTTGG GATAAACAAA
GATGTGACCC TTCAGATAAA GAGGATTTTG AACTCAATGT ATTTTTCTTT AGAAGATATC
TGGCAGATTA ACATGGTCAA AAATGTTTTA AATTCAATCT CATATTCAAA GCCGTTCATA
CATTCCATCT ATGTTTATTT CGAAAATCCT GAAGGGAATT TTATAGTTAC CCCAGATGGA
ATGACTAATT TTCAGTATTT TTATGACAAA TGGTGGTTTG ATCAGTATAA AGAAAATAAA
GCATTAATGT GGGTAGAGAG AAGAAAAATT CAACCTTACA ATTTTACTGG AGAATCAATT
GATGTTTTGA CCATCTACAA AAGGATAAAA TCTGCATATT CTGATGTGAA TGAGGGTGTT
ATTGTTCTTA ATCTGTATTA CGACCAGGTA AAAAAGCTCT TAAGCCTTAA AAGTTCGCTC
CCTCAGCATG CAATGTACAT ATTAGATCAA AATGGAAATG TTTTGGTATC AAATGAATCA
GATAACTCTA ATACCTCAAG TATGGCCCTC CTAAAAAAAG AAACAGACAA CTATCTCACA
AAAAGATTAG AGTCAAAAAA ATACAACTTA ACCTTTGTTT CAGTAATTCC CAAAAATTAT
CTTTACAGCA TCCCTATCAG ACTTTTCAAG GTGACACTGG TGCTACTTTT AATTTTTATA
GTTATCGCTT TTGCTGCCTC ATACTACATT GCCAAAGTAA ATTACAGGAA TATTAAAAAG
ATTATAGATA CAATAAATTC AGCAACAGAA GGAAAACCAC CAAAAGAAAT TAAAATTACT
TCAAATGATG AATATGGGTA TATCATGTAC AATGTAATCA AGAACTTTAT TGAAAAACAT
TATCTGACAA CACGCCTTCA AGCTTTGGAG CTTTTAGCAT TGCAGGCTCA GATTAACCCT
CATTTTCTTT TCAATACTTT AGAACACATA TATCTTAAAA CTTTAGCACT TACAGGCACC
CCAAACGAGA TTACAAAAAT GATAGAAAAC CTTTCGGCTA TACTCAAATA TTCTCTGAGC
AATCCAAAAA TTACTATCTT CCTAAGGGAA GAAATTAAAG CTACACAGGC ATATATTGAG
CTTGTAAAAG CAAGATATAA AGATAAGTTT GATGTGTTTT GGGACTATAG TGAAGATGTG
CTTGAGATAA AAGTGATGAA GCTTTTATTC CAGCCGCTCA TAGAAAATTC AATCTATCAT
GGGATAAAAC CTTGCGAAAA GAGATGTGGA ATAAAAATCA GGATAAGAAA ATTAAAAGAT
ACCAGTGATT GGCTTTGTAT ATGGGTAATT GACAATGGAA TTGGGATGAG CAAAGAAAAG
TTAGAGGAGG TACAAGGCAG GCTTTCACAG GATTTTGACT TTTCAGATCA TATTGGGCTT
TTAAACACCA ATGAAAGGTT AAAGCTCAAC TATGGGGGTA ACTTTAAACT CAAGGTTTGG
AGCAAGCTGG GTTTGGGGAC AATTGTAAAA ATAATTCTTC CTGTGAATTT TGAGGACCGA
AAGGAGAATG AAATAGATGC TAAAAAGACA GGATATTTAT ATCCGTGA
 
Protein sequence
MIRKNGVYKS KIFKKNFVQI VIVPIVIITI LGLFSCIIIE QYVKNEINKN LETMLIQSKN 
NVELMLGEID YLYMVFGINK DVTLQIKRIL NSMYFSLEDI WQINMVKNVL NSISYSKPFI
HSIYVYFENP EGNFIVTPDG MTNFQYFYDK WWFDQYKENK ALMWVERRKI QPYNFTGESI
DVLTIYKRIK SAYSDVNEGV IVLNLYYDQV KKLLSLKSSL PQHAMYILDQ NGNVLVSNES
DNSNTSSMAL LKKETDNYLT KRLESKKYNL TFVSVIPKNY LYSIPIRLFK VTLVLLLIFI
VIAFAASYYI AKVNYRNIKK IIDTINSATE GKPPKEIKIT SNDEYGYIMY NVIKNFIEKH
YLTTRLQALE LLALQAQINP HFLFNTLEHI YLKTLALTGT PNEITKMIEN LSAILKYSLS
NPKITIFLRE EIKATQAYIE LVKARYKDKF DVFWDYSEDV LEIKVMKLLF QPLIENSIYH
GIKPCEKRCG IKIRIRKLKD TSDWLCIWVI DNGIGMSKEK LEEVQGRLSQ DFDFSDHIGL
LNTNERLKLN YGGNFKLKVW SKLGLGTIVK IILPVNFEDR KENEIDAKKT GYLYP
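
The sequences above are printed in space-separated blocks of ten characters across several lines. Before any analysis they are typically joined into one continuous string. The following is a minimal sketch of that step plus a simple GC-content calculation; the helper names are hypothetical, and the GC function assumes an unambiguous nucleotide alphabet (A, C, G, T only).

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence from this record, used as sample input.
    first_line = (
        "ATGATAAGAA AAAATGGAGT TTATAAAAGC "
        "AAGATTTTCA AAAAGAATTT TGTGCAGATA"
    )
    seq = join_blocks(first_line)
    print(len(seq))  # 60
    print(f"{gc_percent(seq):.1f}% GC")
```

Applied to the full gene sequence, `len(join_blocks(...))` can be compared with the Gene Length field, and `gc_percent` with the GC content field of the record.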