Gene Athe_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2149 
Symbol 
ID7408342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2279681 
End bp2281681 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content38% 
IMG OID643716514 
ProductCheA signal transduction histidine kinase 
Protein accessionYP_002573997 
Protein GI222530115 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATGT CTCAGTATCT TGGAATGTTT ATAGAAGAGG CAAGAGACCA CATTCAAAGC 
CTTAACGACA ATATGCTAAA ACTTGAAGAA AACCCGGAGG ATTTGCAAAT TGTAAATGAG
ATTTTCAGGT CAGCCCATAC TTTAAAAGGC ATGGCCGGCA CAATGGGATT TGTCAATATG
CAAAAGCTTA CACATGCGAT GGAAAATGTT CTTGCTGCTG CACGCGATGG CAAGTTAAAA
GTAAATCCTA ATATCATGGA TATTCTTTTC AAGACAGTTG ATGCGCTTGA ATCATACTTA
GATGTTATAA TTGCAACAGG TACAGAAGGA CAAGAGACAA ATTTGCATCT TGTCAATGCT
TTAAATGCTA TTTTAGGGAA ACCTGCCGAA GATGTGGCGG TGTCTTCAGC AACAAAAGCA
GGTAAGAAAT ATGAATATGA TGAGTTTGTT GTAAGAGCAA TAGAACGTGC TTGGGACCAA
GGGTTTAATG TTTACAGATT TGATGTTGAG CTTGACCAGA ACTGTCTTTT AAAATCTGCA
CGTGCATACC TTGTTTTCAG AGCGGTTGAG GAACTTGGTG AAATTATTCA TTCTAAACCC
TCGGTTCAGG ACATTGAGGA TGAAAAGTTT GATTTTGAGT TTTCTATTAC CGTTATAAGC
AAGCAGCCGA TTGAAAAAAT AAGAGATAGA ATTCTTTCAA TTTCAGAGAT AAGAGAAGTA
AAAGCGCTTG AGATAAAGTC TGGCGAAGTA AGTATGGCAG AAGAAAAAGA GGAGATTGAA
GAGGTACAGC AAGAGACACA AGTACAGGAA ACTGTAAAGG TTGTAAGGCA GCAGAAACAA
GAATCTTTGC AGAAGACAAG CAAAACAGTT AGAGTTGACA TTGAAAGATT AGATGTTCTT
ATGAACTTGG TGAGCGAGCT TATTATAATC AAAAGCCGAA TAGAAGGACT TGCAAAAAAG
TATAACGATA GACAATACGA AGAGTCTATT GAGTATTTGG AAAGAATTAC AACAAGTTTA
CACGATGCTG TAATGAAGGT ACGAATGGTC CCGGTTGAAA GGGTATTTTC ACGTTTTCCA
AGGATGATGA GAGATTTAGC AAGAGAACTT GGAAAGGAAT TTGAGCTTGT AATGTCTGGT
GAGGATACTG AGGTTGACAG GACTATTGTG GACGAGCTTG GAGATCCTCT TATTCATCTT
CTGAGAAATG CTGCCGACCA TGGAATAGAA GACCCTGATG AGAGGGTCAA AAACGGCAAA
CCAAGAAGTG GGCTTATTAA ACTTTCGGCT TATCATGACG GGAACAATGT TGTCATTGAG
GTTGAAGATG ATGGCAAAGG AATTGATTTA GAAAAGGTAA AGCAAAAAGC TATAGAAAAG
GGGCTTTTGA AAGAGGACCA AATAGAATTG ACAGAGCAGG AAATAATAGA TTTTCTGTTT
ATGCCAAGCT TTTCAACAAA AGACAAGGTT ACAAACCTTT CTGGACGTGG TGTTGGACTT
GATGTTGTAA AGACCAAGAT TGAACAGCTT GGTGGAATGG TTGAAGTGAA GACACAGAAA
GGAAAAGGAA CAAAGTTTGT TATACGACTT CCGTTAACTC TTGCTATCAT TCAGGCATTG
CTTGTCACTG TACATGATGA GATATATGCA ATTCCTGTTG CATCAATCAG AGAGATTGTG
GATGTTGCAA AAGAGGATAT AAAGGTTGTT CAAAAAGGAA AGGAAATAAT CATGCTGAGA
AATCAGGTGA TTCCAATAAA ACATTTACAT TCTATTGTGG GGTTAGAGCC AGTTCTTGAC
AAAAAGAAAT TTACAGTTGT GATAGTAAGA CGTGGCGAAA AACTGACAGG AATCATTGTT
GACAAACTTT TAGGACAACA GGATATTGTT ATAAAATCGC TTGGAAAGTA TTTAGAGGGA
GTTAGACTCA TATCAGGTGC AACAATCCTT GGCGATGGGT CTGTTGCAAT GATACTTGAT
CCTAACATGC TTACAGTGTA A
 
Protein sequence
MDMSQYLGMF IEEARDHIQS LNDNMLKLEE NPEDLQIVNE IFRSAHTLKG MAGTMGFVNM 
QKLTHAMENV LAAARDGKLK VNPNIMDILF KTVDALESYL DVIIATGTEG QETNLHLVNA
LNAILGKPAE DVAVSSATKA GKKYEYDEFV VRAIERAWDQ GFNVYRFDVE LDQNCLLKSA
RAYLVFRAVE ELGEIIHSKP SVQDIEDEKF DFEFSITVIS KQPIEKIRDR ILSISEIREV
KALEIKSGEV SMAEEKEEIE EVQQETQVQE TVKVVRQQKQ ESLQKTSKTV RVDIERLDVL
MNLVSELIII KSRIEGLAKK YNDRQYEESI EYLERITTSL HDAVMKVRMV PVERVFSRFP
RMMRDLAREL GKEFELVMSG EDTEVDRTIV DELGDPLIHL LRNAADHGIE DPDERVKNGK
PRSGLIKLSA YHDGNNVVIE VEDDGKGIDL EKVKQKAIEK GLLKEDQIEL TEQEIIDFLF
MPSFSTKDKV TNLSGRGVGL DVVKTKIEQL GGMVEVKTQK GKGTKFVIRL PLTLAIIQAL
LVTVHDEIYA IPVASIREIV DVAKEDIKVV QKGKEIIMLR NQVIPIKHLH SIVGLEPVLD
KKKFTVVIVR RGEKLTGIIV DKLLGQQDIV IKSLGKYLEG VRLISGATIL GDGSVAMILD
PNMLTV