Gene Athe_0081 details

Gene Information

Locus tag: Athe_0081
Symbol:
ID: 7407152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 102336
End bp: 104051
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 35%
IMG OID: 643714491
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_002572014
Protein GI: 222528132
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID:
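
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 102336
end_bp = 104051
reported_gene_length = 1716      # bp
reported_protein_length = 571    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

coordinates_consistent = (
    gene_length == reported_gene_length
    and gene_length % 3 == 0
    and protein_length == reported_protein_length
)
print(gene_length, protein_length, coordinates_consistent)
```

Under these assumptions the reported gene length, protein length, and coordinates agree with one another.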


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAAAA GCATTGAAAG TAGACTTATA ATTGTCTTTG GTCTATTGAT TTTGATACTC 
ATGTTTATAT CCAGCTTTTT TGTCATAGAC AGGACAAAAA GTTATTTTTA TGATGATATT
AAAAATAAGA TAGAATTCAT GGTGAGTTCA TCACTGATAA GGATTTTAGA GGATAAAAGC
CTTACACAGC AGAAAATTCA GGAAATAATT GACCAGTCAA TGAAGCAGAG TCAGTATGGT
TTTATGATAC AAAAACTAAT TGTGACAGAC AACAGAGGAA GACTCTTGGC ATCTTTTCCA
AGGATGGATA TCAGCTTTTA TCCTTCGGAT GAGATATTGA CAAGTCTTGC AGGCTATAAG
GTTATTAAAC GGGATTCAGA AAACCAGACT ATGATTTTTG CTTTTCCTAT AAAAAGTGGT
AAGTCTGTTG AAAGGTCATT ATATTTAGAA GTGTCTTGCC AAAGTATACT TGAAACTGTC
ACAGACATAA AAAATATTTT ATTTATGGCA TATGTTATTG GTATGGGTTT TTCACTATTT
ATTGGATTTT TGTTTGCAAA AACCCTTTCC AATCCGCTAA GGAAACTTAC CAGGCAGGCA
CTTGAGATGG CACAAGGCAA TCTTGATGTC AAGATTGAAA TTTCCAGTCA GGATGAGATA
GGCAAGCTTG CAAGTGCTTT TAAAATAATG GCAACAAATC TGAAAAGGTA TATAACAGAG
CTTGAGTTTG AAAAACAAAA GCTTGAAAGA ATACTTCAGA ACATGTCAGA TGGTGTTTTG
GCTATAAACT CGAGAAATGA GATAATTCAT ATAAATGAGA GTGCGAAGAG ATTTCTCAAA
GATGATATTC ATGGATTTTT GGACAAAATT CAAGCTCAAA AAAGTTCTGT AGTTTCTCAG
CCAATAATTT ATGAAGTTGA TGGCTATACA CTTGAGGTTA GCATAGCATT TTTTGTTGAT
TCTTTTCAAT CAACAGGTAT GGTATTTATA CTACATGACA TAACAGAACA AGCAAAGCTT
GACAGGATGC GCAAACAGTT TGTTGCAGAT GTCTCACATG AGCTGAGAAC ACCAATTACC
ACTATCAAAA CTTATTCTGA GACACTTTTA GATGTTGATG ATGAAAGTGT TAAAAGAGAG
TTTTTGACTG TAATAATAAA AGAATGTGAT AGGATGACAA GGCTTATATC TGACCTTTTA
TACCTCTCAA GGCTTGATAG TGGAGAGAAT ATACTGAGAA TAGAAGAAGT AAATATAAGT
GAGCTTGTGA GGTTTGTTTG TGAAAAGATG CGAATTCATG CCAATAAAAA ACACCAGAGT
CTTTTGTGCA ATGTGCAGGA GGATATTATA ATAGATGCAG ACAGAGACAG ACTTGAACAA
GTGCTAATAA ATCTTATTAA CAACGCTATT ACTTATGTTC AGGATGGTGG TAGGATAGAA
GTTTGTCTTA AAAAAGAAAA TGGAAATATT GAACTGACAG TTGAAGACAA TGGGCCTGGT
ATACCTAAGG AGGACCTTCC ACGGATATTT GAGAGGTTTT ACAGGGTTGA CAAGGCAAGG
TCACGAAGTC TTGGGGGTAG CGGGCTTGGT CTTTCGATTG CTGATGAAAT TGTAAAGGCT
CATGGTGGCA GGGTTTTGGT TGAAAGCGAA GAAGGCGTTG GAACAAAGTT TACAGTAGTG
CTTCCCTTAA AAGAAAAAGG ACAAGTGACT TTGTAA
 
Protein sequence
MTKSIESRLI IVFGLLILIL MFISSFFVID RTKSYFYDDI KNKIEFMVSS SLIRILEDKS 
LTQQKIQEII DQSMKQSQYG FMIQKLIVTD NRGRLLASFP RMDISFYPSD EILTSLAGYK
VIKRDSENQT MIFAFPIKSG KSVERSLYLE VSCQSILETV TDIKNILFMA YVIGMGFSLF
IGFLFAKTLS NPLRKLTRQA LEMAQGNLDV KIEISSQDEI GKLASAFKIM ATNLKRYITE
LEFEKQKLER ILQNMSDGVL AINSRNEIIH INESAKRFLK DDIHGFLDKI QAQKSSVVSQ
PIIYEVDGYT LEVSIAFFVD SFQSTGMVFI LHDITEQAKL DRMRKQFVAD VSHELRTPIT
TIKTYSETLL DVDDESVKRE FLTVIIKECD RMTRLISDLL YLSRLDSGEN ILRIEEVNIS
ELVRFVCEKM RIHANKKHQS LLCNVQEDII IDADRDRLEQ VLINLINNAI TYVQDGGRIE
VCLKKENGNI ELTVEDNGPG IPKEDLPRIF ERFYRVDKAR SRSLGGSGLG LSIADEIVKA
HGGRVLVESE EGVGTKFTVV LPLKEKGQVT L
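
The sequence blocks above are printed in groups of ten characters, so the whitespace has to be removed before any analysis. The sketch below is an illustration rather than the database's own tooling. It shows one way to clean such a block, compute its GC percentage, and translate it. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares; table 11 differs mainly in its set of permitted start codons, and this sketch does not handle alternative starts (this gene begins with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block):
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups of the gene sequence listed above.
first_groups = "ATGACGAAAA GCATTGAAAG TAGACTTATA"
print(translate(first_groups))  # MTKSIESRLI
```

The translated fragment matches the first ten residues of the protein sequence in the record. Passing the full gene sequence block to `gc_percent` allows a comparison with the GC content reported in the gene information.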