Gene TM1040_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0221 
Symbol 
ID4076254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp236023 
End bp237453 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content61% 
IMG OID638005515 
Producthistidine kinase 
Protein accessionYP_612216 
Protein GI99080062 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.641728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATC CTGCGTATGA TGCCATGACC CCAGAGGAGT TGCGCGCGGC TTTGATGCAA 
GAGCGCAGGT TGCGCGCAGA ACTCGAAGAA TCCATGGAGC GCAAGTCCCA ATCGCTGCGC
CGGGCGAACG CGTCCCTCGT GTCCTTGGCC CGACAGATCG ACCGGATGGT CAAAGAGCGC
ACCGAAGACC TCGCCCGCGA CCGGGCCGAG GCGGAGGCCG AAAGTGCCGC AAAGACCCAG
TTTCTGGCCA CGATGAGCCA TGAAATCCGC ACCCCGCTCA ACGGTGTTCT CGGGATGGCA
TCGGCACTTG CGGATACCGA ACTGGCCGAG CGTCAATCGC AGATCCTTGG GGTTCTACGC
GAAAGCGGGC AATTGCTCCT GAACATCGTT GATGACATTC TCGATCTCTC CAAGATCGAA
GAGGGCAAGC TCGAACTCGA ACGTCTGCCG ACAGATGTGG TAGCTTTGAT CGAGACGGTT
TATCGCCAGT TCAAACCGCG TATCGAGGCC AAGGGGCTCA ATTTCAGCTC GATCTATGGC
GAGGGGTTGG CGCAGGGCCC CGCCTGGGTC CACATCGATC CCACCCGATT CCAGCAAGTG
CTGACCAACC TGTTGTCCAA TGCGATAAAA TTCACCGACA CGGGTGCGAT TGCGCTTTCG
TCCAGCCTTG TCTTGCAGCA CGACGGTGAG CTCATTTTAT GCGTTGCCGT GCGCGATACC
GGCGTCGGGC TCACGGATGC GCAAGTGGAG CGCTTGTTCC AACCCTACAT GCAGGCCGAT
GCGAGCGTGG CGCGCAGCCA TGGCGGCACC GGTCTTGGCT TGGCCATCGC GCGCCAGATC
TGCGAGCGCA TGGGGGGCAC CCTGACCTGC CGCAGCCATG AGGGCGATGG CAGCGAATTC
TGCGCGACCT TTCGCGCCGC TCCGGCAGAG CCCGTGGTCT CGTCCCATGA CCCCCAGGAC
GAGAACGAAC TGGAGGCGCT GAGGGCGCAA CGCTGGAAGG TGCTGGTGGC CGAGGACAAC
CGAACCAACC GGATGGTTCT CCACCATATC CTGCGCGGCT ATGACCTGGA TTTGTACTGG
GTGGACAATG GCGAAGAGGC GGTAAAAACC TGGCGTCAAG AACGGCCGGA TCTGGTGCTG
ATGGATGTGA ATATGCCGCA GCTCGACGGG GTGTCTGCGA CCTCTGTAAT CCGTCAAGAA
GAGGCTGATG CACAAGCCGC GCCGGTGCCG ATCATTGCGG TGTCGGCCAA TGCGCTGGTG
CATCAGATCA AAAGCTACCT CGCGAGCGGC ATGACCGACC ATGTCGCCAA ACCGATCCGC
AAGAAAAAGC TGTTGTCAGC CATGGCGCGC GCGTTGAAGT CCCGCCCAAC GCTGCCCCCC
GTGCCAGAAG AACCAGCGGA CAAAGGCGCA GCGCGGGTGC TCTCGCCCTA G
 
Protein sequence
MSDPAYDAMT PEELRAALMQ ERRLRAELEE SMERKSQSLR RANASLVSLA RQIDRMVKER 
TEDLARDRAE AEAESAAKTQ FLATMSHEIR TPLNGVLGMA SALADTELAE RQSQILGVLR
ESGQLLLNIV DDILDLSKIE EGKLELERLP TDVVALIETV YRQFKPRIEA KGLNFSSIYG
EGLAQGPAWV HIDPTRFQQV LTNLLSNAIK FTDTGAIALS SSLVLQHDGE LILCVAVRDT
GVGLTDAQVE RLFQPYMQAD ASVARSHGGT GLGLAIARQI CERMGGTLTC RSHEGDGSEF
CATFRAAPAE PVVSSHDPQD ENELEALRAQ RWKVLVAEDN RTNRMVLHHI LRGYDLDLYW
VDNGEEAVKT WRQERPDLVL MDVNMPQLDG VSATSVIRQE EADAQAAPVP IIAVSANALV
HQIKSYLASG MTDHVAKPIR KKKLLSAMAR ALKSRPTLPP VPEEPADKGA ARVLSP