Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0221 |
Symbol | |
ID | 4076254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 236023 |
End bp | 237453 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005515 |
Product | histidine kinase |
Protein accession | YP_612216 |
Protein GI | 99080062 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.641728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATC CTGCGTATGA TGCCATGACC CCAGAGGAGT TGCGCGCGGC TTTGATGCAA GAGCGCAGGT TGCGCGCAGA ACTCGAAGAA TCCATGGAGC GCAAGTCCCA ATCGCTGCGC CGGGCGAACG CGTCCCTCGT GTCCTTGGCC CGACAGATCG ACCGGATGGT CAAAGAGCGC ACCGAAGACC TCGCCCGCGA CCGGGCCGAG GCGGAGGCCG AAAGTGCCGC AAAGACCCAG TTTCTGGCCA CGATGAGCCA TGAAATCCGC ACCCCGCTCA ACGGTGTTCT CGGGATGGCA TCGGCACTTG CGGATACCGA ACTGGCCGAG CGTCAATCGC AGATCCTTGG GGTTCTACGC GAAAGCGGGC AATTGCTCCT GAACATCGTT GATGACATTC TCGATCTCTC CAAGATCGAA GAGGGCAAGC TCGAACTCGA ACGTCTGCCG ACAGATGTGG TAGCTTTGAT CGAGACGGTT TATCGCCAGT TCAAACCGCG TATCGAGGCC AAGGGGCTCA ATTTCAGCTC GATCTATGGC GAGGGGTTGG CGCAGGGCCC CGCCTGGGTC CACATCGATC CCACCCGATT CCAGCAAGTG CTGACCAACC TGTTGTCCAA TGCGATAAAA TTCACCGACA CGGGTGCGAT TGCGCTTTCG TCCAGCCTTG TCTTGCAGCA CGACGGTGAG CTCATTTTAT GCGTTGCCGT GCGCGATACC GGCGTCGGGC TCACGGATGC GCAAGTGGAG CGCTTGTTCC AACCCTACAT GCAGGCCGAT GCGAGCGTGG CGCGCAGCCA TGGCGGCACC GGTCTTGGCT TGGCCATCGC GCGCCAGATC TGCGAGCGCA TGGGGGGCAC CCTGACCTGC CGCAGCCATG AGGGCGATGG CAGCGAATTC TGCGCGACCT TTCGCGCCGC TCCGGCAGAG CCCGTGGTCT CGTCCCATGA CCCCCAGGAC GAGAACGAAC TGGAGGCGCT GAGGGCGCAA CGCTGGAAGG TGCTGGTGGC CGAGGACAAC CGAACCAACC GGATGGTTCT CCACCATATC CTGCGCGGCT ATGACCTGGA TTTGTACTGG GTGGACAATG GCGAAGAGGC GGTAAAAACC TGGCGTCAAG AACGGCCGGA TCTGGTGCTG ATGGATGTGA ATATGCCGCA GCTCGACGGG GTGTCTGCGA CCTCTGTAAT CCGTCAAGAA GAGGCTGATG CACAAGCCGC GCCGGTGCCG ATCATTGCGG TGTCGGCCAA TGCGCTGGTG CATCAGATCA AAAGCTACCT CGCGAGCGGC ATGACCGACC ATGTCGCCAA ACCGATCCGC AAGAAAAAGC TGTTGTCAGC CATGGCGCGC GCGTTGAAGT CCCGCCCAAC GCTGCCCCCC GTGCCAGAAG AACCAGCGGA CAAAGGCGCA GCGCGGGTGC TCTCGCCCTA G
|
Protein sequence | MSDPAYDAMT PEELRAALMQ ERRLRAELEE SMERKSQSLR RANASLVSLA RQIDRMVKER TEDLARDRAE AEAESAAKTQ FLATMSHEIR TPLNGVLGMA SALADTELAE RQSQILGVLR ESGQLLLNIV DDILDLSKIE EGKLELERLP TDVVALIETV YRQFKPRIEA KGLNFSSIYG EGLAQGPAWV HIDPTRFQQV LTNLLSNAIK FTDTGAIALS SSLVLQHDGE LILCVAVRDT GVGLTDAQVE RLFQPYMQAD ASVARSHGGT GLGLAIARQI CERMGGTLTC RSHEGDGSEF CATFRAAPAE PVVSSHDPQD ENELEALRAQ RWKVLVAEDN RTNRMVLHHI LRGYDLDLYW VDNGEEAVKT WRQERPDLVL MDVNMPQLDG VSATSVIRQE EADAQAAPVP IIAVSANALV HQIKSYLASG MTDHVAKPIR KKKLLSAMAR ALKSRPTLPP VPEEPADKGA ARVLSP
|
| |