Gene TM1040_1228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1228 
Symbol 
ID4075936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1321904 
End bp1324213 
Gene Length2310 bp 
Protein Length769 aa 
Translation table11 
GC content61% 
IMG OID638006536 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_613223 
Protein GI99081069 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGGTC GCTCTTCTTC GATTGCCCCC AGCCTGAAGA TCCAGACGCC GGAGCAGGCG 
CGCTTGGCGG CGACCTTTCT GCTATCGGCG CTGTTGATCG TGTTGCCGTG GCTGTTGCCG
GTGCCGGAGT GGATCGGGCG GGCGCTGGTG GCAGTCGGGC TGACGCTGGG CGGTGTCTCT
GCGATGATCT TGATACAGAC CCGGGTCCGC ATGGGCGCGC GCACGATGGC GAGCGAGTTG
CTGACGGGCT TTATCGACAA GGACGCATCC GCCAGCTTCG TCACCGACGA GGACGGGGTG
ATCCATGCCT GCAACACCGC TGCGCTCAAG CGATTTGAGG GCAGCGAAAG TGATACGATC
TCGGGCACAT TGCGCTCGGT TCTGGCCAAT CCGTCCGCGG TGCTCTTCCG TTTGCAGAGC
CGTGCCAAGC TCGAAGGGTC GGCCCGCGAG GACATCGTGA CGCGGCGTGG AAACGTCCGC
ATTGCGGTGC ATGAGATGGC CAATGGCAGC TTCTTGTGGC GAGTTGAGGA CATGGGCGAA
CGCCCCTCCG GGCGCACGGC GGATGCCAGC CCGGTGCCGA TGATCTCGGT TGGACGCACG
GGCGCCGTCC TGTTCATGAA CGAATCCGCG CGCAGCCTGA TCGGAGAGCG CGTGAAATCG
ACAGATCGGC TCTTTACGTC ACTGCCGGTG ATCCCGGGGC GGATCAACAC GATCATGACC
AAAAGCGGCC CCGTTGAGGT GCTGGTCAGC GAACACGCGC GCGGCCAGGG GCGCAGCGAA
ATCTACTTCA TGAAAGCCGT TGTCGAGGAC AGTGGGCCTC ATACCGGGTT CGAGAGCCTG
CCGGTGCCGC TCCTCAAGGT TCGCGCGGGT GGCGAGGTGA TGGCGGTCAA CCGCATGGCC
ATGCGGCTTT TGGGCATCGA AGAAGCCGAA GGCGTGCAGT TGGGGCAATT GATGGAGGGC
CTTGGCCGCC CCATGAACGA TTGGCTGCAA GAAACGTCCA AGGGGCTTGC GCCGCATAAG
TCCGAGTTCC TGCGGGTGTC ACGCGACGAC AAGGAAGTCT TTGTGCAGGT GACGCTCACG
CGGATCATCG AGGATGGCGA GACGCTATTG CTGGCGGTTC TGAACGATGC AACCGAGCTG
AAAACCCTTG AGGCGCAATT TGTTCAAAGC CAGAAGATGC AGGCGATCGG TCAGCTCGCG
GGCGGCGTGG CACATGATTT CAACAATCTT TTGACTGCGA TTTCAGGGCA TTGCGACCTG
CTTTTACTGC GTCACGATCA GGGCGATCAG GATTATGGTG ATCTGATCCA GATCCATGAA
AACGCCAATC GCGCTGCTGC CTTGGTGAGC CAGTTGCTTG CGTTTTCGCG CAAGCAGACC
CTGCAACCCG AGGTGCTCGA TGTGCGCGAG ACGCTCTCGG ATCTGACCCA TCTGCTGAAC
CGTCTGGTGG GGGAGAAAGT CTCGCTCACG CTGAGCCATG ACCCGGTTAT GCGCTCCATT
CGGGCTGACA AACGCCAGCT TGAACAAGTT CTGATGAACC TCGTGGTGAA TGCGCGGGAC
GCGATGCCAC ACGGGGGCGA AATCCGCGTG GAAACTGAGG TAGTAGCGCT CGACACAGCA
CTTGAACGCG ATCGCGCCCT TGTGCCGCCG GGCGAGTGGG TCACCATCCA GGTGAGCGAC
GAGGGCACAG GGATCTCGCC GGACAAGCTT CAAAAGGTGT TTGAGCCCTT CTACACCACC
AAACGCACCG GCGAAGGCAC GGGGCTTGGG CTTTCCACCG CCTATGGCAT CATCAAACAG
ACTGGTGGCT TCATTTTTGT GGATTCCGTG CTCAATCAGG GCACCAAGTT CACGCTCTAT
TTCCCGGTTT ATCGCAGCGA TGCGTCACAC GACGTCGAAG AAGCGGTCGC ACCGGCCAAG
ACCAAGGCAC CGGCAGCACA ACACGGCGAA GGCGTGGTTC TCTTGGTGGA GGACGAAGCT
CCGGTTCGGG CCTTTGCATC GCGCGCGCTG CGCATGCGCG GCTATACGGT CCTTGAGGCC
GAATCCGCTG AAGAGGCGCT CGACACGCTC GAAGATCCAA ACCTCAGCGT TGATGTCTTT
GTGACCGATG TCATCATGCC CGGGATGGAT GGGCCGACTT GGGTGCGGGA AGCCCTCAAA
ACCCGGCCCG GCACCAAGGT GGTCTTTGTG TCCGGGTACT CCGAAGGGGC CTTTGGCGAT
GCGGAGCCCG ACGTGCCGAA TTCGGTCTTC CTGGCCAAAC CGTTCTCGCT CTCGCAGCTC
ACGGAAACCG TTCAGTCTCA ACTTCAGTAA
 
Protein sequence
MIGRSSSIAP SLKIQTPEQA RLAATFLLSA LLIVLPWLLP VPEWIGRALV AVGLTLGGVS 
AMILIQTRVR MGARTMASEL LTGFIDKDAS ASFVTDEDGV IHACNTAALK RFEGSESDTI
SGTLRSVLAN PSAVLFRLQS RAKLEGSARE DIVTRRGNVR IAVHEMANGS FLWRVEDMGE
RPSGRTADAS PVPMISVGRT GAVLFMNESA RSLIGERVKS TDRLFTSLPV IPGRINTIMT
KSGPVEVLVS EHARGQGRSE IYFMKAVVED SGPHTGFESL PVPLLKVRAG GEVMAVNRMA
MRLLGIEEAE GVQLGQLMEG LGRPMNDWLQ ETSKGLAPHK SEFLRVSRDD KEVFVQVTLT
RIIEDGETLL LAVLNDATEL KTLEAQFVQS QKMQAIGQLA GGVAHDFNNL LTAISGHCDL
LLLRHDQGDQ DYGDLIQIHE NANRAAALVS QLLAFSRKQT LQPEVLDVRE TLSDLTHLLN
RLVGEKVSLT LSHDPVMRSI RADKRQLEQV LMNLVVNARD AMPHGGEIRV ETEVVALDTA
LERDRALVPP GEWVTIQVSD EGTGISPDKL QKVFEPFYTT KRTGEGTGLG LSTAYGIIKQ
TGGFIFVDSV LNQGTKFTLY FPVYRSDASH DVEEAVAPAK TKAPAAQHGE GVVLLVEDEA
PVRAFASRAL RMRGYTVLEA ESAEEALDTL EDPNLSVDVF VTDVIMPGMD GPTWVREALK
TRPGTKVVFV SGYSEGAFGD AEPDVPNSVF LAKPFSLSQL TETVQSQLQ