Gene TM1040_0112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0112 
Symbol 
ID4078697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp117301 
End bp119205 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content63% 
IMG OID638005399 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_612107 
Protein GI99079953 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAGA ACACGGCACA ACGCATGGCG ATGATCGACG CAGGCCTGAA CCTCATTGCG 
CAGGCCATGT CGATCTACGA CCGCAACCTG CGCCTTGCGG CCTGCAACCA TCGCTTTCAG
GAAATGTTTT CACTGCCGCC CGAATTGGTG CAAGAGGGCG CGACGTTCGA AGACACCATC
CGCTTTATCG CGGTACAGGG CGACTATGGC CCCATTGATG ACATCGATTC GTTTGTGCAG
CTGCGGGTGG AACAGGCCCG CGCCTTTGTG CCCCATTACG TCGAGCGCCA GCGTGCCAAT
GGCCATGTGA TTTCGATCGA GGGCGCGCCT CTTCCGGAGG GGGGCTGGGT CACGGTCTAC
ACCGACATCA CCCGCACCAA GCGGGTTGAA GAATTGCTCC GCGCCCGCTC CGAGGAGCTC
TCGGATCAGG TGATGAGCTA CACAGAAGAG CTCTCGGCCA CCAACCGAAA GCTTGCAGCC
ACGATCACCA CACTTGAAGA GACCCAGCGC CAGCTGACGC AGACCGAGGC CCGCACCCGG
CTCACGACCG AAATGATGCC CGCCCATATC GCGCATGTGG ATGCGGACGG GCATTATACC
TTTACCAACG GACGTTTGAG CAAAGTGTTT CCCGGCCGCC CCTCCGACAT TCTCGGCCAG
CATATTGCCG AGGCGCTCGG GTCCGCCGCC TATGCCCGCA TCGCACCGCA CCTCTCGGCG
GCCTACCAGG GTGAAAGCCC GGTGTTTGAG TTCACCGAGG ATCAGGACAG CCGCCGCCTG
CGCGTGGCGT TTACGCCCGA CAACGAAGGC GGGGTGTTCA TCCTGTCGAT GGATGTGACC
GAGGAAACCC AGACCCGCGT GGCCCTGCAA CAGGCCCGCA AGCGCGAGAT CGCCGCGCAG
ATGACCAGCG GGCTTGCGCA TGATTTCTCC AACCTTCTGA CCATCATCCT CGGAATGCAG
ACACGCCTTG ATCGGATGAC CCTGCCCGAA GGCGCGGGCG AGCTCGTGGA GGCCACCCTG
TCAGCCGCGC GGCGCGGCGG GCGGCTCTTG GATCGGATGG CTGAAATGAC CAGTCACCGT
GGCCTGCGCC CGCAAGCCAC CGACCTGCAC GCGCTCTTGG ACGAGATGAA GATCCTCGCC
ACGCCGTCAC TACCACAGGG CATCGGTCTC AGCGTGCTGG ACAACACAAG CGAGGGGCCG
GTCCTTTTGG ATCCGGGGCG GCTGCAGGAC GCGCTTTTGA ACCTGATTTT GAATGCGCGC
GACGCCTGCG GCACCAGCGG CCAGATCACG GTTGCCGCGC ATCTGGTCGG GCAGACCTGG
ATCGAATTCT CCGTAAGCGA CACAGGCCCC GGCTTTTCAG TGCAAGCGCT TGAGAATGCG
CTCAACCCGT TTTTCACCAC CAAGGGGCAA GAGGGCTCGG GGCTGGGGTT GTCGATGGTC
TATGACATGG TGAAATCCGC CGGGGGGGAC ATCCGCATCA GCAACACCGT CTCCGGCGCG
ATGGTGACGC TGCGCCTGCC CTACCGCCCA GCGCCGATGG CGGCCGGCGG CATTGCCCTT
CTGGTCGAAG ACAGCGACAC GCTTCGGGCG ACCTATCGTC AGGTGCTGAT GGATCTTGGA
TACTCCGTCA TCGAGGCCAC CAGCGTTGAT GAGGCCGTTG CGCTTTTGGC GGATGTCGAA
GGGATTGCAC TGATCCTGTC GGACATCAAA CTGGAAGGCG ATGCCACCGG CGTCGATCTC
TGCACCCGGC TGGGGCCGGA TGCGCCACCC GTGGTACTGA TGACCTCGTT GCCGCATACC
GATCCGCTCT ATCGCGCGGC CCTCACTCTG GCGCCACTCT TGCCAAAACC CTTTGAAAGC
GCGCATCTGA TGGCGCTCCT GCAACAAAAG GCCGACCATG CCTGA
 
Protein sequence
MKQNTAQRMA MIDAGLNLIA QAMSIYDRNL RLAACNHRFQ EMFSLPPELV QEGATFEDTI 
RFIAVQGDYG PIDDIDSFVQ LRVEQARAFV PHYVERQRAN GHVISIEGAP LPEGGWVTVY
TDITRTKRVE ELLRARSEEL SDQVMSYTEE LSATNRKLAA TITTLEETQR QLTQTEARTR
LTTEMMPAHI AHVDADGHYT FTNGRLSKVF PGRPSDILGQ HIAEALGSAA YARIAPHLSA
AYQGESPVFE FTEDQDSRRL RVAFTPDNEG GVFILSMDVT EETQTRVALQ QARKREIAAQ
MTSGLAHDFS NLLTIILGMQ TRLDRMTLPE GAGELVEATL SAARRGGRLL DRMAEMTSHR
GLRPQATDLH ALLDEMKILA TPSLPQGIGL SVLDNTSEGP VLLDPGRLQD ALLNLILNAR
DACGTSGQIT VAAHLVGQTW IEFSVSDTGP GFSVQALENA LNPFFTTKGQ EGSGLGLSMV
YDMVKSAGGD IRISNTVSGA MVTLRLPYRP APMAAGGIAL LVEDSDTLRA TYRQVLMDLG
YSVIEATSVD EAVALLADVE GIALILSDIK LEGDATGVDL CTRLGPDAPP VVLMTSLPHT
DPLYRAALTL APLLPKPFES AHLMALLQQK ADHA