Gene TM1040_0406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0406 
Symbol 
ID4078800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp416325 
End bp418301 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content60% 
IMG OID638005701 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_612401 
Protein GI99080247 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.135462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAAT CTGTCCGCTT GCGCCTGCTC ATTCTGGCGC TGCTACCGCT TATCGTGTTG 
ATGCCGCTTT TGATGGTGGT GGCGATGGCG CGTTGGAATG CGGATTACGA TAAAATATTG
ATCGAAAAGG TCGAAAGCGA TCTGCGCATC GCCGAGCAAT ATCTCGGCCA CCTTCAGAGC
AACTCGGCGC GTGCGCTGCA AGGACTTGCC CAATCAGCCG AGATCGCAGA CGTCCTCACG
CGCGACCCAG GTTTAACGCC AGCCTTTCTG GAGCAAAGCC AGGATCGCCT CGGACTCGAC
TTTCTCTATC TTCTACAAGG CGATGAGCTG ACTGACGCCA AAGAAAAATG GCCGGTCATC
TTTCAAGCCG CCGCTGGCCG CATGGTCTCG GAGATCGATA TTCTCTCGGC CAGCGATCTG
GCGGCCCTTT CCCCTGCGCT GGCGGCAGAA GCCCGGATTG AGCTGATCGA GACCGAAGCC
GCCGCCCCCG CCGATCACAC CGTGGAGGAT CGGGGTATGG TGGTGCACAC CGCCGCCCCA
GTGCGCGTGA ATGGCGAGAT CCGGGTGTTG GTGGGGGGGA TCTTGCTCAA CCGCAACCTT
GATTTCATCG ATACGCTCAA CGAACTCGTC TATCTGCAAG GACAGTACAG CGATGGGCGA
CAAGGCACCG CCACCCTGTT TTTATCAGAC ACCCGCGTCT CCACAAATGT GCGCCTCTTC
GAGGACGTGC GTGCGCTTGG CACGCGGGTG TCTGCTGCCG TACGTGACGC GGTGCTCGGC
AAGGGGCGCA CATGGCTAGA CAGCGCCTTT GTCGTGAATG ATTGGTACAT TTCGGGGTAT
CTGCCGATCA GCGACAGCTT TGGCAAACGG GTCGGCATGC TCTACGTCGG CTTCCTGGAA
GCCCCCTATG AGACCGCAAA ACGCGCGGCT TATTGGACGG TGGTTGGGGC ATTTGTGCTG
GTGATCGCAC TCACCGCGCC GCTGTTCTTG TGGTTGGCCA AAGGGATCTT TGCCCCACTC
GAGCAGATGA CGCGCACCAT GGCGAGGGTG GAGCGCGGCG ACCTCACCGC CCGCAATCAG
ACCTCGGCGC GCGGTGACGA GATCGGTCAG GTGGCCAATC ACCTCAACAC ATTGCTTGAT
CAGGTCCAGG AACGCGACCG GCGGCTGCGC AACTGGGCCG GAGAGCTGAA CCTGCGCGTC
GAAGAGCGCA CGCGAGAGTT GCAAGTCGCA AATGCCAAAC TCGAAGAGAC CTTTCGCCAG
CTGGTGATGA GTGAAAAACT GGCCTCGATC GGGGAAATCA CCGCAGGCGT CGCCCATGAG
ATCAACAATC CCGTCGCGGT GATCCAGGGC AATGTGGATG TGATGCGTAT GACCCTTGGC
GACGCAGCAG AGCCGGTGCG CACCGAGCTC GATCTGATCG ACAATCAAGT GATGCGCATC
GGGGCCATCG TGGGCAAACT CTTGCAGTTC GCGCGCCCGT CGGAGTTTGG CACCTTTGCC
GAGCATGTGG ATGTGGCCGC CGTGGTCCGC GACTGTCTTG TGTTGGTGGA TCACGTCATC
TCGAAACAAG AGGTGCGGGT CCTCACACAG TTTGACGCCG CGCCACACGT CCGCATCGAC
CCGGGCGAAT TGCAACAGGT GGTGATCAAT ATCATCATCA ACGCCAGCCA GGCGATGGAG
GGCAAAGGCA CGCTTGCGAT CTCTGTTGCG GGCGAGGTAC GCGATGCACA GACAGGTGTT
GCCCTGCGGA TCGCCGATAC CGGCCCCGGC ATCCCCATGG AGCGGATCGA TCACATATTC
GACCCTTTCT TTACGACAAA ACAGGGCGAA GGCACGGGGC TTGGGCTGTC CATCAGCCAG
ACCCTCATTC AGCGGGCTGG CGGGCGCATC ACGGTGCGCA ACAGACGCGC GCGGGGGGCG
GAGTTCCTGA TCTGGCTGCC AGAGCCAACT GACGCGGATG ACCCGGATCA GAGATAA
 
Protein sequence
MLKSVRLRLL ILALLPLIVL MPLLMVVAMA RWNADYDKIL IEKVESDLRI AEQYLGHLQS 
NSARALQGLA QSAEIADVLT RDPGLTPAFL EQSQDRLGLD FLYLLQGDEL TDAKEKWPVI
FQAAAGRMVS EIDILSASDL AALSPALAAE ARIELIETEA AAPADHTVED RGMVVHTAAP
VRVNGEIRVL VGGILLNRNL DFIDTLNELV YLQGQYSDGR QGTATLFLSD TRVSTNVRLF
EDVRALGTRV SAAVRDAVLG KGRTWLDSAF VVNDWYISGY LPISDSFGKR VGMLYVGFLE
APYETAKRAA YWTVVGAFVL VIALTAPLFL WLAKGIFAPL EQMTRTMARV ERGDLTARNQ
TSARGDEIGQ VANHLNTLLD QVQERDRRLR NWAGELNLRV EERTRELQVA NAKLEETFRQ
LVMSEKLASI GEITAGVAHE INNPVAVIQG NVDVMRMTLG DAAEPVRTEL DLIDNQVMRI
GAIVGKLLQF ARPSEFGTFA EHVDVAAVVR DCLVLVDHVI SKQEVRVLTQ FDAAPHVRID
PGELQQVVIN IIINASQAME GKGTLAISVA GEVRDAQTGV ALRIADTGPG IPMERIDHIF
DPFFTTKQGE GTGLGLSISQ TLIQRAGGRI TVRNRRARGA EFLIWLPEPT DADDPDQR