Gene TM1040_2002 details


Gene Information

Locus tag: TM1040_2002
Symbol:
ID: 4077459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2106428
End bp: 2108020
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 62%
IMG OID: 638007317
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_613996
Protein GI: 99081842
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
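
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (which matches the reported 1593 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 2106428
END_BP = 2108020


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1593 530
```

Both values agree with the Gene Length and Protein Length fields of the record.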


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAGC TAAGAGAGCC GCACGCGGAC ACCTCAGCCA TTGATAGCCC TGCTCGCCCC 
ATTGCAAAGG CGGCGAACAA GCGACGAATT CAGCGGGTCA CCAGTTGGCT GCTGGTCCCC
CTGCCTGCCG CGGCCGCGTA TTTTGTCAAC GGGCTCCCCC TCTGGTGGGC CTTTGCCGCG
CTGAGCCTTG TGCTCGGTGC CATGGCCTGG GTCTCCCGCA AACTGCCGGA GTCGACGCGC
GATTACCTGC TGAGTTTCTG CTTCATCGCC CACTGTATCT TGCTCACCGC CTCGCTCAGC
GGTCACGCGT GGCAGCTTGA TACGCATATG ATGTTCTTTG CCGCCCTGGC CATCGTATCG
ACCCTTTCAA GTCCGCGCGC GCTGATCTTT GCAACGGTTC TGATCGCGCT GCACCACATT
TCATTCAGCG TCCTGATGCC CTCTCTCGTC TATCCGGGTG GCGGCATTGC CGAGAACCTG
CAACGCACTG TGATGCATGC GGTGATCGTG CTTCTGGAAG CCGGTGTTCT GCTGCTCAGC
ATGCTCAGGA GCCTTGCCGC CGACACGGAG CTGAAAACCC AGCAGAGCGC GGCCGAACAT
CAGGCGCAAG CGGCAGAACG CGCCGAAGCC CTCGCGATGC AAAGCCACAA GAACGCCGAG
CGCGTCGTCA GCATCGTCGG CGATCATCTG CGTGAACTCG CCGCCGGGCG GCTGGATTGC
AAGATCGACA CCTCTTTCCC ACAGGAATAT GCGCAGCTTC AGGAGAGCTT CAATTCCACC
GTCGATACGC TGAAAGGCAC CATCGAACAG GTCAAAGACG CCACCTATCG CCTCAGCAAA
GGCGCGACGG ACATCGATCA GGCCTCTGAA AATCTCTCCA ACCGCACCGA AAGCCAGGCC
GCGACGCTGG AGCAATCCGT GGCCGCCCTC GAAGAACTGA CCACTTCCGT CAAATCTTCG
GCCGAAGGCG CGCTCAGCGT TCAGCGCACG ATGGATGATG CCCGCTCCGA AGCCGTAAGC
AGCGGTGGCG TCGTGAAGGA TGCGGTCTCC GCCATGAGCG CGATCGAGGA CTCCTCCTCC
CAGATTGCGC GCAACATCAG CGTGATCGAC GACATCGCCT TCCAGACCAA CCTGCTGGCG
CTCAATGCCG GGGTCGAGGC CGCCCGTGCG GGCGAGGCCG GCAAAGGCTT TGCCGTGGTC
GCCGCCGAGG TGCAGGCGCT TGCGCAGCGG TCGGCTGATG CCGCAACCGA GATCAAGAGC
CTGATCTCGC AGAGTTCCCA GCACGTTGAT CACGGGGTTG ATCTGGTGGG TCAGGCTGGC
GAGGCGATCG AGAAGATCGT TGAACGCGTC GAGCAGATCT CTGAACTTGT GTCGGGCATC
GCCACCAGCG CGGCAGAGCA GTCTTCCGGG CTGGGCGAGA TCAACACCGG GATGAGCCAG
CTGGATCAGG TGACCCAGCA GAATGCGGCC ATGGTGGAAC AGGTCAGCGC AGCCAGTCAT
CTGCTGCATT CGGACTCAAA GCGGCTTGCA CAGTTGATGG CGCATTTCGA GACGGGCGCC
GCCGACCAGA CCCAAAGCGC CGCTGCGGCC TGA
 
Protein sequence
MQKLREPHAD TSAIDSPARP IAKAANKRRI QRVTSWLLVP LPAAAAYFVN GLPLWWAFAA 
LSLVLGAMAW VSRKLPESTR DYLLSFCFIA HCILLTASLS GHAWQLDTHM MFFAALAIVS
TLSSPRALIF ATVLIALHHI SFSVLMPSLV YPGGGIAENL QRTVMHAVIV LLEAGVLLLS
MLRSLAADTE LKTQQSAAEH QAQAAERAEA LAMQSHKNAE RVVSIVGDHL RELAAGRLDC
KIDTSFPQEY AQLQESFNST VDTLKGTIEQ VKDATYRLSK GATDIDQASE NLSNRTESQA
ATLEQSVAAL EELTTSVKSS AEGALSVQRT MDDARSEAVS SGGVVKDAVS AMSAIEDSSS
QIARNISVID DIAFQTNLLA LNAGVEAARA GEAGKGFAVV AAEVQALAQR SADAATEIKS
LISQSSQHVD HGVDLVGQAG EAIEKIVERV EQISELVSGI ATSAAEQSSG LGEINTGMSQ
LDQVTQQNAA MVEQVSAASH LLHSDSKRLA QLMAHFETGA ADQTQSAAAA
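
The sequences above are listed in blocks of 10 characters, 60 per line. The sketch below shows one way to join such a listing into a single string and compute a GC percentage from it. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first and last lines of the gene sequence are used as a sample. The 62% figure in the record is the database's own value and is not recomputed here.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence listing, used as a small sample.
first_line = "ATGCAAAAGC TAAGAGAGCC GCACGCGGAC ACCTCAGCCA TTGATAGCCC TGCTCGCCCC "
last_line = "GCCGACCAGA CCCAAAGCGC CGCTGCGGCC TGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head), len(tail))                          # 60 33
print(head.startswith("ATG"), tail.endswith("TGA"))  # True True
```

The sample begins with ATG and ends with TGA, a start codon and a stop codon under translation table 11. Passing the full listing through `clean_sequence` and then `gc_content` would give a figure to compare with the GC content field.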