Gene TM1040_3209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3209 
Symbol 
ID4075313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp203191 
End bp205539 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content58% 
IMG OID638004718 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_611445 
Protein GI99078187 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.684679 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCAC TCTCTCAGCT AAAGATACGC TACAAATTAC CCGCGTTTCT TGTGGGTTTC 
GCGCTGCTGG CCAGCGCGAT TCTGGTGACG GTGAGCACCG TGAACTATCA ACGCAACGCA
TGGATAACAG TAGAGAATCG CTTCGAATCG ATCGCTGCGG ATCGCGCTTC AGTTCTCGAG
GCGCTTTTCC AGACCATGCG CGCGGATGTC GAAGTGCTTG CCGAGCTGCC CTCCACCGCA
ACCGCGGCAC AGCGGATCTC TGCGGCGTGG AGCGGGGTGA GCGATTCTCC GGCTGAGACG
CTGCGCCAGA TGTACATTTC TGAAAACCCG CACCCCCCAC ATGAACGCTT CATGATGGAA
CGCGCGCAAA AGACGATTCC CTACAACATC CACCACGCGA ACTTTCACCC CTCCTTCCGG
TCGCTGCTGA TTTCCAAGGG ATACAGCGAC GCCTATCTGG TGAATATCGC GGGGGACGTC
ATCTATAGCG TCGCCAAGCA GGACGACTAT GGCAGCAACC TCTTGTCCGG CCCGCATCAG
ACCAGCAATC TCGCCGCGAC ATTTAAGCGG ATCATGACCG CGGCACCCGA AGAGGTCCTC
TTTTCGGATT TTGAGCACTT TGCGCCGCGA GATGACACGC CGATGGCGTT TGTCGGCACC
CAGATCGTCG CCAGCAGTGG TCAGCTGGTT GGTGTCTTTA TCCTACAGAT CCCTGACACG
CTCGTGAGCG AGATTATCTC GCCCGCAGAG GGCTTGGGAG TGAGTACCGA GGTGTTTGTT
GTCGGAACAG ACGGCAAGGC ACGGTCGCAA TCCAGATTCG ACGACGGACA CAACGTTCTG
GAGGATCTGG CATTCTCTCT GCAAGAGCGG GCGGCGACCG AACCACAGAT CTTTGACAGC
AATGCCACAG GCATTCGCGG CAAACCCGTC GTTGCCATGG CACGTCAACT GCCGATTGAA
GAAAAAGTGT GGATTCTCGC CGTCGAGCAG GACCGCGACG AGGTTCTGGC ACCGGTTCGT
GGAGACCGGA TGCTCTTGAT CCTGACATCG CTTGCAAGCG CTCTTGTTAT GACAGGGATT
GGCTGGTGGT TTGCCCGCAG TTTCCTCAAG CCTATCGACG GACTTTGCGC GCGCATGAGT
GAGATCAGCG AAGGCCAACT TGATGCTCCT ATTCCCGAAG CGGATCGCGC AGATGAATTT
GGCCACATGG GGCAAATTCT GCGCACCATG CAGGGCGATC TACAGCGCGC CAAAGATGCC
GATGCCCACA GGCAACACCT GCAAGAGCAA CAAGCTGAAG TCGTGCGTCA CATCAGTGAG
GGGCTTGTGC AGGTTGCAAA CGGAAATCTC GCACATCGCA TCACCGAACC TTTTGATGCG
GACTACGAAA AACTCAGATC AGACTTCAAC TCCGCGCTAA CGGAATTGAG CGACGTGGTC
AGCCAGGTCA CCGAGACGGC GGATGGAATT CGCTCCGGCG CTGACGAAAT CAGTCAAGCA
TCAGATGACC TGTCGTCGCG CACGGAATCC CAGGCCGCGA CCCTTGAAGA AACGGTTGCT
GCTCTGGATG AGCTGACGGC AAGCGTGAAG TCAGCCGCTG AAGGCGCGCG AAACGTCGAA
GACATCGTGA AGCAGGCGCG CGGCGAGGCT GAAAAGAGCG ACGAGGTCGT GCGCAATGCC
GTGGATGCGA TGACGAAGAT CGAGACATCT TCCGCCAAGA TCTCCCAGAT CATTTCGGTG
ATCGACGATA TTTCCTTCCA GACGAACTTG CTTGCCCTGA ACGCCGGTGT TGAAGCTGCA
CGGGCCGGAG AAGCAGGTCG CGGGTTTGCA GTGGTCGCGT CAGAAGTGCG CGGTCTTGCG
CAGAGATCCT CTCAGGCCGC ACTCGAAATC AAGAACCTGA TTTCAGAGTC GACACAGCAA
GTGGGCGAAG GCGTCGAACT CGTTGACGCG GCGGGAGACT CCCTGAGATC CATCGCGGAG
CGGGTGTCGC ATATCTCGAG CTTGGTTTCG GAAATCGCCC AAGGCGCGAC AGATCAATCC
GCCGGATTGA GCGAAATCAA CGACGGTATG ACCCAGCTTG ACCAAGTGAC GCAGAAAAAT
GCCGCCATGG TCGAAGAGTC AACAGCCGCC AGCCACCTCT TGAAATCAGA TGCCAACAAA
CTGGCCGAGC TGGTTTCCCA CTTTGAGACG GGCCAAGCCT CTGCGCCAAA GCGCGCGCAA
CCGGCCGATG ACCGCAGGCC CGAAACGCCG CGAGCCTCGG CTCATGGCGA AGACATCGCC
TTTGATCCTC CACCGCCCAT TGCCACATCG ACAGGGTCCG CTGCGCGAGA CCTTTGGCAA
GACTTTTGA
 
Protein sequence
MIALSQLKIR YKLPAFLVGF ALLASAILVT VSTVNYQRNA WITVENRFES IAADRASVLE 
ALFQTMRADV EVLAELPSTA TAAQRISAAW SGVSDSPAET LRQMYISENP HPPHERFMME
RAQKTIPYNI HHANFHPSFR SLLISKGYSD AYLVNIAGDV IYSVAKQDDY GSNLLSGPHQ
TSNLAATFKR IMTAAPEEVL FSDFEHFAPR DDTPMAFVGT QIVASSGQLV GVFILQIPDT
LVSEIISPAE GLGVSTEVFV VGTDGKARSQ SRFDDGHNVL EDLAFSLQER AATEPQIFDS
NATGIRGKPV VAMARQLPIE EKVWILAVEQ DRDEVLAPVR GDRMLLILTS LASALVMTGI
GWWFARSFLK PIDGLCARMS EISEGQLDAP IPEADRADEF GHMGQILRTM QGDLQRAKDA
DAHRQHLQEQ QAEVVRHISE GLVQVANGNL AHRITEPFDA DYEKLRSDFN SALTELSDVV
SQVTETADGI RSGADEISQA SDDLSSRTES QAATLEETVA ALDELTASVK SAAEGARNVE
DIVKQARGEA EKSDEVVRNA VDAMTKIETS SAKISQIISV IDDISFQTNL LALNAGVEAA
RAGEAGRGFA VVASEVRGLA QRSSQAALEI KNLISESTQQ VGEGVELVDA AGDSLRSIAE
RVSHISSLVS EIAQGATDQS AGLSEINDGM TQLDQVTQKN AAMVEESTAA SHLLKSDANK
LAELVSHFET GQASAPKRAQ PADDRRPETP RASAHGEDIA FDPPPPIATS TGSAARDLWQ
DF