Gene TM1040_3194 details


Gene Information

Locus tag: TM1040_3194
Symbol:
ID: 4075298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 187476
End bp: 189320
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 59%
IMG OID: 638004703
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_611430
Protein GI: 99078172
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
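
The coordinates and lengths listed above are mutually consistent if the start and end positions are read as 1-based and inclusive, and if the CDS length counts the stop codon. Both readings are assumptions inferred from the numbers in this record, not something the record states. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(187476, 189320)
print(length, protein_length(length))  # 1845 614
```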


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.311641
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACCA CTGAAATCAA ATCTTCCGCC CTTAAGGGAT GGTCCATTTT CACCAAGCTC 
AGCTTGTTGT TGGCAGCGGC AACGCTGGTG ATCGTAACCT TCATCACCGT CGCAAACAAA
TTGATCATTG ACCAAACCGT CCGGGAAGGT GTGCGTACCC TTGGTCAGAA TGTCACCTAT
TCAGTCGCCT CGCGCAGCGG CGGGGCGATC CGTTTTGGCG ATCAGGACAA GCTGCTCGCA
GATTTGTCTT TGGTCATCGA AATGTCCGAA GGGCGCTCGC TGAACGGGAT TGCGGTCAAC
GGGGACGGCA ACCTGATTGC CTCCGCTGGC GACGCAGGTG AAGCGCAACT TGACCTGCTC
TCCGAACTCG CTCTTGCGGC ATTTGAGAGC AACGAGATGC AAGTGAGCGC CAATGGTGAA
GCCTTTGCCG CTCCCGCCAC CACCTCCAAA GGTGTCCCCG TTGGGGCGAT TGCCATGACT
TGGTCAAGCA AAGCAGCCAT GATCGGCGCT CTTCAGCAAC AAATGATTGC CTATGGCGCC
GGTGCTGTGC TGTTTTTTGC GATGATGTTC ATTGCCTCCA TGGTCCTGCG CCGGATCGTC
AGCGCTCCTT TGAAGGATCT CGGCACCAGC ATCGTCGACA TTGCCGCAGG GCAGTATGAC
AAGCCTGTCG AGTATCTGGA ACGAGCGGAT GAGGTGGGGC GCATCGCCCG AAATGTTGAA
AACCTCAAGC AGCAGCTGGC CCTGGCACAC GCCAAGGAAA CAGAACGCGA AACCGCACAA
GCCCATCAGA AACACGTCGT CGATGCCCTG AACAACGCGC TGAAAAGCAT GTCTGACGGC
GACCTTACCC AGGCGATTGA TGATCCTTTT GCCGAGGAAT ACGAAACCCT GCGCCGCAAC
TTCAATACGA CGCGAGCCAC CATGGTCAGC ATCATCGACT CCGTCATAGA GAGCAGCGAG
CGCATTCGCT CAAGTGCGGA ACAGATCAGC GTCTCGTCTG CTGACTTGTC TCAGCGCACC
GAAAGCCAGG CTGCCACGCT TGAGGAAACC GCGGCCGCGA TGGAAGAGCT CAACGGCAGC
GTGCGCTCCG CAGCCGGTGG CGCGCGTGAA GTCGAAGGCA TCATGGAAGA AACCCGTGCC
ACTGCCGAAC AGAGCAGCAA GGTGGTTTCA GATGCAGTGG AAGCCATGTC CAACATTGAG
GCCTCCTCGG TCAAGATCTC CAAGATTCTG ACCATGATCG ACGACATTGC ATTCCAGACG
AACCTCCTCG CGCTCAACGC AGGCGTCGAA GCCGCGCGCG CCGGCGAAGC AGGCCGTGGC
TTTGCGGTTG TGGCCTCTGA GGTCCGCGCC CTCGCGCAAC GGTCATCGGA TGCCGCGCAA
GAGATTAAGC ACCTGATCGT CGAGAGCACG GAACAGGTTG GTGAAGGGGT GCGCCTTGTA
GGCCGTACGG GTGAGGAGCT CGACAAGATC ATCGGTCGTG TCGGGACGAT CTCCGGTCAC
GTCAGCGGCA TCGCAACCGG GGCAGAGGAA CAGTCGACCA CTTTGAACGA GATCAACACC
GGCGTTTCGC AACTGGATCA GGTGACGCAA CACAACGCGG CCATGGTTGA GGAAACGACC
GCTGCGAGCC AGGTGTTGCG CAATGATGCC ACGCAACTCG CGCGTGTCGT GGCGATCTTC
AAGACCGGTG ACACCAAGCG CAAATCCCAA TCAGAACCTG AAGCCCCCAC TGTGGCACAG
GCAGATGTGT CGCATGCCTA CGCGGCAGAA GAATCCCCCC AAGAGGCGCC CGCGGCAGAC
GCGCAGATTC TGCAGAAGAA AGCGGTGGGG TGGGAGGACT TCTAA
 
Protein sequence
MATTEIKSSA LKGWSIFTKL SLLLAAATLV IVTFITVANK LIIDQTVREG VRTLGQNVTY 
SVASRSGGAI RFGDQDKLLA DLSLVIEMSE GRSLNGIAVN GDGNLIASAG DAGEAQLDLL
SELALAAFES NEMQVSANGE AFAAPATTSK GVPVGAIAMT WSSKAAMIGA LQQQMIAYGA
GAVLFFAMMF IASMVLRRIV SAPLKDLGTS IVDIAAGQYD KPVEYLERAD EVGRIARNVE
NLKQQLALAH AKETERETAQ AHQKHVVDAL NNALKSMSDG DLTQAIDDPF AEEYETLRRN
FNTTRATMVS IIDSVIESSE RIRSSAEQIS VSSADLSQRT ESQAATLEET AAAMEELNGS
VRSAAGGARE VEGIMEETRA TAEQSSKVVS DAVEAMSNIE ASSVKISKIL TMIDDIAFQT
NLLALNAGVE AARAGEAGRG FAVVASEVRA LAQRSSDAAQ EIKHLIVEST EQVGEGVRLV
GRTGEELDKI IGRVGTISGH VSGIATGAEE QSTTLNEINT GVSQLDQVTQ HNAAMVEETT
AASQVLRNDA TQLARVVAIF KTGDTKRKSQ SEPEAPTVAQ ADVSHAYAAE ESPQEAPAAD
AQILQKKAVG WEDF
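
Both sequences are printed in groups of 10 characters, 60 per row, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The function names are hypothetical, and only the first and last rows of the gene sequence are pasted in here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGGCGACCA CTGAAATCAA ATCTTCCGCC CTTAAGGGAT GGTCCATTTT CACCAAGCTC"
last_row = "GCGCAGATTC TGCAGAAGAA AGCGGTGGGG TGGGAGGACT TCTAA"

row = clean_sequence(first_row)
print(len(row), row[:3])              # 60 ATG
print(clean_sequence(last_row)[-3:])  # TAA
```

Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the reported GC content.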