Gene TM1040_0984 details


Gene Information

Locus tag: TM1040_0984
Symbol:
ID: 4078146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1052015
End bp: 1053841
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 60%
IMG OID: 638006288
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_612979
Protein GI: 99080825
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
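
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1052015
end_bp = 1053841
gene_length_bp = 1827
protein_length_aa = 608


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks hold for this record under the assumptions stated above.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```

Under these assumptions the span 1052015 to 1053841 covers 1827 bases, which is 609 codons: 608 amino acids plus the stop codon.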


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0666136
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.983928
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCTTAG CATCTTGGTT CCGGAACATG CCCGTGAAGC GTAAGATCCT GCTTCCTGGG 
ATGGCTGGAT TGCTCATGAT GATCTGCGTG ATCACGACTT ACTGGACTCA GCGCCTGTCG
ACGGCGCTCT ATAGCGGCTT TGAAGAGCAG GTCGCGCTGA CTGAGTCCTA CATCGCGGCG
CCCCTCGCGA CGGCCGCATG GAACTATGAT GGCGACTTGG CCAATACGAC ACTGGCTTCG
CTTGCCGAGC GCGAGTCCTT TGTGTTTGCG CGTGTGGTCA GCAGCGGCGA TGTGCTTGCC
GAAGCCTTCA AGGGAGAGGC GATGGAAGAG GCGTGGATCG CGCAATCTAC CAGTCTGCTT
GAATCCGATC AGGTGCGGCT GGAGGATGGC GATTTCACCT ACTTCAAGAC GCCCTTGATG
TTCGAAGGTG AAGAAGCCGG CAATATGGTC TGGGCGCTCG ACACCTCGAT CATTGCCAAC
CAGATCATGA ATGCCAACAT TATCGCGGCC AGCCTTGGTT TTGCGATTTT CGCAGGCTTT
TCGGTCGTCT TCTACCTCAT CGCAGTTGCG GTATCGCGCC CGATCGAGAA TGTGGTGACG
CATATTGACG CGCTGCAGCA CGGTGACACG ACGCGTGAAA TCCCCGAGGC GAACCGCCGT
GATGAAATTG GCGCATTGGG CAAGGCGCTG GTCGATTTCC GCGACACATC CGCCGAGCAG
AAGCGCATGG AAGAAGAAAA ACGCAAGCAG GACGCGGTGC AGGAACATGT CGTGACCGTT
CTCTCCGAGG CGCTTGGCAA GCTGTCCACC GGCGACCTCA CGGTGAGCAT CAAGGACGAT
TTCCCGGCGG ATTACGAGAA GCTCAGCAAG GATTTCAACG CTCTGGTCAA TCGCCTGTTC
GATACGGTTT CGGCTGTGGT CGATGCGGCG GACAGCATCC AGAACGGTTC CACCGAGATC
AGCTCGGCCT CTGACGATCT TGCGCGCCGC ACCGAGAGCC AGGCGGCCAC CCTTGAAGAA
ACCGCTGCTG CACTGGATGA GCTGACTGCA TCGGTGCGTC AGGCCGCCGA AGGGGCTGGA
AGCGTGTCCA ACACCATGGA AGAGGCAAAA GCCGAAGCCG TGAACAGCGG CACCATCGTC
AACAATGCCG TTTCGGCCAT GACCGAGATC GAGCAATCCT CGAATCACAT CTCTCAGATC
ATCGGCGTGA TTGATGACAT TGCCTTCCAG ACCAACCTTC TGGCGCTGAA CGCGGGTGTC
GAAGCCGCGC GTGCAGGTGA AGCAGGGCGC GGGTTTGCCG TGGTGGCCTC CGAGGTGCGC
GCCTTGGCGC AACGCTCCTC CGATGCAGCC ATGGAGATCA AGACCCTCAT CGGCGACAGC
TCCAAACAGG TGGAACGTGG TGTGGATCTC GTCGGCAAGG CAGGCGATGC GCTTCACAAC
ATCGTTGAGC GTGTCACCCA GATCTCCGGC CTGATCTCTG ACATTGCACA AGGCGCGAGC
GAGCAATCCG CAGGTCTTGG CGATATCAAC AGCGGCATGG TGGAACTGGA TCAGGTGACC
CAGCAGAACG CCGCCATGGT GGAAGAGGCG ACTGCCGCGA GCCATATGCT CAAGGCCAAT
GCGGTCAACC TCGCGCAGAT GGTTGCTCAT TTCCAGCTCG GCGCCGGTGG GCGCGCGGCT
TCTGCTGCTC CAGCCCCTGC TGCAAAGGAT GCCGAGACGA TCGCGCCGTC GGCCCATGGC
GAGGATTGGG ACTATACGCC CGAACCAAGT CAGGTGGCCG TCGCCAGCAG CGGCAACGCG
GCGGCGAAGA TCTGGGAAGA CTTCTGA
 
Protein sequence
MFLASWFRNM PVKRKILLPG MAGLLMMICV ITTYWTQRLS TALYSGFEEQ VALTESYIAA 
PLATAAWNYD GDLANTTLAS LAERESFVFA RVVSSGDVLA EAFKGEAMEE AWIAQSTSLL
ESDQVRLEDG DFTYFKTPLM FEGEEAGNMV WALDTSIIAN QIMNANIIAA SLGFAIFAGF
SVVFYLIAVA VSRPIENVVT HIDALQHGDT TREIPEANRR DEIGALGKAL VDFRDTSAEQ
KRMEEEKRKQ DAVQEHVVTV LSEALGKLST GDLTVSIKDD FPADYEKLSK DFNALVNRLF
DTVSAVVDAA DSIQNGSTEI SSASDDLARR TESQAATLEE TAAALDELTA SVRQAAEGAG
SVSNTMEEAK AEAVNSGTIV NNAVSAMTEI EQSSNHISQI IGVIDDIAFQ TNLLALNAGV
EAARAGEAGR GFAVVASEVR ALAQRSSDAA MEIKTLIGDS SKQVERGVDL VGKAGDALHN
IVERVTQISG LISDIAQGAS EQSAGLGDIN SGMVELDQVT QQNAAMVEEA TAASHMLKAN
AVNLAQMVAH FQLGAGGRAA SAAPAPAAKD AETIAPSAHG EDWDYTPEPS QVAVASSGNA
AAKIWEDF
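
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. The sketch below is a minimal example, not the database's own pipeline: it uses only the first and last blocks of the gene sequence listed above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons). The `translate` function and the `CODON_TABLE` name are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups and the final line of the gene sequence above.
first_block = "ATGTTCTTAG CATCTTGGTT CCGGAACATG"
last_block = "GCGGCGAAGA TCTGGGAAGA CTTCTGA"

print(translate(first_block))  # MFLASWFRNM
print(translate(last_block))   # AAKIWEDF
```

The two outputs match the first ten and last eight residues of the protein sequence listed above. The last block is in frame because it begins at base 1801, immediately after 1800 bases (600 complete codons).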