Gene TM1040_2647 details

Gene Information

Locus tag: TM1040_2647
Symbol:
ID: 4077950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2781272
End bp: 2782768
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 60%
IMG OID: 638007971
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_614641
Protein GI: 99082487
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
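
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 2781272, End bp 2782768
length = gene_length(2781272, 2782768)
print(length, protein_length(length))  # 1497 498
```

Under these assumptions the computed values match the reported Gene Length (1497 bp) and Protein Length (498 aa).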


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.835336
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGACAT CACCATTTCA AAAGGTTTAT GTGATGTCGC AGATCCATCG AGACGAGCTG 
CTTTTCTGGT TCGGACTTGG CGAGGAAGAT CGTTCTTCTA TTGTCGGTAT GGCGCCCCTC
GTGGAGCGCC ACATTGATGC TGTTCTCGAC GATTTCTACG ACCTCTGCCT GTCGCGGCCG
GAAACCAAAG GGTATTTTCC GACCGAACAG ATCGTGGCCC ATGCCAAGGC TGCGCAGCGC
GCCCATTGGG TGAAGCTCTT TTCGGGACGT TTTGACGACA GCTACATGAG CTCTGCCGAC
AAGGTTGGAC GGGTGCATTT TGAGGTTGAT CTGCCGTTTC ACCTGTTTCT GGGCGGCTAT
GCCACGGTGG GCGACCGTAT CCTCAAGGTG GTCCTGACGC AGAAAAACGG CTGGCGCGGC
ACCAAGGCAA GGATGAAACA GGCGCGCGCG CTGCAGCGGC TCATTCTGTT TGATTGCGAG
CGGGTGATCG CAGGGTATAT CGACGCGCAG CTGCAGGAAC GTGAAAAGGC GCTCACGGTC
ATGACCCAGG GGATTCAGCG ATTGGAATCC GGGGATCTGA CCACCGAGAT GCCGTCGGGC
GACAAGGGCG GCTTGCCGGA GCGCTATGAC AATGTGCGTC TGTCGTACAA TCACCTGTTG
CGCCACTGGT CGGGGATCCT GTCGGATGCC ACCCGGCGCG CGCATTCGGT GGATGCCAAG
ATGGCCGAAA CAGCGCGTAT GACCCGCGAG ATGGCGCATC GCTCGGGCGA ACAGGCCAGC
ACCCTGAACG AGACTGTGAC GTCTGTGAAC AGCATGACCT GCAGCACACG ACTAACCAGC
GAGAAGGTCA ACGATGCGGC CAAGCAGATC GAGGCCAACC GACTTGTGGC CGAGGAGGGC
GGCGCGGTTG TGCGCGATGC GATTACGGCG GTCGAACGGA TCGAACAATC CTCCAGCCAG
ATCAACAAGA TCGTTGATGT GATTGATGAA ATCAGCTTTC AGACCACATT GCTTGCGCTC
AATGCCGGGG TGGAGGCCGC GCGCGCGGGC GAAGCTGGCA AAGGCTTTGC CGTCGTGGCA
TCGGAGGTGC GCGCGCTTGC AAGCCGGGCC ACGGATGCGG CGGATGATAT CAAGAAACTG
ATTGCCACCA GCTCTGATCA GGTCCTCGAG GGGTGTCAAT TGGTGCATCA GACCGGGGAG
CGGCTCGACA GCATCCGATC AAGCGCGGAC AACGTGGCGC GGATCATGCA AGAGGTCGAC
GGTGTGATCT TTGAGCAGAC CGGCCAGCTT GAAAACATCA GCGAACGGAT GGAGCGGCTG
GATCGATACA CGCAGGACAG CGCCGATCAG GCCAATGAGG TCTCCAGCAC GGCGGATGCG
CTGGCGCAGG ACTCGGCGGC GCTGAAGGCC AGCATGGACG GGTTCAAAAC CGGCGATATT
GCGTTGCGAG AGGCGGCGCA GGATAAGGAG TTCGAACGTC TCGCCAATCG CGGCTAG
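
The gene sequence is printed in space-separated blocks of ten bases, so any script that reads it needs to strip the whitespace first. As a minimal sketch of how a GC percentage such as the one reported in the record could be computed from this layout, the hypothetical helper below removes whitespace and counts G and C bases. It is shown on the first printed row only, so its result is not expected to equal the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()  # drop spaces and newlines
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First printed row of the gene sequence, copied with its block spacing
first_row = "ATGGTGACAT CACCATTTCA AAAGGTTTAT GTGATGTCGC AGATCCATCG AGACGAGCTG"
print(f"{gc_percent(first_row):.1f}%")
```

Passing the full sequence, with all rows joined, would give the value for the whole gene.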
 
Protein sequence
MVTSPFQKVY VMSQIHRDEL LFWFGLGEED RSSIVGMAPL VERHIDAVLD DFYDLCLSRP 
ETKGYFPTEQ IVAHAKAAQR AHWVKLFSGR FDDSYMSSAD KVGRVHFEVD LPFHLFLGGY
ATVGDRILKV VLTQKNGWRG TKARMKQARA LQRLILFDCE RVIAGYIDAQ LQEREKALTV
MTQGIQRLES GDLTTEMPSG DKGGLPERYD NVRLSYNHLL RHWSGILSDA TRRAHSVDAK
MAETARMTRE MAHRSGEQAS TLNETVTSVN SMTCSTRLTS EKVNDAAKQI EANRLVAEEG
GAVVRDAITA VERIEQSSSQ INKIVDVIDE ISFQTTLLAL NAGVEAARAG EAGKGFAVVA
SEVRALASRA TDAADDIKKL IATSSDQVLE GCQLVHQTGE RLDSIRSSAD NVARIMQEVD
GVIFEQTGQL ENISERMERL DRYTQDSADQ ANEVSSTADA LAQDSAALKA SMDGFKTGDI
ALREAAQDKE FERLANRG