Gene TM1040_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1083 
Symbol 
ID4076316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1157674 
End bp1159446 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content62% 
IMG OID638006387 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_613078 
Protein GI99080924 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATC TCTCGCTGCG CAAACAAATC TTTGGTTTTG CGGGAATCTT TATCGCGATG 
ATCCTGATCA TCGCCGCGAT TTCGTGGTTT GCAAACCAAC GCCTTGCCGG GGCCACCTAT
CACTATCGCG CGGTCAGCAC CCAAAGCAAA AGCTTCGACG CCATCAAGGA AGACATCGAA
CAAGGTATCG GCGATCTCTT GTCCTATACC GTCGGCATGC CCGAGGGGCT GAGCGACCTG
CGCGCCAATA TCGAAGAGAT CCGTAGCGAG CTTGCCGTCG CAGAGGATAA TTTCAAATCC
ACCCCCATCA TCGCCGCACG GGACATGCAG GCCTACGACG CGCTGGTCTC GACCGAGCCG
CTTCTGGATC AGCTTGAGGC AACATTGCAG GAGGTAGAGC GCACCGAAGG CGAGGCGCAG
CTGCGCGTGG TCTTTGACAA GGTGTTCCCG CTCGCCGGAC AGGTGCGCGA TGTGGTGGAT
GCGCTTCAGG ACAAACTTGC CGCCACCAGC AAATCCGTCC GCGCAGAGGT CGACTCCCTC
ATTTTGTTCT GCCAGATCAT ACAGATCGCC ACCAGCGTCG CAGCGGCGCT TGTCGCGGTG
ACCGTGGCCT TTGTCTTTGG GCGCAAATTG AGCCAACCAG TGTCTGACGC AGCGCAGAGC
ATCGCGGCGC TGGCAAAGAA GGACTACGTG GCAGAGATTT CCGGCACTCA GCGCGGCGAC
GAACTCGGTC AGATTGCCCG CAATCTCAAG GATTTGCGCA CTCAGCTCGC CGAGGCTGAC
GCGCATGACC GCCAGAACGC CGCCGAAAAC GCGCGCCGGG TCGAGCTATT TGGCGTCCTC
GGTGCCTCCA TGAGCGGTCT CAAGAGCGGC GATCTCGACC AGAACATCGT GGCGCAGGAC
TGGGAAGACC TCGGCCCCGG TTACGCCACG CTCTGCGAGG ATTTCAACGC GCTCTCCTCC
TCGCTTTCGG ATCTGGTGGC CCAGCTCAAT CAAAGCTCCA CCGTCGTGGA ACAAAACGCG
CGCGAAATGG AACGGATGTC GGATCAGATG TCGCAGCGCT CCGAGACCCA GGCCGCCACG
CTGGAAGAAA GCGCTGCCGC GCTGGAAGAA ATGTCGACCG CCGTGCAATC CAGCGCCGCG
CAGGCCAAGG CCGCCGACCG CGAAGTTGAG GAAGGCCGCC GCCGCGCCGA ACAGGGCGGC
GAGGTGATGG CGCAGGCGAG CCGCGCCATG GCCTCGATTG CGGAATATTC CAACCGCATC
TCCCAGATCA TCACCGCGAT CGACGATATC GCCTTTCAGA CCAGTCTGTT GGCGCTCAAC
GCGGGCGTCG AGGCCGCGCG GGCCGGGGAA GCCGGCCGTG GCTTTGCGGT GGTGGCCTCC
GAAGTGCGCG GACTGGCCAT GAAAGCGGCC CATTCTGCAA GTGAAATCAA GCAGTTGGTT
CAGGAAAGCT CCAGCCAGGT CGAAGAGGGA GAGCAGCTGG TACAGGCCAC CGCCGAAACC
CTGACCCAGA TCGTCGAGAG CGTCACCAAT GTCTCTGGCA TGGTCTCCGC CATCGCCAGC
TCCTCCAGCG AGCAGGCTGC CGGCATCCAG GAAATCAACA TCGGCGTGGC GCAGCTCGAC
AAGGCCACGC AGGAAAACGC CGCCATGGTG CAGGAAACCT ATTCCGCCAG CCATGAGATG
CGCACCCAGG CCTCCCGCCT CACCAACCTG CTAGAAGGCT TTACCGGTGG GCAGGCAAGC
TCTAGCACCG CAGCTCCCGC CCGCGCGGCC TGA
 
Protein sequence
MTNLSLRKQI FGFAGIFIAM ILIIAAISWF ANQRLAGATY HYRAVSTQSK SFDAIKEDIE 
QGIGDLLSYT VGMPEGLSDL RANIEEIRSE LAVAEDNFKS TPIIAARDMQ AYDALVSTEP
LLDQLEATLQ EVERTEGEAQ LRVVFDKVFP LAGQVRDVVD ALQDKLAATS KSVRAEVDSL
ILFCQIIQIA TSVAAALVAV TVAFVFGRKL SQPVSDAAQS IAALAKKDYV AEISGTQRGD
ELGQIARNLK DLRTQLAEAD AHDRQNAAEN ARRVELFGVL GASMSGLKSG DLDQNIVAQD
WEDLGPGYAT LCEDFNALSS SLSDLVAQLN QSSTVVEQNA REMERMSDQM SQRSETQAAT
LEESAAALEE MSTAVQSSAA QAKAADREVE EGRRRAEQGG EVMAQASRAM ASIAEYSNRI
SQIITAIDDI AFQTSLLALN AGVEAARAGE AGRGFAVVAS EVRGLAMKAA HSASEIKQLV
QESSSQVEEG EQLVQATAET LTQIVESVTN VSGMVSAIAS SSSEQAAGIQ EINIGVAQLD
KATQENAAMV QETYSASHEM RTQASRLTNL LEGFTGGQAS SSTAAPARAA