Gene TM1040_1027 details

Gene Information

Locus tag: TM1040_1027
Symbol:
ID: 4078539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1100965
End bp: 1103352
Gene Length: 2388 bp
Protein Length: 795 aa
Translation table: 11
GC content: 58%
IMG OID: 638006331
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_613022
Protein GI: 99080868
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
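
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1100965
END_BP = 1103352
GENE_LENGTH_BP = 2388
PROTEIN_LENGTH_AA = 795


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record: 2388 bp, i.e. 796 codons, i.e. 795 aa plus stop.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```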


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.551962
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCGTT TGAGTAATCT TAAACTCGGT TTGAAATTAC CCCTTTTGAT TGTATTTCCA 
GTTGTCGTGA TTGTGCTCGT TTCAGGCGCG CTGCAGTTGT TCCAGATGAA GCAGGCGGTG
GAACGCGAGC ATGAAGTCTC CTTCCTCGGT GTTGTGGAGG GGCGCAAAGA CGCGCTGGAA
TACTGGCTTG CCGAAGTTGA GGCAGAGGTC GTCGCTCTTT CTGACAGCTA CGCCGTCAGA
TTGGCGACAC AGGAATTCTC GGAGGCCTGG CAGACTTTTG ACGGGGAGGC GTCCTCGGCG
CTGCGCAGTC TCTATATTGA CGATAACCCC AACCCGGTCG GCCAGAAAGA TGTCCTGAAT
ACCGCCGAGG ATGGCTCGGC TTGGTCACAG GTGCATGAAC GCCATCACGC CGGTCTGCGC
GCGCATCTTC GCGCTCATGG CTATTATGAT GTCTTCCTCT TCGATCTGGA CGGGAACCTG
ATCTATTCGG TGTTCAAGGA GGATGATTTT GGCCTGAACT TTGAAACCGG GAAATACAAG
GAAAGCGGTC TAGGCCAAGT CTACCGCAAT GGGCTGACCC TCGCAGAGGG GCAGTTCTTC
ATGAGTTCCA TGGCGCCCTA TGCGCCCAGT GCCGGCGCGC AGGCGATGTT CATGTCGACG
CCCGTGTTTC TCGAGGGAGA GCGCATCGGG GTGCTGGCGG TTCAGTTCCC GCTTGATGGC
ATCATGCATA TTCTGTCGCA ATCCGAGCAA CTTGGCGAAA CCGGCGTGGT CTATCTGGTC
AATGAGCAAG GGCTGTCTCT GACCGCGTCC CCTCGTGAGG GTGGGTTTGC CGCCTTCGAG
GCGTTGCCGG AGCGCGAACA GATCAAGAGC GCCCTCTCGG GCCAGTCCGG CTATTTTGAT
CCCGTGGTGG GGATGCATGG GGATGAGGTC CTCGCGGCCA CGACCAGCGT GACCGTTCCC
AACGGGAGCG AATGGGGGCT GGTGCTGGAA ATCAACCACG ATGAGGCGAT GGCAGTTGTC
AACACGGCGA TACGCAGCGC CGTGATCGAA TTGGCCATTA CCATCGCGTT GGTTGTTGGC
ATCTCGAGCA TGGCGGTGCG CGGCCTGGTA AAACGCCTCG AACGCCTTGC GGGCCATGTG
GAGAGCCTCG CTGACGAAAA CTATGAAGAG GAAATCTCGG GTTGCGATCA GCGCGACGAG
GTGGGCTTTA TTTCCGAGAC ATTGGTTCAC CTGCAGGGGC GCTTGCAGGA AGGGGCTGCG
GCCAAAGAAC GCGAGCGTGT GATCCAGGAG GGCAATGAGA TGGTTGTCCG CACGCTCAGC
AAGGCCTTGA TGAACCTCGC CAAAGGCGAC TTTCGCAACC ATATCCTCGA GTTTTTCCCG
GTCGAACACA AGAAACTGCG CTACAGCATC AATGATGCGA TGACGGAGTT GAGCGATGTG
ATCGAAGAGG TGCGCGAGGT GGCCAGTAGC ATCTCCAAAG GTGCTGCGGA GCTGAGCGAA
TCTGCGGACG ACATGTCTTC GCGCACCGAA AGCCAGGCCG CCACGCTTGA AGAGACCGCC
GCCGCGCTAG AAGAAGTCAC GGCGAGCGTC AAATCTGCCA ATGAACACGT CATGAATGTG
GAGCACACCG TCAGCCAGGC GCGCGGCATG GCCGAGAACA GCGGTGTGAT CGTGACCGAA
ACCATTGAGG CGATGAACGG GATCGAGAGC TCTTCGCAAC AGATCAGCCA GATCATTAGC
GTCATTGACG ATATTTCGTT CCAGACCAAC CTCTTGGCCC TGAACGCAGG GGTTGAGGCC
GCACGCGCAG GTGAGGCGGG ACGCGGCTTT GCGGTTGTTG CCTCCGAGGT GCGCAGCCTC
GCCCAGCGGT CCTCCGAAGC GGCGTTGGAA ATCAAGACCC TGATCGAAAC CAGTGGCGAG
CAGGTCGGGC GCGGGGTGCA GATGGTTGGC AAGACCGGCG ATGCGCTGAC ACAGATCGTG
GATCAGGTGA AGGAAATCGC AGGCCTCATC GAAGAGATCG CCAAATCTTC GCAAGAACAA
TCGACCGCGC TCATCGAAAT CAACGTGGGC ATGTCGCAGC TTGATCAAGT CACTCAGGCG
AACGCGGCTA TGGTCGAAGA GAACACAGCG GCTTCGCATC TCTTGCGCCA GGACTCGCAG
CGTCTTGCAG AATTTGTGAG CCGGTTCCGC ACGCAGAAAG ACGCCAAATC CGAAGAGGCG
CCCAAGGCCT TGCCGGTTCC GGCAGAAGCT GCAAAACCCA CAGCTCACGG GGAAGAGTGG
CAAGACAGCG CCACCGAGGC GACCCCGGTC ACCGTCGACC TGGCTGCCGA CGAGGTGGAC
GACGTGCCCC AACCAAAGCG CGCAAATGAA AGATGGTCGG ATTTCTGA
 
Protein sequence
MLRLSNLKLG LKLPLLIVFP VVVIVLVSGA LQLFQMKQAV EREHEVSFLG VVEGRKDALE 
YWLAEVEAEV VALSDSYAVR LATQEFSEAW QTFDGEASSA LRSLYIDDNP NPVGQKDVLN
TAEDGSAWSQ VHERHHAGLR AHLRAHGYYD VFLFDLDGNL IYSVFKEDDF GLNFETGKYK
ESGLGQVYRN GLTLAEGQFF MSSMAPYAPS AGAQAMFMST PVFLEGERIG VLAVQFPLDG
IMHILSQSEQ LGETGVVYLV NEQGLSLTAS PREGGFAAFE ALPEREQIKS ALSGQSGYFD
PVVGMHGDEV LAATTSVTVP NGSEWGLVLE INHDEAMAVV NTAIRSAVIE LAITIALVVG
ISSMAVRGLV KRLERLAGHV ESLADENYEE EISGCDQRDE VGFISETLVH LQGRLQEGAA
AKERERVIQE GNEMVVRTLS KALMNLAKGD FRNHILEFFP VEHKKLRYSI NDAMTELSDV
IEEVREVASS ISKGAAELSE SADDMSSRTE SQAATLEETA AALEEVTASV KSANEHVMNV
EHTVSQARGM AENSGVIVTE TIEAMNGIES SSQQISQIIS VIDDISFQTN LLALNAGVEA
ARAGEAGRGF AVVASEVRSL AQRSSEAALE IKTLIETSGE QVGRGVQMVG KTGDALTQIV
DQVKEIAGLI EEIAKSSQEQ STALIEINVG MSQLDQVTQA NAAMVEENTA ASHLLRQDSQ
RLAEFVSRFR TQKDAKSEEA PKALPVPAEA AKPTAHGEEW QDSATEATPV TVDLAADEVD
DVPQPKRANE RWSDF
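
The relationship between the two sequences above can be illustrated with a short translation routine. This is a sketch only, using plain Python rather than a bioinformatics library. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons), and the helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, as displayed on this page.
first_line = "ATGTTGCGTT TGAGTAATCT TAAACTCGGT TTGAAATTAC CCCTTTTGAT TGTATTTCCA"
print(translate(first_line))  # MLRLSNLKLGLKLPLLIVFP
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the result to be compared with the listed protein sequence and GC content.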