Gene TM1040_1487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1487 
Symbol 
ID4077043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1588830 
End bp1590218 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content57% 
IMG OID638006800 
Productmethyl-accepting chemotaxis sensory transducer with Pas/Pac sensor 
Protein accessionYP_613482 
Protein GI99081328 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTTTA AGAAGTCCAA AACCGCTGCG TCCTCCAGCC TGAATGACGA TCAGAAGCTG 
CGCGCGATGA TCGAAAACAC CCAGGCCGTC ATCGAGTTCA AGGTCGATGG GACCATCATT
CGCGCAAACA AGGTCTTTTT GGACACGCTG GGCTACACGC TTGAGGAAAT CGAGGGCCAA
CACCACAGCA TGTTTGTCTA TCCGAGTTTT GTGAGAAAAG ACGCCTACAA GCAGATGTGG
CGCGATCTGG CTGCGGGCAA TCCGCTGACG GATCAGTTTC CGAGGCTGCG CAAGGACGGA
GAAGTTGTTT GGATTCAAGC AACCTACGCG CCGTTCTTCA ATGAAGATGG GACCGTCGAC
CGGATCGTGA AAATCGCGTC CAACATCACC AAGCGCCGAA TGGAAATTCT TGGAATTGCC
GCCGCGCTGG AACAACTGAG AAACGGTGAT TTGACCTATC GATGCGCCCC CTCCGACATC
GATGACATCG GGCGTCTCTC CACAGCCTAT AACGAGGCTG TCAGCTCTCT GCAAAACACC
ATTCGCACGG TGCGCGAAGT GGCAGATGGC GTCACACGCA CGGCGGGTCA GTTGAATGAC
TCCTCCTCTG AACTGTCGCA ACGCACGGAA AATCAGGCGG CGACACTGGA GGAAACAGCC
GCCGCAGTCG AGGAGTTGAC AGCAACCGTC CGCTCTTCGG CGGATGGCGC ACGCGAAGTC
GAGGACAATG TTCGCCAGGC CCGCGTGACC GCTGAGAAAA GCGGCCAGGT CGTCAGTGAC
GCTGTAAAAG CAATGTCCAA GATCGAAGAA TCCTCGCAAA GCATTTCAAA GATCATCTCC
GTGATCGACG ACATCGCCTT TCAGACCAAC CTGCTGGCCT TGAACGCTGG CGTCGAAGCT
GCGCGCGCAG GCGAAGCGGG GCGCGGATTT GCCGTGGTTG CCTCCGAGGT CCGAAATCTA
GCGCTTCGCT CTGCGGATGC CGCTGGTGAA ATCAAAGGCT TGATCGATCA AAGCGCCACC
CATGTCTCTG ACGGCGTGAC CCTGGTGGGC CGCACCGGCA CAGAACTCGA GGCGATCATC
AGCAGCGTGA ACTCGATTTC TGAGAACGTC AGCGTCATCG CACGCGGAGC CGAGGAACAA
GCCGTAACGC TCAGCGAAAT CAACAGCGGC ATCGGTCATC TTGATGAGGT GACACAAAAC
AACGTCGCTA TGGTGGAAGA GACCACTGGC GCAAGCCAGA CCCTCGCCAG CGATGCCTCG
GTGATGTCGA AAGAGATCGC GAATTTTGTA TCCGAGGGCG AAATCAAGAT CGCAAAACAT
GTCGCGCCGA TCCCGAACAC GCCGCCGCCA ACCGTGGAAC CTGAACTGGA ACTGCGCACC
GGCACCTAG
 
Protein sequence
MLFKKSKTAA SSSLNDDQKL RAMIENTQAV IEFKVDGTII RANKVFLDTL GYTLEEIEGQ 
HHSMFVYPSF VRKDAYKQMW RDLAAGNPLT DQFPRLRKDG EVVWIQATYA PFFNEDGTVD
RIVKIASNIT KRRMEILGIA AALEQLRNGD LTYRCAPSDI DDIGRLSTAY NEAVSSLQNT
IRTVREVADG VTRTAGQLND SSSELSQRTE NQAATLEETA AAVEELTATV RSSADGAREV
EDNVRQARVT AEKSGQVVSD AVKAMSKIEE SSQSISKIIS VIDDIAFQTN LLALNAGVEA
ARAGEAGRGF AVVASEVRNL ALRSADAAGE IKGLIDQSAT HVSDGVTLVG RTGTELEAII
SSVNSISENV SVIARGAEEQ AVTLSEINSG IGHLDEVTQN NVAMVEETTG ASQTLASDAS
VMSKEIANFV SEGEIKIAKH VAPIPNTPPP TVEPELELRT GT