Gene Rsph17029_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1104 
Symbol 
ID4895119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1138487 
End bp1140865 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content64% 
IMG OID640111690 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001042986 
Protein GI126461872 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCA CCGACCCTTC GCGCTTATGG CGCTTCAAGC AGCCTCTGGT GCTTCTGGGT 
ATGCCCTTGC CCATGTTGCT CGGCATCGCA ACCGCGCTCT GGCTCAACCA CGAAGGGGCG
CTACGCGCGA CGGAACAACA GACGCAGGCC TTCTCCAGGC TGATTGCGGC CGAAGGGCGC
GCGGCCCTTG CCTTCCGCGA TGGTCATCGT CTGTCGCAAC TGTTCGGCAC GGCCGCCGCA
ACTTATCCGG GCGAAACCTT CTCGGCTCTC GCGATGGATG TGGAGGGGCG CGTCATTGCC
TCGATGCCGG CGGATCTGGC CGGCGCCGAG AACCTGCAGG CCCAAGCGCT GAGCGCAATG
ACCGTGGGGG CTCCCGTGAA GGCCAAGGGC GGCACCTCCA TTGCCATCCC GGTCACCCGC
GAGGGGGACG ACTCTCTCGC CGGCGTCTTT GCCGTGTCCC TGCCCGCGCT GAATGGCTCC
TACACATTGC TGATGCCGGC AACCGCACTG GTCGCGGGCC TGATGATCTC AATGGGGATT
GCGGGACATC TCTGGAGGCG CCGGAAAGAG ACCGAGCGGC TGCTCATCGA GACGACCCGG
CGGATCAAGA ACAACCAGAG GCCCGACGCC ACCGCCATGA CGCGGATCGA GCTGTCGATC
CCAACACTTG CGAACGAAAT CGATGCCCTG TCCGCCGCTC TGCAGGGCGA GCGGGAGCAG
TTCGAGGCAG CGCATAGCCG AGCCATGGCA CTCGACGCCC TCCCAACCCC CTGGATCCTC
GTCGCCAGCG ATGGTCGCGT GCTCATGATG AACCACCCCG CGCGGCAGGT GGTTGCGGGC
CTGCCGGAGC CACTTCTCGA AGGTCAACCG GTTGCAAGTC TCCACCATGA TCTCGCACGG
GCTTGGCCGG ACCGCACAGG CAGGACGATG TCCCTGGCGA TCGGGGGACG CCAATATCAG
GCGACACGGG CTCCCGTCGG CGCAACGGGT GCCGAGGTCC TTAGTTTCAC CGACCGGACC
GAAGAGACAC AACTTGATCT CCTCCTCCAA GGGGTCATTC GCGAAGCCAT CGCTGCGACT
TATGACATCA GAGGTCAGAT GATAAAGGCG AGCGACGGCT TTGCGAAGCT CTTCGGATCC
TCGGGGGCCT CGCTGCGCTC CCTGCTCGAG GCAACGCCGG ACAGTGCCGA ACTCCTTGCT
GCGGTCGAAC GGGACGGCGA AGGGCGCAGT TTCCTGAGCC GAGAGGGCTA TGGTTCAGAC
CAGGTGCAGG TCGGCATCTC GATCTTGCGA CGACCGGGCG GCGGCCACTT GGTCCTAGTG
ACGGAGATCC GCCACCAGTA CGCGCAGGCA CCGACGACTG CCGCTGCGGA CCAGCTTCCG
TCGGCACCTG AGCGGCTGAT TTCAGCGCTT CGGCAGCTTG CCCAAGGTGA TCTCGGATCC
CGTCTGGATC ATCCGCTTCC CGAACCCTTC GAAGGCCTGC GCCCGGACTT CAACAGCGCC
CTGCAAGGGC TCGCCTCTCT CGTCGAAGAC GTGATCTCTG CCGCTGAAAG CATCCGCAAC
GAAGCCCGAG ACATCAGTTC CGCTGCACAA TCTCTGGCTC AACGAACGGA AAGCACCGCC
GCCACTCTGG AAGAGACAGC GGCAGCGCTT GACGGGCTAA CAGTTTCGGT CCGGTCCGCT
GCGGACGGCG CCGCAGAAGC AGATCGGGTG GTCGCCGACG CGCGGGCGAA CGCTGAGGAG
AGCGGGCATG TCGTCGTCGA GACAGTGGCA GCGATGGACA TGATTGCCGC CTCATCCGAC
AAGATCACCT CGATCGTGAA AGTGATCGAC GACATCGCGT TTCAGACAAA TCTGTTGGCG
CTCAATGCCG GCGTAGAGGC CGCCAGGGCA GGAGACGCCG GCCGCGGATT TGCGGTCGTC
GCCTCCGAAG TGCGGGCGCT CGCTCAAAGG TCTTCCGAAG CTGCGAGAGA GATCACGGAC
CTCATCCTGA AGAGCGGCAA TCAGGTGCGG CGCGGCGTGG ATCTGGTTGG AAAGACGGGT
GACGCGTTGA AGCAGATCGT ATCCTCGGTT TCCGAAATCT CGACTCTGGT CTCCGATATC
GCCGTGTCGT CGAGGCAGCA GTCTGTGTCT CTCGCGGAGA TCAACTGCGC GGTGAACAAC
CTCGACCAGT CAACCCAGCA GAATGCGGCG CGTCTCGAGG AGGCGACCGC CGCAAGCGAG
TCGCTTACGA CGAGCGCCAA CGCGCTTTTC GACACGGTGC AGCAGTTTCA CCTGGACGCT
CCTCCCAAGC GCAACCGCCC CACACCCCCC CTGACAGCGG CGACGCCACA CAACTCCAGG
GCGCTCGTGC GGGCAGAGCC CGGATGGGAG GACTTTTGA
 
Protein sequence
MNRTDPSRLW RFKQPLVLLG MPLPMLLGIA TALWLNHEGA LRATEQQTQA FSRLIAAEGR 
AALAFRDGHR LSQLFGTAAA TYPGETFSAL AMDVEGRVIA SMPADLAGAE NLQAQALSAM
TVGAPVKAKG GTSIAIPVTR EGDDSLAGVF AVSLPALNGS YTLLMPATAL VAGLMISMGI
AGHLWRRRKE TERLLIETTR RIKNNQRPDA TAMTRIELSI PTLANEIDAL SAALQGEREQ
FEAAHSRAMA LDALPTPWIL VASDGRVLMM NHPARQVVAG LPEPLLEGQP VASLHHDLAR
AWPDRTGRTM SLAIGGRQYQ ATRAPVGATG AEVLSFTDRT EETQLDLLLQ GVIREAIAAT
YDIRGQMIKA SDGFAKLFGS SGASLRSLLE ATPDSAELLA AVERDGEGRS FLSREGYGSD
QVQVGISILR RPGGGHLVLV TEIRHQYAQA PTTAAADQLP SAPERLISAL RQLAQGDLGS
RLDHPLPEPF EGLRPDFNSA LQGLASLVED VISAAESIRN EARDISSAAQ SLAQRTESTA
ATLEETAAAL DGLTVSVRSA ADGAAEADRV VADARANAEE SGHVVVETVA AMDMIAASSD
KITSIVKVID DIAFQTNLLA LNAGVEAARA GDAGRGFAVV ASEVRALAQR SSEAAREITD
LILKSGNQVR RGVDLVGKTG DALKQIVSSV SEISTLVSDI AVSSRQQSVS LAEINCAVNN
LDQSTQQNAA RLEEATAASE SLTTSANALF DTVQQFHLDA PPKRNRPTPP LTAATPHNSR
ALVRAEPGWE DF