Gene RPB_3610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3610 
Symbol 
ID3911412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4145184 
End bp4146875 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content66% 
IMG OID637885512 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_487216 
Protein GI86750720 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.735593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.400148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTCCA GACTGTCGAT CCGCGCCAAG ATCACGATCG TGGTGGCGTT CATGCTGGTG 
ACGATGTCCG TGTTGGGCGC TGTCGCCGTG CAGAAGCTGT ACGCGATGAA CGCCAGCACC
GAAGACATAG CCACCAACTG GCTGCCGAGC GTTCGGGTGC TCGGCGAACT GAGGGCCGGG
GGCATCACCT ACCGCAACGT CATTCGCCAG CACATGCTGT CCTTCGCGGC CGAAGAGAAG
CAGGCGATGG AGAAAACCCT CGAGACCGTC AAGGGCAACA TCGCGAAATC GCGGGCGGCC
TATGAACCGC TGATCACCAC GCCTGCAGAA CGCGCGCTCT ATCAGGAATG GTCCGGGCTG
TGGGGAGACT ACGTCCGGGC GACCGAGGAG GTGCTCAACC GATCCCGCGC CGATATCGGC
AAGGTCTCGC AGGAGACCCG TGATTTCAAT GCCAAGACCG CGAACCCGAT CGGCGTGAAG
GCGGACGACG TGCTGCAGAA GGATATCGCC CTGAACAACG CTGGCGCCGA CGCGGCGACC
AAGTCCGCCG AGCAGACGTT CCGTTCGGCG ATGGTCATGT TCGTCAGCAT CCTCGTTCTC
GCCGCGGCCG CCAGCATCGG TCTCGGCATA TTCCTGATCC GCGACGTGTC GCGCGGCATT
TCCTCGATCA TTCAGCCGAT GCAGGCGCTC GGCCGTGGCG ACCTCACCGC GACCGTGCCG
CATCAAGGCG AGAAAACGGA AATCGGCGCG ATGGCCGATG TGCTGCAGGT GTTCAAGCAG
GCGCTGATCG ACAAGAAGGC CGCGGATGAA GCCGCCGCGG TCGACGCCGA AGCCAAGATC
GCACGTGGCC AGCGTGTCGA CGCGATCACC CGGCAGTTCG AGTCGATGAT CGGCGAGATC
GTCCAGACCG TGTCCTCGGC GTCGACCCAA CTCGAAGCGT CGGCGGCGTC GCTGTCCTCG
ACCGCCGGTC GATCCCAGGA AGTCACCACC GTCGTCGCCT CGGCCTCCGA AGAGGCGTCG
AACAACGTCC AGTCGGTCGC CTCCGCGACC GAAGAAATGG CCTCGTCGGT CAACGAGATC
AGCCGCCAGG TCCAGGATTC GGCGCGGATC GCGGGCGAGG CGGTGTTGCA GGCGCAGAAG
ACCAATGCCC GCGTCAGCGA CCTGGCGCAG GCCGCGACCC GGATCGGCGA TGTCGTCGAG
TTGATCAATT CGATTGCCGG CCAGACCAAC CTGCTCGCGC TCAACGCCAC CATCGAGGCG
GCGCGGGCGG GCGACGCCGG ACGCGGATTC GCCGTCGTTG CCAGCGAGGT GAAGGCACTC
GCCGAGCAAA CCGCCAAAGC CACCGGCGAA ATCAGCCAGC AGATCGCCGG CATCCAGTCC
GCCACCCAGG AATCCGTCGG CGCGATCCAG GAGATCGGCA ACACCATCGG ACGGATGTCG
GAAATCGCCT CGACCATCGC TTCCGCGGTG GAAGAGCAGG GCGCCGCGAC CAAGGAGATC
GCCCGCAACG TCCAGCAGGC GGCCCAGGGC ACACAACAGG TGTCGGCGAA CATCGTCGAC
GTTCAGCGCG GCGCTGGCGA GGCCACGGCG GCTTCGACGC AGGTGCTGTC GGCGGCCCAG
TCATTGTCGG CGGAAAGCGG CCGGCTGAGG GCCGAGGTCG GCAAGTTCCT GGAGTCGGTG
CGCGCAGCCT GA
 
Protein sequence
MFSRLSIRAK ITIVVAFMLV TMSVLGAVAV QKLYAMNAST EDIATNWLPS VRVLGELRAG 
GITYRNVIRQ HMLSFAAEEK QAMEKTLETV KGNIAKSRAA YEPLITTPAE RALYQEWSGL
WGDYVRATEE VLNRSRADIG KVSQETRDFN AKTANPIGVK ADDVLQKDIA LNNAGADAAT
KSAEQTFRSA MVMFVSILVL AAAASIGLGI FLIRDVSRGI SSIIQPMQAL GRGDLTATVP
HQGEKTEIGA MADVLQVFKQ ALIDKKAADE AAAVDAEAKI ARGQRVDAIT RQFESMIGEI
VQTVSSASTQ LEASAASLSS TAGRSQEVTT VVASASEEAS NNVQSVASAT EEMASSVNEI
SRQVQDSARI AGEAVLQAQK TNARVSDLAQ AATRIGDVVE LINSIAGQTN LLALNATIEA
ARAGDAGRGF AVVASEVKAL AEQTAKATGE ISQQIAGIQS ATQESVGAIQ EIGNTIGRMS
EIASTIASAV EEQGAATKEI ARNVQQAAQG TQQVSANIVD VQRGAGEATA ASTQVLSAAQ
SLSAESGRLR AEVGKFLESV RAA