Gene RPD_1918 details


Gene Information

Locus tag: RPD_1918
Symbol:
ID: 4022400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2155174
End bp: 2157240
Gene Length: 2067 bp
Protein Length: 688 aa
Translation table: 11
GC content: 65%
IMG OID: 637962111
Product: chemotaxis sensory transducer
Protein accession: YP_569054
Protein GI: 91976395
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
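
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for RPD_1918.
start_bp, end_bp = 2155174, 2157240

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2067
print(protein_length_aa(length))  # 688
```

Under those assumptions the result agrees with the listed gene length (2067 bp) and protein length (688 aa).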


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.537222
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGAA CAACCCAGTC GATTGCCGCA CCAGAGTCTT TCAGACTCCT CGCACTGTTT 
ACCAATCTCA AGATCGGCAC CAAGATTTCG ATCGGCTTCG CCGCCCTTCA GGTGCTGATG
GCAGTGCTCG CCGTCATGAA CTACTCCAGC TTCGGTACGA TTGGGGCGGC GTTTACGTCG
TTCGGTCAAC GCGTCGAGGT CGTCGACGTC GCTCACGGGA TCGATCGCGG ATTCGGCTCC
TTCCGCAGCA TCGTGCAAGA ATACGGCCTG ACCGGAGACG ACGCCCTGCT CGGCGAAGCG
GAGCAGCGCA AGCAACAGCT CGCCGACAGC ATCGCGCAGG GCTCGCGGCA TATCCAGAGT
CCCGAGACCC AAGCTGTCAT GGCCGATATC GGCCGACAGT TCGGCACTTA CAGCATGCAG
TTCGCTGAGG TCATCAAGCT GCGGCAGAGT CAGAACCAGC TGACGCTCGA GGTGCTCGGT
CCGGTCGGCC AACGTATCAT GACCAAGCTC GAGCAATTGC AGAGCACGAC AATGGCCGGC
GATGGCAATA GCGGCAACGC GCTGCTGGTC GGGCAGGCGT TGAAGCAGGT CCTGTTGATG
CGGCTTGCCG TCGTCAAGGT GCTGGGGATC CGGATCGAGA AGGATGCGTC GACCGCTGCC
GAGAAGGCGC TGTCGGAGCT GAAGATCACG ATGTCGGCGC TGAAGACGTC GCTCCGCACC
GACGCGGAGC AGAAACAGAT CGGCGCGATC GAGACCGATC TCGCGCTGTA TTCCGACGGC
TTCCGTCGGG CGTCGCGGGA CGTCAGTGCG GTCGAGGCGA AGATCGGCGA GATGGCCCAG
ATCGCGCAGT CGCTGTCCGC TTCGGCCGTG CGAATCAATC AGACCGGCGT CGCCGAGCAG
AAGCAGATCG GAGATGAGAC CAGGGCGCTG GTCGCGGATA CAAGGTACCA AACGCTGATC
ATCACCATCG CGGCTCAGTT GCTCGGCTTC GTGCTCGGCT GGCTAATCGG TCGGGCGGTG
TCGCGCCCAA TCGTGAAGAT CTCCGACACG ATGCGGGAGC TTGCGTCCGG CAATCTCGAC
GTCGCCGTCG CCGGTTCGGG GCGGCAGGAC GAGATCGGCC TGATGGCCTG CACCGTCGAA
GTGTTCAAGA CCAATGCGCT CGACGTCCGG CGCTTGAAGG TCGAGCAGGA GGCGGCCGAG
CAGCGCGTCG CCGAGGAGCA CAAGGCGGAG ATGCGGCGGC TGGCCGATGA TTTCGAAGGT
GCGGTCGGCC AGATCATCCA GACCGTCACC TCGGCCTCGA CCGAACTGGA AGCCTCTGCC
GGCACGCTGA CGTCGACGGC TGATCGCTCG CAGGAGCTCG CGACCTCGGT TGCGGCGGCG
TCCGAGCAGG CGTCGGCCAA TGTGCAGTCG GTGGCCTCTG CGACCGAGCA GATGGCGTCG
TCGATCTCCG AGATCAGCCG CCAGGTTCAG GAATCGGCGC GGATCGCCGG AGAAGCGGTG
GATCAGGCGC GCAAGACCAA CCAGCGGATC GGCCACCTCT CCGAGGCGGC GAACCGGATC
GGCGATGTCG TCGACCTGAT CAACACCATC GCGGGACAGA CCAACCTGCT GGCGCTGAAC
GCCACAATCG AAGCGGCGCG CGCCGGCGAC GCTGGTCGCG GCTTTGCGGT GGTCGCCTCT
GAGGTCAAGG CGCTTGCCGA GCAAACTGCC AAGGCGACCG GAGAGATCAG CCAGCAGATC
AGCGGCATGC AGGCGGCGAC GCAGGACTCG GTCGGTGCCA TCCGTGAAAT CGGCGGCACC
ATCGAGCGGA TGTCGGAGAT TGCCTCGACG ATCGCTTCGG CGGTGGAGGA GCAGGGAGCT
GCGACCCAGG AGATCTCCCG CAACGTGCAG CAAGCGGCGC AGGGCACTCA CCTGGTCTCG
ACCAACATCA CCGACGTGCA GCGCGGCGCC ACCGAGACCG GTTCGGCTTC CTCTCAGGTC
CTCGCCGCCG CGCAATCGTT GTCGCGCGAC AGTGGCCTGC TGCGGCAGGA GGTCAGCCGC
TTTGTCGAGA CGGTGCGTGC GGCGTAG
 
Protein sequence
MTRTTQSIAA PESFRLLALF TNLKIGTKIS IGFAALQVLM AVLAVMNYSS FGTIGAAFTS 
FGQRVEVVDV AHGIDRGFGS FRSIVQEYGL TGDDALLGEA EQRKQQLADS IAQGSRHIQS
PETQAVMADI GRQFGTYSMQ FAEVIKLRQS QNQLTLEVLG PVGQRIMTKL EQLQSTTMAG
DGNSGNALLV GQALKQVLLM RLAVVKVLGI RIEKDASTAA EKALSELKIT MSALKTSLRT
DAEQKQIGAI ETDLALYSDG FRRASRDVSA VEAKIGEMAQ IAQSLSASAV RINQTGVAEQ
KQIGDETRAL VADTRYQTLI ITIAAQLLGF VLGWLIGRAV SRPIVKISDT MRELASGNLD
VAVAGSGRQD EIGLMACTVE VFKTNALDVR RLKVEQEAAE QRVAEEHKAE MRRLADDFEG
AVGQIIQTVT SASTELEASA GTLTSTADRS QELATSVAAA SEQASANVQS VASATEQMAS
SISEISRQVQ ESARIAGEAV DQARKTNQRI GHLSEAANRI GDVVDLINTI AGQTNLLALN
ATIEAARAGD AGRGFAVVAS EVKALAEQTA KATGEISQQI SGMQAATQDS VGAIREIGGT
IERMSEIAST IASAVEEQGA ATQEISRNVQ QAAQGTHLVS TNITDVQRGA TETGSASSQV
LAAAQSLSRD SGLLRQEVSR FVETVRAA
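
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any comparison. The following is a minimal sketch of how the gene sequence could be translated and checked against the protein sequence. It assumes that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in which codons may act as starts, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not from any library.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGACCCGAA CAACCCAGTC GATTGCCGCA CCAGAGTCTT TCAGACTCCT CGCACTGTTT"
last_row = "TTTGTCGAGA CGGTGCGTGC GGCGTAG"

print(translate(first_row))  # MTRTTQSIAAPESFRLLALF
print(translate(last_row))   # FVETVRAA
```

Both rows happen to start in frame because every full row holds 60 bases, a multiple of 3. Passing the complete gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would extend the same check to the whole record, and `gc_fraction` on the full gene sequence can be compared with the listed GC content.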