Gene RPD_2018 details

Gene Information

Locus tag: RPD_2018
Symbol:
ID: 4022500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2261930
End bp: 2264245
Gene Length: 2316 bp
Protein Length: 771 aa
Translation table: 11
GC content: 65%
IMG OID: 637962211
Product: ATP-binding region, ATPase-like
Protein accession: YP_569154
Protein GI: 91976495
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
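
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2261930, 2264245)
print(length)                  # 2316
print(protein_length(length))  # 771
```

Both values agree with the Gene Length (2316 bp) and Protein Length (771 aa) fields of this record.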


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.88254
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.468508
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGCG TGCAGGCGGC GAACGCTTGC GTTCAATCCG ATTCAATCAA GGGATTGGCG 
CAATCGATCG CTAAACCGGC CTACCATCGG CTGCTGACGG CCGAGCCGGT GCTCCGCCGC
GCCGTCCCGA TCCTGATCAT CGCTTTCCTC GTGACCATCT GCTTCGGCGC CGCGGTTCAG
GTCATCGACC AGAGCCGGCA GAAGCGCGCC GCGCTCAATC GCGACCTCTC CGCACTCGCC
GACCTGCTCG CGGAACGGCT CGAACATATC GGTGTGGTGC GGCTCGACCG CGCGGCGTCG
ATCGAACGCC TGCAGAGCCT GCTACCCGGC CTCGTTCCAT CGTGGGGCAT CGCCGCGGGA
CGGCATGTCA TCGTCACCGG CGCCGATCAA CGCGTGCTGG CGCGGGTGCC GGGCGATGAA
GGCGCCAGCG CCGCCAACAG ATTCCTCGAC ATCATCAGCG CCACGCCGAC GCTGACGAAG
TCCTCACAGC AAGGCGTCCT CAGCGAGATC ACCTTGCCGA GCGGCGCCTC GGCGCTGGCG
ACGCTGCACA CCATCAAGTC GTTGCCGGGC CAGGTCATCA TCATCCAGGA AGATACCGGC
TCGATCCTGC GCTCCGATGC GGCGCTGCAG ATCACCCTGT CGGCCACCAC CGGCTTCGTC
GTGCTGATCC TCGGCTTCGC CTTCCACTGG CAGTCGACCC GCGCGCGCGA AGGCGACCTG
ATCAACGACG CGGTGCGCAG CCGGATCGAC ACCGCGCTCA ATCGCGGCCG CTGCGGCCTG
TGGGATTGGG ACCTGTCGCG CGGCCGGATC TTCTGGTCGC AGTCGATGTT CACCCTGCTC
GGCCTGGAGA GCCGCAACGA CCTCCTGACC TTCGGCGAGG TCAACGCGCT CGTGAACAGC
GACGACATCG ACCTGTTCAC GATCGGCGAC GAACTGATCG CCGGCGCCGC CGATCACATC
GACCACAGCT TCCGGATGCG GCACGCCAAC GGTCACTGGA TCTGGCTGCG GATGCGTTGC
GAGCTCAGCC AGGAATCGCC CACCGAAAAC AAGCATCTGA TCGGCATCGC GGTCGACGTC
ACCGAACAGA AGAGCCTCGC CGAACGCACC GTCGAAGCCG ATCTGCGGCT GCGGGACGCG
ATCGAAACCC TGCCCGAGGC GTTCGTGCTG TGGGACTCCG ACAACCGACT CGTGCTGTGC
AACTCGCACT TCCAGCGGCT GCACAAGCTG CCGGATTCCG CGATTGCGCC CGGCACGTCC
TATGAGACGG TGATCGAGGT CGGGCAGATG CCGGAAATCC GCACCCGGCA ATGCGACGTC
GGCGCGGGGT CGCCACCCGG CGCGCGCACC TTCGAGGCCC AACTCGACGA CGGAAGCTGG
CTGCACATCA GCGAACGCCG CACCAAGGAC GGCGGCTACG TCGCCGTAGG CACCAACATC
ACCCGCATCA AGGCGCACGA GCAGAAGCTC GTCGACAACG ATTTGCGGCT TCGCGCAACC
GTCGCCGACC TGAAAATCTC GCAGGTCAAG CTGGAGCGCC AGGCGATCGA GCTCGCCGAT
CTCGCGCGAA AATACTCCGA GGAAAAAAAC CGGGCCGAAG AAGCCAACCA GGCCAAGTCG
AAATTCCTCG CTAATATGAG CCACGAGCTG CGCACGCCGC TCAACGCGAT CATCGGCTTC
TCGGAGATTA TGGGCAGCGG CATGTTCGGC ACGCTGGGCT CGGAGAAGTA TCAGGAATAC
TGCCACGACA TCATGACCAG CGGTCACTAT CTGCTGGAGG TGATCAACGA CATTCTCGAC
ATGTCGAAGA TCGAGGCCGG CCGCATGCGG CTCGAGATGG AGGAGCTCGA TCTCGCCCGA
ACGCTTGGCG AATCGCTCAA GGTCGTCGCC GGTCGCGCCG ACAACAAGCA TCTCGAGCTT
CGCGCCGAAA TCGAGGACGG CATCCCGATC GTGGCGGATC GCCGCGCGAT CAAGCAGATT
CTGATCAACC TGTTGTCGAA CGCCGTGAAA TTCACCCCCG ACGGCGGGCG GGTGACGGTG
CGCAGCCGGA CGCTGGAAGA TTCGATCGTG ATGATGATCG CCGATTCAGG CATCGGCATC
GCCCCGCAAT CGCTGCGAAG GCTCGGACAG CCGTTCGAGC AGGTCGAGAG CCAGCTCACC
AAGACCTATC ACGGCTCCGG GCTGGGTCTG GCGATCGCCA AGTCGCTCAC CAGGCTGCAT
GGCGGCTCGA TGCGTCTGCG CTCGACGCTC GGCGCCGGAA CGGTGGTGAT GGTCACGCTG
CCGCGCGACT GCCAGAAGCG CCGGATGGCG GCCTGA
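
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed, for comparison with the GC content field of this record. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first sequence line is used here as a short illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


first_line = "ATGGCGCGCG TGCAGGCGGC GAACGCTTGC GTTCAATCCG ATTCAATCAA GGGATTGGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_content(seq):.1%}")
```

Running `gc_content` over all lines of the gene sequence, joined the same way, gives the value to compare with the reported figure.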
 
Protein sequence
MARVQAANAC VQSDSIKGLA QSIAKPAYHR LLTAEPVLRR AVPILIIAFL VTICFGAAVQ 
VIDQSRQKRA ALNRDLSALA DLLAERLEHI GVVRLDRAAS IERLQSLLPG LVPSWGIAAG
RHVIVTGADQ RVLARVPGDE GASAANRFLD IISATPTLTK SSQQGVLSEI TLPSGASALA
TLHTIKSLPG QVIIIQEDTG SILRSDAALQ ITLSATTGFV VLILGFAFHW QSTRAREGDL
INDAVRSRID TALNRGRCGL WDWDLSRGRI FWSQSMFTLL GLESRNDLLT FGEVNALVNS
DDIDLFTIGD ELIAGAADHI DHSFRMRHAN GHWIWLRMRC ELSQESPTEN KHLIGIAVDV
TEQKSLAERT VEADLRLRDA IETLPEAFVL WDSDNRLVLC NSHFQRLHKL PDSAIAPGTS
YETVIEVGQM PEIRTRQCDV GAGSPPGART FEAQLDDGSW LHISERRTKD GGYVAVGTNI
TRIKAHEQKL VDNDLRLRAT VADLKISQVK LERQAIELAD LARKYSEEKN RAEEANQAKS
KFLANMSHEL RTPLNAIIGF SEIMGSGMFG TLGSEKYQEY CHDIMTSGHY LLEVINDILD
MSKIEAGRMR LEMEELDLAR TLGESLKVVA GRADNKHLEL RAEIEDGIPI VADRRAIKQI
LINLLSNAVK FTPDGGRVTV RSRTLEDSIV MMIADSGIGI APQSLRRLGQ PFEQVESQLT
KTYHGSGLGL AIAKSLTRLH GGSMRLRSTL GAGTVVMVTL PRDCQKRRMA A
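
The protein sequence can be reproduced from the gene sequence by codon translation. Translation table 11 (bacterial, archaeal and plant plastid code) uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may serve as start codons. The sketch below therefore builds the standard codon table. It does not handle alternative start codons, which does not matter here because this gene begins with ATG. The function name `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGGCGCGCG TGCAGGCGGC GAACGCTTGC"))  # MARVQAANAC
```

The output matches the first ten residues of the protein sequence listed above.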