Gene RPD_3865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3865 
Symbol 
ID4024381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4303889 
End bp4307257 
Gene Length3369 bp 
Protein Length1122 aa 
Translation table11 
GC content63% 
IMG OID637964069 
ProductATP-binding region, ATPase-like 
Protein accessionYP_570987 
Protein GI91978328 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTGGGC GACAGCGCAT AGACCGCGTG CGGCGCCAGT ACAATCAATG GGTCGCCAAC 
CAGACACTCG AAGATTATGC GCTGCGTTTC ACCGCGAAGA GCGCGCGACG CTGGTCGGCG
TTGCGGGTCG CCAATACGGC GCTCGGCGCG ATTTCCTTCC TCGCGCTGGA AGCGATCGGC
GGCACCATCA CGCTGAACTA CGGCTTCGCC AACGCCACCA CCGCGATCCT GGTGGTCAGC
GCGATCATCT TCTGTTGCGG CCTGCCGATT GCCTATCATG CAGCCAGATG TGGCGTCGAT
ATAGACTTGC TTACGCGCGG CGCCGGATTC GGCTACATCG GATCGACGGT CACGTCGCTG
ATCTATGCCT CATTTACATT CATCTTCTTC GCGATCGAAG CCGTTATTCT CGCGACCGCG
CTGGAGTTGT GTTTCGGTCT GCCGCTGCCG CTGGGTTATC TGTTCAGCGC GGTGGTGATC
ATTCCGCTGG TGACGCACGG CATCACGTTG ATCAGCCGAT TTCAGCTCTG GACCCAGCCG
TTCTGGATCT TGCTTCACAT CCTGCCGTTC GCGGCGATCG CCATTAAAAG CCCGGAATCC
TTCGTCGGCT GGACGAATTT CGCCGGCGCG CACGGCGATC CCGGAGGGCA TCTCGACCTG
CTGCTGTTCG GCACTGCGGC TTCGGTCGTG TTCTCGCTGG TGGCGCAGAT CGGCGAACAG
GTCGATTTTC TGCGCTTCCT GCCGCGCGAC CGCCACACCT CGAAAGCGGC GTGGTGGACC
GCGATGGTCT GCGCCGGACC GGGCTGGATC GTGCCGGGCG CAATCAAGCT GCTGGCCGGC
TCCTTCCTCG CCTATTTCGC ACTGAGCCAC GGCATCGAGC CGGAGAATGC GGCCGAGCCG
TCGCACATGT ATCGCGAGGC GTTCCGCTAC GTGCTGTCGC AACCGGAACT TGCGTTGGCG
TTGACCGGCG CGTTCGTGAT GCTCTCGCAG ATCAAGATCA ACGTCACCAA CGCCTATGCG
GGCTCGATCG CCTGGTCGAA CTTCTTTTCG CGGCTCACCC ACAGCCATCC CGGCCGCGTC
GTCTGGCTGG TGTTCAACGT GCTGGTCGCC CTGCTGCTGA TGGAGATCGG CGTCTACAAG
GCATTGGAGC AAACGCTGGC GCTGTACTCC AACGTCGCGA TCGCCTGGGT CGGCGCGCTG
GTCGCTGACC TCGTCATCAA CCGGCCGCTC GGCCTGCGGC CGCGGCATAT GGAGTTCAAA
CGCGCGCATC TCTACGACAT CAACCCGGTC GGCGTCGGCT CGATGGCGAT CGCCACGGTG
ATTTCGATCG CCGCGTTCTA CGGCGTATTC GGGCCGACCG CCAAGGCGCT GGCGCCATTC
GTTGCGCTGC TGACGGCCGT CGTCACCGCG CCGGTCATCG CCATCCTGAC GCGCGGGCGA
TTCTACATCG CCCGCAAGCA GAAGCGTTCG TGGCAGAATC TGCCGGCGAT CAATTGCTGT
ATCTGCGAAC ATTCGTTCGA GCCGGAGGAC ATGGCGTCTT GCCCCGCTTA TGCGGGACCG
ATCTGCTCGC TGTGCTGCTC GCTCGACGCG CGCTGCCATG ATCTGTGCAA GCCGCACGCG
CGGTTCGACG CGCAGTTCAC CGACGCGCTC GGCAAAGTTC TGCCGCGGCG CGTGCATGCG
CTGATCAATT CCCAGGTCGG ACATTATCTC AGCGTGTTCA TGGTGTCGGC CGGGCTGGTC
GGGCTGACGC TGATGATGAT CTATTTGCAG ACCTCGCCCG CGACCGAGGC GCAGCAGATC
GTGCTGTGGG ACATTCTCTG GAAGCTGTTC TTCGCACTCA CCATCATCGT GGGCGTGGTG
ACCTGGTTGT TCGTGCTGGC GCGGCAGAGT CGTCAGGCTG CCGAGGCCGA GACGCGGCGG
CAAACCACTT TGCTGATGCA GGAGATCGAT GCGCACCGCC AGACCGACGC CGAGTTGCAG
CGCGCCAAGG AGGTCGCGGA ATCCGCCAAT CTCGCCAAGA GCCGCTATGT GGTCGGGCTC
AGCCATGAGT TGCGCTCACC GCTGAATGCG ATCAGCGGCT ACGCGCAGTT GCTCGAGCAG
GACAGCAGCC TGCCGCCGCG ACCGCGCGAC CAGATCCGGC TGATGCGGCG CAGCGCCGAT
CACCTCTCCG GCCTGATCGA CGGATTGCTC GACATTTCCA AGATCGAGGC CGGGCGGCTG
TATCTGTCGC GCGACGAGGT GCGGCTGGGG GAATTCCTCG ACCAACTCGT CGGCATGTTC
CGGCTGCAGG CTGCGGCCAA GGATGTCGAG TTCGTCTTCA AGCGCCCGCA ATATCTGCCG
CCGGTGGTCT ATGCCGACGA AAAGCGGCTG CGCCAGGTGC TGATCAACCT GCTGTCCAAC
GCGATCAAGT TCACCCAGAG CGGCAGCGTG CATTTCATCG TGCACTATCG CAACCCGGTG
GCCGAACTCG AAGTGCGCGA CACCGGGCCG GGCATCCACG CCGACGATCT CGAACGGATC
TTCGCGCCGT TCGAGCGCGG CGCGCTCGGC GTATCACAGC CGCACACCGG CACCGGCCTC
GGCCTGACGA TCAGCAAATT GCTCGCCGGC GTGATGGGCG GCGACCTCAG CGTCACCAGC
GCGCTCGGCC AGGGCTCGAC ATTCCGAGTC AAATTGCTGC TGTCGGAAGT CACCAACCCG
ACCCGCAGCC GCGCGATCCG CGCGCCGATC CTCGGCTATC ACGGCCCGCG CAAAACCATT
CTGATCACCG ACGACGATCC GGCCCAGCGT AATCTGCTGC AGGAACTGCT GGCGCCGATC
GGCTTCATCG TGCTGAGCGC GCCCGACGGC TACACCTGCA TCAGCCTCGC CGAACACTGC
CAGCCCGATC TGTTTCTGCT CGACATCTCG ATGGCCGGAA TAGACGGCTG GACCGTCGCT
GAAACGCTGC GGACCAACGG ACATCACTAC GCACGCATCC TGATGGTCTC GGCCAGCGCT
ATCGAGGCCC ATGGCGCCCC GCTGGCGCAG CCATACCACG ACGGCTATCT GATGAAGCCG
GTCGACATCC CACGGCTGCT CGAACAGATC GGACAATTGC TCAAGCTGGA GTGGATTCAC
AAAGGCGAAG CTCAGGAGAC GCTCGACTTC ACCGGCGAGT TTCAGAGTCC GCCGATGCAG
CACGTCGAGG AATTGATCGA ACTCGGCAAG ATCGGCTACA TCCGCGGTAT CGAATCAAAA
CTCGATCAGA TCAACAATGA TTATCCGGAG TCAGGACTGT TCGTGTCGAA GATGCGCGCG
CTGGTCGAAC AATTCGACCT TGACCAGTAC ATGAAGACCC TGAACACGCT GTATCGCCAT
GACCAATGA
 
Protein sequence
MVGRQRIDRV RRQYNQWVAN QTLEDYALRF TAKSARRWSA LRVANTALGA ISFLALEAIG 
GTITLNYGFA NATTAILVVS AIIFCCGLPI AYHAARCGVD IDLLTRGAGF GYIGSTVTSL
IYASFTFIFF AIEAVILATA LELCFGLPLP LGYLFSAVVI IPLVTHGITL ISRFQLWTQP
FWILLHILPF AAIAIKSPES FVGWTNFAGA HGDPGGHLDL LLFGTAASVV FSLVAQIGEQ
VDFLRFLPRD RHTSKAAWWT AMVCAGPGWI VPGAIKLLAG SFLAYFALSH GIEPENAAEP
SHMYREAFRY VLSQPELALA LTGAFVMLSQ IKINVTNAYA GSIAWSNFFS RLTHSHPGRV
VWLVFNVLVA LLLMEIGVYK ALEQTLALYS NVAIAWVGAL VADLVINRPL GLRPRHMEFK
RAHLYDINPV GVGSMAIATV ISIAAFYGVF GPTAKALAPF VALLTAVVTA PVIAILTRGR
FYIARKQKRS WQNLPAINCC ICEHSFEPED MASCPAYAGP ICSLCCSLDA RCHDLCKPHA
RFDAQFTDAL GKVLPRRVHA LINSQVGHYL SVFMVSAGLV GLTLMMIYLQ TSPATEAQQI
VLWDILWKLF FALTIIVGVV TWLFVLARQS RQAAEAETRR QTTLLMQEID AHRQTDAELQ
RAKEVAESAN LAKSRYVVGL SHELRSPLNA ISGYAQLLEQ DSSLPPRPRD QIRLMRRSAD
HLSGLIDGLL DISKIEAGRL YLSRDEVRLG EFLDQLVGMF RLQAAAKDVE FVFKRPQYLP
PVVYADEKRL RQVLINLLSN AIKFTQSGSV HFIVHYRNPV AELEVRDTGP GIHADDLERI
FAPFERGALG VSQPHTGTGL GLTISKLLAG VMGGDLSVTS ALGQGSTFRV KLLLSEVTNP
TRSRAIRAPI LGYHGPRKTI LITDDDPAQR NLLQELLAPI GFIVLSAPDG YTCISLAEHC
QPDLFLLDIS MAGIDGWTVA ETLRTNGHHY ARILMVSASA IEAHGAPLAQ PYHDGYLMKP
VDIPRLLEQI GQLLKLEWIH KGEAQETLDF TGEFQSPPMQ HVEELIELGK IGYIRGIESK
LDQINNDYPE SGLFVSKMRA LVEQFDLDQY MKTLNTLYRH DQ