Gene RPB_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1038 
Symbol 
ID3909162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1192660 
End bp1194297 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content69% 
IMG OID637882931 
ProductABC transporter related 
Protein accessionYP_484659 
Protein GI86748163 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.448383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAGA CCAAAGCCGC CGTCCTGACG CTCGACCGGC TCAACGTCCG CTTGCCCAAG 
GGTGCCGACA GGACGCACGC CGTGCGCGAC GCGTCGCTCA CGATCGCCGC CGATGAAATC
CTCTGCGTCG TCGGCGAGTC CGGCTCCGGC AAGTCGATCA TGGCCAACGC GGTGATGCGG
CTGTTGCCGG GCGGCGTCGC GCTCGACGGC GGCCGCGTGC TGTTCGAGGG CCGCGATCTG
GCGCAGGCCA GCGCCGCCGA GATGCGCGCC GTGCGCGGCG CCGGCATCGC CATGGTGTTT
CAGGAGCCGA TGACGGCGCT CAACCCGCTG CGCTCGATCG GCGACCAGAT CGGCGAGATG
TTCCGCATTC ACACCGACCT GTCGAAGAAG GACATCCGCG CCAAGGTGCT GGCGCTGCTC
GAGGACGTGC GGATTCCCGA TCCGGCGCGC GCCGCCGACG CCTATCCGCA CGAACTCTCA
GGCGGGCAAC GCCAGCGCGC GATGATCGCG ATGGCGCTGG CGCTCGACCC GAAGCTGTTG
ATCGCCGACG AACCGACCAC CGCGCTCGAC GTCACCACGC AGGCGCAGAT CCTGACGCTG
ATCCGCGAGT TGCAGCATCG CCGCAACACC GCGGTGCTGT TCATCACCCA TGATTTCGGC
GTGGTCGCCG AAATCGCCGA TCGCGTCGCG GTGATGCAGC ACGGAATCAT CGTCGAGCAG
GGCACCGCCG ACGACGTGCT GCACCGGCCG CAGCATCCCT ATACGCGGCA ACTGATCGCC
GCGGTACCGC CTCTGACCGC GCCGCCGCCG CGGCCGATCG CGGCCGAGAC TATTCTGGGC
ATCGACAAGG TCTCGAAGAC ATTTCGCACC GGCGGCTTTC TCGGCCGCGG CGCGCGCGTC
ACCCACGCCG TGAAATCAGT GTCGCTGCAA TTGCCGCGCG GCGCGACGCT CGGCATCGTC
GGCGAATCCG GCTCCGGCAA GTCGACGCTG GCGCGCTGCA TCATCCGGCT GCTCGACCCC
GATTGCGGCT CGATTCTGCT CGACGGCCGC GACTGGGCGA CGATGCCGCG CGAAGACGTA
CGTCGCGAGA CCCGCCACAT GCAGATGGTG TTTCAGGACC CGTTCGCCTC GCTCAACCCG
CGCCACAAGG CCGAGGAATT AGTTGCGCAG GGCCCGATCA TTCACGGCAC GCCGCGCAAG
CAGGCGATCG CCGAGGCGCG CGAGCTGTTC GCGCTGGTCG GGCTCGATCC CGCCGCCGGC
GACCGGCTGC CGCACGAATT CTCCGGCGGC CAGCGCCAGC GCATCGGGCT GGCCCGCGCA
TTGGCGCTGA AGCCCGACGT GCTGATCGCC GACGAGGCGG TGTCGGCGCT CGACGTCTCG
GTGCAGGCGC AGGTGCTGCG GCTGCTCGCC GATCTGCGCG AGCGGCTCTG CCTGTCGATC
GTCTTCATCA CCCACGATCT GCGCGTCGCC GCGCAGATCT GCGACCTCGT CGCTGTGATG
AAGGACGGCG AAGTCGTCGA ACACGGCCCC GCCGGCGAGG TCTTCAACGC GCCGAAGCAT
CCCTACACGC AGGCGCTGCT GGCCTCGATC CCGGGCGGCG ATTTCGCGCG GAGCCATCCG
GTGGAAGCGG TGGTGTAG
 
Protein sequence
MTETKAAVLT LDRLNVRLPK GADRTHAVRD ASLTIAADEI LCVVGESGSG KSIMANAVMR 
LLPGGVALDG GRVLFEGRDL AQASAAEMRA VRGAGIAMVF QEPMTALNPL RSIGDQIGEM
FRIHTDLSKK DIRAKVLALL EDVRIPDPAR AADAYPHELS GGQRQRAMIA MALALDPKLL
IADEPTTALD VTTQAQILTL IRELQHRRNT AVLFITHDFG VVAEIADRVA VMQHGIIVEQ
GTADDVLHRP QHPYTRQLIA AVPPLTAPPP RPIAAETILG IDKVSKTFRT GGFLGRGARV
THAVKSVSLQ LPRGATLGIV GESGSGKSTL ARCIIRLLDP DCGSILLDGR DWATMPREDV
RRETRHMQMV FQDPFASLNP RHKAEELVAQ GPIIHGTPRK QAIAEARELF ALVGLDPAAG
DRLPHEFSGG QRQRIGLARA LALKPDVLIA DEAVSALDVS VQAQVLRLLA DLRERLCLSI
VFITHDLRVA AQICDLVAVM KDGEVVEHGP AGEVFNAPKH PYTQALLASI PGGDFARSHP
VEAVV