Gene RPD_1149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1149 
Symbol 
ID4021625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1306521 
End bp1308161 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content69% 
IMG OID637961341 
ProductABC transporter related 
Protein accessionYP_568288 
Protein GI91975629 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.108293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAGA CCAAGCCCGC CGTCCTGACG CTCGACCGGC TCAGCGTGCG ACTGCCCGGA 
GGCGCTGATC GCGCGCACGC GGTGCGCGAT GCGTCGCTGA CAATCGCCGC CGATGAAATT
CTCTGCGTGG TCGGCGAGTC CGGCTCCGGC AAATCGATCA TGGCGAACGC GGTGATGCAG
CTGTTGCCGG GCGGCGTCGC GCTCGACGGC GGACGCGTGC TGTTCGAGGG CCGCGACCTG
GCGCAGGCCA GCGCCGCGGA GATGCGCAGC GTGCGCGGCG CCGGCATCGC GATGGTGTTT
CAGGAGCCGA TGACCGCGCT CAATCCGCTG CGCAGCATCG GCGACCAGAT CGGCGAGATG
TTCCGCATCC ACACAAGGCT GTCGAAGCAG GAGATCCGCG CCAAGGTGCT GGCGCTGCTC
GAAGACGTCC GGATTCCCGA TCCGCCACGC ACCGCCGATG CCTATCCGCA CGAGCTGTCG
GGGGGCCAGC GCCAGCGCGC GATGATCGCG ATGGCGCTGG CGCTCGACCC TAAACTGTTG
ATCGCCGACG AGCCGACCAC CGCGCTCGAC GTCACCACGC AGGCGCAGAT CCTGACGCTG
ATCCGCGAAT TGCAGCATCG CCGCAACACC GCGGTGCTGT TCATCACCCA TGATTTCGGC
GTGGTCGCCG AGATCGCCGA TCGGGTCGCG GTGATGCAAC GCGGCGTCAT CGTCGAACAG
GGCGCGGCCG ACGCCGTGCT GCATCGTCCG CAACATCCCT ACACACGGAA ACTGATCGCG
GCGGTGCCGC CGCTGACCGC GCCGCCGCCG CGGCCGATCT CGACCGAGAC CATCCTCGAC
ATCGCCGACG TCACCAAGAC CTTCCGCACC GGCGGCTTTC TCGGCCGCGG CGCGCGCGTC
ACTGATGCGG TGAAGTCGGT GTCGCTGAAG CTGCCGCGCG GCGCCACGCT CGGCATCGTC
GGCGAAAGCG GCTCCGGCAA GTCGACGCTG GCGCGCTGCA TCATCCGGCT GCTCGATCCC
GACGGCGGCT CGATCCTGCT CGAGGGCCGC GACTGGGCGA CGATGCCGCG CGAGCAGGTG
CGGCGCGAGA CCCGGCACAT GCAGATGGTG TTTCAGGACC CGTTCGCCTC GCTCAATCCG
CGGCACAAGG CCGAGGAGCT GGTGGCGCAG GGGCCGATCA TCCACGGCAC GCCGCGCGCG
CAGGCGATCA GGGAAGCGCG CGAGCTGTTC GCGCTGGTCG GGCTCGATCC CGCGTCCGGC
GACCGGCTGC CGCACGAATT CTCCGGCGGC CAGCGCCAGC GGATCGGGCT GGCGCGGGCG
CTGGCGCTGA AGCCGGACGT GCTGATCGCC GACGAGGCGG TGTCGGCGCT CGACGTGTCG
GTGCAGGCGC AAGTGCTGCG GCTGCTCGCG AATTTACGCG AGCGGCTCGG CCTGTCGATC
GTGTTCATCA CCCACGATCT GCGCGTGGCG GCGCAGATCT GCGATCTCGT CGCGGTGATG
AAGAACGGCG AAGTGGTCGA GCACGGCCCC GCCGGCGAGG TGTTCAACGC GCCGAAGCAT
CCGTATACGC AGGCGCTGCT CGCCTCGATC CCCGGCGGGG ATTTTGCGCG GAGTCATCCT
GTCGAGCCGG TCGAGGCCTG A
 
Protein sequence
MTETKPAVLT LDRLSVRLPG GADRAHAVRD ASLTIAADEI LCVVGESGSG KSIMANAVMQ 
LLPGGVALDG GRVLFEGRDL AQASAAEMRS VRGAGIAMVF QEPMTALNPL RSIGDQIGEM
FRIHTRLSKQ EIRAKVLALL EDVRIPDPPR TADAYPHELS GGQRQRAMIA MALALDPKLL
IADEPTTALD VTTQAQILTL IRELQHRRNT AVLFITHDFG VVAEIADRVA VMQRGVIVEQ
GAADAVLHRP QHPYTRKLIA AVPPLTAPPP RPISTETILD IADVTKTFRT GGFLGRGARV
TDAVKSVSLK LPRGATLGIV GESGSGKSTL ARCIIRLLDP DGGSILLEGR DWATMPREQV
RRETRHMQMV FQDPFASLNP RHKAEELVAQ GPIIHGTPRA QAIREARELF ALVGLDPASG
DRLPHEFSGG QRQRIGLARA LALKPDVLIA DEAVSALDVS VQAQVLRLLA NLRERLGLSI
VFITHDLRVA AQICDLVAVM KNGEVVEHGP AGEVFNAPKH PYTQALLASI PGGDFARSHP
VEPVEA