Gene RPD_3452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3452 
Symbol 
ID4023966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3833015 
End bp3834664 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content62% 
IMG OID637963656 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_570576 
Protein GI91977917 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.747993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGCC AGTTCGTCTA TTTCATGCAG GGCCTGACCA AGGCCTATCC GACCCGCAAG 
GTGCTGGATA ACGTCCATCT GTCGTTCTAC CCCGACGCCA AGATCGGCGT GCTCGGCGTC
AACGGCGCCG GCAAGTCGAC GCTGCTCAAG ATCATGGCCG GGATCGACAA GGAATACACC
GGCGAGGCCT GGGTCGCCGA AGGCGCCCGC GTCGGCTATC TCGAACAGGA ACCGCAGCTC
GATGCCGCGC TGAACGTGCG CGAGAACGTC ATGCTCGGCG TTGCCAAGCA GAAGGCGATC
CTCGATCGCT ACAACGAGCT GGCGATGAAC TATTCCGAGG AAACCGCCGA CGAGATGACC
GCGCTCCAGG ACCAGATCGA GTCCGCGGGG CTGTGGGATC TCGACAGCAA GGTCGACCAG
GCGATGGACG CGCTGCGCTG CCCGCCCGAT GACGCAGACG TCACCAAGCT GTCCGGCGGC
GAGCGCCGTC GCGTCGCGCT GTGCAAGCTG CTGCTCGACC AGCCCGAACT GTTGCTGCTC
GACGAACCGA CCAACCATCT CGACGCCGAG TCGGTGTCTT GGCTCGAAAA TCATCTGCGC
AACTATCCGG GCGCGATCCT GATCGTCACC CACGATCGTT ACTTCCTCGA CAACGTCACC
TCCTGGATTC TCGAGCTCGA CCGCGGCAAG GGAATTCCCT ACGAGGGCAA CTACTCGTCC
TGGCTGGTGC AGAAGCAGAA GCGGCTGCTG CAGGAGGGGC GCGAGGATGC GGCCCACCAG
AAGACGCTCG AGCGTGAGCA GGAGTGGATC GCGTCGTCGC CGAAGGCACG CCAGGCCAAG
TCCAAGGCGC GCTACCAGCG CTACGATGAA CTGCTTGCCA AGGCCAGCGA GAAGCAGACC
CAGGCCGCGC AGATCATCAT TCCGGTGGCC GAGCGTCTCG GTAACAATGT GGTCGAATTT
GATCACCTGA CCAAGGGCTT CGGCGACAAG CTGCTGATCG ACGACCTGAC CTTCAAGCTG
CCGCCCGGCG GCATCGTCGG CGTGATCGGC CCGAACGGCG CCGGCAAGAC CACGCTGTTC
CGGATGATCA CCGGGCAGGA AAAGCCCGAC CAAGGTACCA TCACGGTCGG CGAGACCGTG
CATCTTGGCT ATGTCGATCA GTCGCGCGAC AGCCTCGACG CCAAGAAGAC CGTTTGGGAA
GAGATTTCCG GCGGCAATGA GCAGATCCTG CTCGGCAAGA AGGAAGTTAA TTCGCGCGGC
TATTGCTCGT CCTTCAACTT CAAGGGCGGT GACCAGCAGA AGAAGGTTGG TTCGCTGTCA
GGCGGCGAGC GTAACCGCGT CCACCTCGCC AAGATGCTGA AGTCCGGCTC CAACGTGCTG
CTGCTCGACG AACCGACCAA CGACCTCGAC GTCGATACGC TGCGGGCGCT GGAAGAGGCG
CTCGAGGATT TCGCCGGCTG CGCCGTGATC ATCAGCCATG ACCGCTGGTT CCTCGACCGT
ATCGCCACGC ATATCCTCGC CTTCGAGGAC GACAGCCACG TCGAATGGTT CGAAGGCAAC
TTCCAGGACT ACGAGAAGGA CAAGATGCGC CGGCTCGGTC AGGACTCGGT GATCCCGCAC
CGGGCGAAGT ATAAGAAGCT GACGCGGTGA
 
Protein sequence
MARQFVYFMQ GLTKAYPTRK VLDNVHLSFY PDAKIGVLGV NGAGKSTLLK IMAGIDKEYT 
GEAWVAEGAR VGYLEQEPQL DAALNVRENV MLGVAKQKAI LDRYNELAMN YSEETADEMT
ALQDQIESAG LWDLDSKVDQ AMDALRCPPD DADVTKLSGG ERRRVALCKL LLDQPELLLL
DEPTNHLDAE SVSWLENHLR NYPGAILIVT HDRYFLDNVT SWILELDRGK GIPYEGNYSS
WLVQKQKRLL QEGREDAAHQ KTLEREQEWI ASSPKARQAK SKARYQRYDE LLAKASEKQT
QAAQIIIPVA ERLGNNVVEF DHLTKGFGDK LLIDDLTFKL PPGGIVGVIG PNGAGKTTLF
RMITGQEKPD QGTITVGETV HLGYVDQSRD SLDAKKTVWE EISGGNEQIL LGKKEVNSRG
YCSSFNFKGG DQQKKVGSLS GGERNRVHLA KMLKSGSNVL LLDEPTNDLD VDTLRALEEA
LEDFAGCAVI ISHDRWFLDR IATHILAFED DSHVEWFEGN FQDYEKDKMR RLGQDSVIPH
RAKYKKLTR