Gene RPD_3534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3534 
Symbol 
ID4024048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3923295 
End bp3925175 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content66% 
IMG OID637963738 
Productextracellular solute-binding protein 
Protein accessionYP_570658 
Protein GI91977999 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.173833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCAGC TCAACCGCCG CAATGTGCTC GGCCTCGGAA TCGGCGCGCT GGCCGCGGCG 
CATCTTCGCC CCGCGGCTGC GGCCGAGGGA GAGACGGTCG CCCACGGCAT GTCCGCCTTC
GGCGACCTGA AGTACCGGGC CGATTTTCCG CATTTCGACT ACGTCAATCC TCGGGCGCCG
AAGGGCGGGC TGTTCTCGAC CATTCCGTCG GTGCGCGCCT TCAACCAGTC GTTTCACACG
TTCAATTCGC TCAATGCCTA CGTCCTGAAG GGCGATGGCG CTCAGGGCAT GGGCCTCACT
TTCGCGACGC TGATGGCGCG GGCCGGCGAC GAGCCCGACG CGATGTACGG CCTCGCGGCG
TCGTCGGTGG CGATCTCTCG CGACGGTCTG ACCTATCGCT TCACCATGCG CCCGGAGGCG
CGCTTCCACG ACGGCAGCAA GCTCACCGCG CGAGACGCCG CGTTCTCGCT GAACATCCTG
AAGGCCAAGG GCCATCCGCT GATCACCCAG CAGATGCGCG ACTTCATCAA GGCGGAAGCG
ACCGACGACG CCACGCTGGT CGTGACGTTC GCGCCGAAGC GCGGCCGCGA CGTACCGCTG
TTCACGGCCT CCCTGCCGCT GTTCTCCGAG GCCTACTACG CGAAACGGCC GTTCGACGAA
TCGACCATGG AGGTGCCGCT CGGCAGCGGC CCCTACAAGG TGGGCCATTT CGAATCCGGC
CGCTCCATCA CCTTCGATCG CGTCAAGGAC TGGTGGGGCG CGAAGCTGCC GGTCAATGTC
GGGTCGAACA ATTTCGACAC CGTCCGGTTC GAGTTCTATC GCGATCGCGA CGTCGCTTTC
GAGGGCTTCT CCGGCCGCAA TTATCTGTAT CGCGAGGAGT TCACCTCGCG GATCTGGAGT
ACGCGCTACG ATTTCCCCGC GGTCCATGAC GGCCGGGTCA AGCGCGAGCA GCTTCCTGAC
GGGACCCCGT CCGGCTCGCA GGGCTGGTTC ATCAACACGC GGCGCGACAA GTTCAAGGAC
CCGCGGGTGC GCGAGGCGAT CGGCTGCGCG TTCGATTTCG AATGGACCAA CAAGACCATC
ATGTACGGCG CCTATCAACG CACGGTGTCG CCGTTCCAGA ATTCCGATCT GATGGCGGTG
GGGCCGCCGT CGCCCGACGA ACTGGCGCTG CTCGCACCGT ACCGCGGCAA GGTGCAGGAC
GAGGTGTTCG GCGCGCCGTT TCTGCCGCCG GCGTCCGATG GCTCGGGACA GGACCGCGCG
CTGCTGCGCA GAGGCGGTCA GCTTCTGACC GAGGCCGGCT TCGCGATCAA GGATCGCCAG
CGGCTGACGC CGCAGGGCGA GCCGATGCGG ATCGAGTTTC TGCTCGACGA GCCGTCGTTC
CAGCCGCACC ACATGCCGTT CATCAAGAAC CTCGGCACCC TGGGGATCGA GGCGACGTTG
CGGCTGGTCG ACCCGGTGCA GTTTCGCGCC CGACGTGACG ATTTTGATTT CGATATGGCG
ATCGAGCGCT TCGGCTTCTC GACCGTGCCG GGCGATGCGC TGCGCAGTTT CTTCTCGTCG
CAATCGGCCG CGACCAAGGG CTCGAACAAT CTCGCCGGCA TCGCCGATCC CGCGATCGAT
GCGATGATGG ATCAGGTGAT CGCTGCCGAC ACCCGGGCGA AGCTGGTTGT CGCCGCGCGG
GCGCTCGACC GGCTGATCCG CGCCGGCCGC TATTGGGTGC CGCAATGGTA CTCCGCCTCG
CACCGGCTGG CCTATTGGGA CGTGTTCGGC CATCCGCCGA ACCTGCCGAA ATATATCGGC
GTCGGCGCGC CGGATCTGTG GTGGTCGGAG CCGAAAGCCG CGGCCGCCGC CGACGGCGAC
GTCAAAGGCG AGGGAAAATA G
 
Protein sequence
MVQLNRRNVL GLGIGALAAA HLRPAAAAEG ETVAHGMSAF GDLKYRADFP HFDYVNPRAP 
KGGLFSTIPS VRAFNQSFHT FNSLNAYVLK GDGAQGMGLT FATLMARAGD EPDAMYGLAA
SSVAISRDGL TYRFTMRPEA RFHDGSKLTA RDAAFSLNIL KAKGHPLITQ QMRDFIKAEA
TDDATLVVTF APKRGRDVPL FTASLPLFSE AYYAKRPFDE STMEVPLGSG PYKVGHFESG
RSITFDRVKD WWGAKLPVNV GSNNFDTVRF EFYRDRDVAF EGFSGRNYLY REEFTSRIWS
TRYDFPAVHD GRVKREQLPD GTPSGSQGWF INTRRDKFKD PRVREAIGCA FDFEWTNKTI
MYGAYQRTVS PFQNSDLMAV GPPSPDELAL LAPYRGKVQD EVFGAPFLPP ASDGSGQDRA
LLRRGGQLLT EAGFAIKDRQ RLTPQGEPMR IEFLLDEPSF QPHHMPFIKN LGTLGIEATL
RLVDPVQFRA RRDDFDFDMA IERFGFSTVP GDALRSFFSS QSAATKGSNN LAGIADPAID
AMMDQVIAAD TRAKLVVAAR ALDRLIRAGR YWVPQWYSAS HRLAYWDVFG HPPNLPKYIG
VGAPDLWWSE PKAAAAADGD VKGEGK