Gene RPB_1771 details


Gene Information

Locus tag: RPB_1771
Symbol:
ID: 3909758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2030473
End bp: 2032350
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 67%
IMG OID: 637883665
Product: extracellular solute-binding protein
Protein accession: YP_485390
Protein GI: 86748894
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
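
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 2030473
END_BP = 2032350


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1878
print(protein_length(length_bp))  # 625
```

Both results match the Gene Length (1878 bp) and Protein Length (625 aa) fields of the record.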


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.235807
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0344403
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAGC TCAACCGCCG CGAAGTTCTC GTCCTCGGCG TCGGCGCCCT GGCGGCGGCC 
CGGATCAGCC CCGCATCCGC GGCCGACGGC GAGACCACAG CCCACGGCAT GTCGGCGTTC
GGCGACCTGA AATACAAGGC GGACTTTCCG CATTTCGACT ACGTCGATCC GCAGGCCCCC
AAAGGCGGGC TGTTCTCGAC CATCCCGTCC AGCCGCGCCT TCAATCAATC GTTCCAGACC
TTCAACTCGC TCAACGCCTA CATCCTCAAG GGCGACGGCG CGCAGGGCAT GGGTCTCACC
TTCGCGTCCT TGATGGCCCG GGCCGGCGAC GAGCCCGACG CGATGTACGG CCTCGCGGCC
GCGAGGGTCG CGATCTCCGC CGACGGGTTG AGCTATCGCT TCACCATGCG TCCAGAGGCG
CGCTTCCACG ACGGCAGCAA GCTCACCGCG CGCGACGCGG CGTTCTCGCT GAACATCCTC
AAGGCCAAGG GGCATCCGCT TATCACCCAG CAGATGCGCG ATTTCATCGA GGCGAAGGCA
GTCGACGATT CGACGCTGTT GGTGACGTTC AAGCCGAAGC GCGGCCGCGA CGTGCCGCTG
TTCGTGGCCG CGCTGCCGTT GTTCTCCGAG GCGTACTACG CCAAGCGGCC GTTCGACGAA
TCGACCATGG AGATCCCGCT CGGCAGCGGG CCCTACAAGG TCGGCCCTTT CGAATCCGGC
CGCTTCATCG CTTTCGAGCG CGTCAAGGAC TGGTGGGGCG CGGCGCTGCC GGTCAATGTC
GGGGCGTTTA ATTTCGACAC CGTCCGGTTC GAGTTCTATC GCGATCGCGA CGTCGCCTTC
GAGGGCTTCA CCGGCCGCAA CTATCTGTAT CGCGAGGAAT TCACCTCGCG GATCTGGAAC
ACGCGCTACG ACTTTCCGGC GATCCATGAG GGCCGGGTCA AGCGCGAGAC GCTGCCCGAC
GAGACGCCGT CCGGCGCGCA GGGCTGGTTC ATCAACACGC GGCGCGACAA GTTCAAGGAT
CCGCGGGTGC GCGAGGCGCT GGGCTGCGCG TTCGACTTCG AATGGACCAA CAAGACCATC
ATGTACGGCG CCTATGCGCG CACGGTGTCG CCGTTCCAGA ATTCCGACAT GATGGCGGTC
GGGCCGCCGT CGGCCGACGA ACTGGCGCTG CTCGAGCCGT TCCGCGGCAA GGTGCCCGAC
GAGGTGTTCG GCGCGCCGTT CGTGCCGCCG GCCTCCGACG GCTCGGGGCA GGACCGCGCG
CTGCTGCGCC GCGGCGGTCA GTTGCTGACA GAGGCCGGAT TCGTCGTCAA GGACCGCAAG
CGCCTGATGC CGAACGGCGA GCCGATGCGC GTCGAGTTTC TGCTCGACGA GCCGGCGTTC
CAGGCCCACC ACATGCCGTT CGTCAAGAAC CTCGCCACGC TCGGCATCGA GGCGACGGTG
CGGCTGGTCG ACCCGGTTCA GTCGCGGGCG CGCCGCGACG ATTTCGACTT CGACATGGCG
ATCGAGCGTT TCAGCTTTTC GACCGTGCCG GGCGAGGCAC TGCGCAATTT CTTCTCGTCG
CAATCGGCCG CCATCAAGGG CTCGAACAAT CTCGCCGGCA TCGCCGATCC GGCGATCGAC
GCGATGATCG ATCGGGTGAT CGCCGCCGAC AGCCGTGCCG ACCTCGTCGT CGCCGCGCGC
GCGCTCGACC GGCTCGTGCG CGCCGGCCGC TATTGGGTGC CGCAGTGGTT TTCGTCGTCG
CATCGGCTGG CCTATTGGGA CGTGTTCGGC CATCCGCCGA ACCTGCCGAA ATACACCGGC
GTCAGCGCCC CGGACCTGTG GTGGGCGAAA AGCAATCCCG CCGCCGAGCG AAGCGACCCG
AAGGGCGAGG GGAAGTAG
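
The GC content field of the record can be recomputed from a sequence printed in this layout, where the bases are grouped in blocks of ten separated by spaces and line breaks. The following is a minimal sketch, assuming the sequence contains only the letters A, C, G, and T once whitespace is removed; `gc_content` is a hypothetical helper name. It is demonstrated on the first ten-base block only, not on the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First ten-base block of the gene sequence above.
first_block = "ATGGCGCAGC"
print(gc_content(first_block))  # 0.7
```

Passing the complete gene sequence, pasted with its spaces and line breaks intact, would give the value to compare against the GC content field.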
 
Protein sequence
MAQLNRREVL VLGVGALAAA RISPASAADG ETTAHGMSAF GDLKYKADFP HFDYVDPQAP 
KGGLFSTIPS SRAFNQSFQT FNSLNAYILK GDGAQGMGLT FASLMARAGD EPDAMYGLAA
ARVAISADGL SYRFTMRPEA RFHDGSKLTA RDAAFSLNIL KAKGHPLITQ QMRDFIEAKA
VDDSTLLVTF KPKRGRDVPL FVAALPLFSE AYYAKRPFDE STMEIPLGSG PYKVGPFESG
RFIAFERVKD WWGAALPVNV GAFNFDTVRF EFYRDRDVAF EGFTGRNYLY REEFTSRIWN
TRYDFPAIHE GRVKRETLPD ETPSGAQGWF INTRRDKFKD PRVREALGCA FDFEWTNKTI
MYGAYARTVS PFQNSDMMAV GPPSADELAL LEPFRGKVPD EVFGAPFVPP ASDGSGQDRA
LLRRGGQLLT EAGFVVKDRK RLMPNGEPMR VEFLLDEPAF QAHHMPFVKN LATLGIEATV
RLVDPVQSRA RRDDFDFDMA IERFSFSTVP GEALRNFFSS QSAAIKGSNN LAGIADPAID
AMIDRVIAAD SRADLVVAAR ALDRLVRAGR YWVPQWFSSS HRLAYWDVFG HPPNLPKYTG
VSAPDLWWAK SNPAAERSDP KGEGK
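
The protein sequence can be checked against the gene sequence by translating the codons. The sketch below is a plain-Python illustration, not the database's own pipeline. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs from it only in the set of permitted start codons, so a standard codon table is used here. This simplified version does not handle alternative start codons, which is sufficient for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table, built with the bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGGCGCAGC TCAACCGCCG CGAAGTTCTC"))  # MAQLNRREVL
print(translate("AAGGGCGAGG GGAAGTAG"))               # KGEGK
```

The two outputs match the first ten and last five residues of the protein sequence listed above.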