Gene RPB_2847 details


Gene Information

Locus tag: RPB_2847
Symbol:
ID: 3910640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3240518
End bp: 3242389
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 65%
IMG OID: 637884747
Product: extracellular solute-binding protein
Protein accession: YP_486460
Protein GI: 86749964
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
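
The start, end, and length fields in this record are consistent with one another. A minimal sketch of the arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; neither assumption is stated in the record itself.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 3240518
end_bp = 3242389

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1872 bp
print(protein_length, "aa")  # 623 aa
```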


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACCGAA CCGAATCAGA TCGCCGATTC GCGCCCCGCG CGGGCCTGCT CGCAGGCGCG 
CTGGCGATCG CCACCGCCGT CGGCCTTGCC ATCCCGGTCA ACCACGCGGC GATGGCCGGA
ACGGAGCTCG CGGCGAAGCC GGCCTATGCG TTGGCGATGC ATGGCGATCC CGCCCTGCCC
GCCGAGTTCA AGGCAATGCC CTATGCCGAC CCCGACGCGC CCAAAGGCGG CCGGCTGGTG
GAGGGCCTGC TCGGCACTTT CGACAGCCTC AATCCGTTCA TCGTCAAGGG CATCGCCGTG
CAGCGGATGC GCGGCTATGT GGTCGAAAGC CTGATGGCGC GCGGCAATGA CGAGCCGTTC
ACCCTGTACG GCCTGCTGGC GCAATCGATC GAGACCAACG ACGCCCGCAG CTACGTCACC
TTCCGCATCG ATCCGCGAGC CCGGTTCTCC GACGGCAAGC CGGTGCAGGC CGAAGACGTG
CTGTTCTCCT GGCAATTGCT GCGCGACAAG GGGCGCCCCA ATCATCGGCA GTACTACGCC
AAGGTGGCGC GCGCGAAGGC CCTTGATCCC CTCACGATCC GCTTCGATTT CGACGATGTG
CAGGATCGCG AACTGCCGCT GATCCTCGGC TTGATGCCGG TCTTTCCGAA GCACGCCGTC
AATGCCGACA CGTTCGAGGA GACATCGCTG TCGCCGCCGA TCGGATCAGG TCCCTATCGC
GTCACCGAGG TGAAGGCCGG CGCCAGCGTG ACGCTGACGC GCAACCCGGA CTATTGGGGC
CGCGATCTCC CGGTCAATCG CGGCTTGTGG AACTTCGACG AAATCCGGAT CGATTATTTC
CGGGAGGCAA ATTCGCATTT CGAAGCCTTC AAGCGCGGGC TTTACGACTA TCGCGTCGAG
ACCGAGCCGC TACGCTGGCA CGATGGCTAC GACTTTCCGG CCGCGAAGAG CGGCAATGTG
ATCCGCGACG CCTTCAGGAC CGGGATGCCG CAGCCGAGCG AGGTGCTGGT CTTCAACACG
CGGCGGCCGG TGTTCGCCGA TATCCGCGTC CGCGAGGCGC TGCTGCAACT GTTCGATTTC
GCCTGGATCA ACCGCAACTA CTTCTTCGAT TTGTACGCCC GCGCCGGCGG CTTCTTCGCC
GGCTCGGAGC TTTCCGCCTA TGGCCGACCG GCAGGCCCCG GCGAATTGAA GCTGCTGGCG
CCGTATCTGT CGCGGCTGCG GCCCGAGTTC CTCGATGGCA GCTACCGCCT GCCTGTCAGC
GACGCGTCGG GCCGCGACCG CACCACGCTG CGACGCGCCC TGTCGCAACT GGCGGAGGCG
GGGTACGAAC TCGACGGCAC GGTGCTGCGC CGACGCGACA ACCACCAGCC GCTGACATTC
GAAATTCTGG TGACCACGCG GGATCAGGAG CGCATCGCAC TGGCGTTCGC GCGCGACGTC
AAACGGGTCG GTGTTCAGGC CTCGGTCCGC GTGGTCGACG CGGTGCAGTT CGACCAGCGC
CGGATCGCGT TCGATTTCGA TATGATCCCG AACCGCTGGG ACCAGTCGCT GTCTCCGGGC
AACGAACAGT ATTTTTATTG GGGCGCGGAG GCCGCCGACA CCCAGGGCAC CCGCAACTAC
ATGGGCGTCA AGGATCCCGC GGTCGACGCC ATGATCGCGG CGATGATCGG CGCGCGGGAG
CATCCGCAAT TCGTCGATTC GGTGCGCGCG CTCGACCGGG TTCTGACCTC GGGTGTCTAC
GTGATCCCGC TCTACAACAT TCAGGAACAA TGGATCGCAA GATGGAATCG GATAGAACGG
CCGAAAGCAA ATGCCCTGAC CGGCTACCTG CCCGAAACCT GGTGGGCGAG GCCATCGACG
CAGCAAAGGT GA
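
The GC content field can be checked against the gene sequence. The sketch below is one straightforward way to compute a GC fraction; the helper name `gc_content` is hypothetical, and how the database itself derives its 65% figure is not stated in the record. The sample input is only the first line of the sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = "GTGGACCGAA CCGAATCAGA TCGCCGATTC GCGCCCCGCG CGGGCCTGCT CGCAGGCGCG"
print(f"{gc_content(first_line):.0%}")
```

Passing the full 1872 bp sequence instead of `first_line` gives the value to compare with the GC content field.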
 
Protein sequence
MDRTESDRRF APRAGLLAGA LAIATAVGLA IPVNHAAMAG TELAAKPAYA LAMHGDPALP 
AEFKAMPYAD PDAPKGGRLV EGLLGTFDSL NPFIVKGIAV QRMRGYVVES LMARGNDEPF
TLYGLLAQSI ETNDARSYVT FRIDPRARFS DGKPVQAEDV LFSWQLLRDK GRPNHRQYYA
KVARAKALDP LTIRFDFDDV QDRELPLILG LMPVFPKHAV NADTFEETSL SPPIGSGPYR
VTEVKAGASV TLTRNPDYWG RDLPVNRGLW NFDEIRIDYF REANSHFEAF KRGLYDYRVE
TEPLRWHDGY DFPAAKSGNV IRDAFRTGMP QPSEVLVFNT RRPVFADIRV REALLQLFDF
AWINRNYFFD LYARAGGFFA GSELSAYGRP AGPGELKLLA PYLSRLRPEF LDGSYRLPVS
DASGRDRTTL RRALSQLAEA GYELDGTVLR RRDNHQPLTF EILVTTRDQE RIALAFARDV
KRVGVQASVR VVDAVQFDQR RIAFDFDMIP NRWDQSLSPG NEQYFYWGAE AADTQGTRNY
MGVKDPAVDA MIAAMIGARE HPQFVDSVRA LDRVLTSGVY VIPLYNIQEQ WIARWNRIER
PKANALTGYL PETWWARPST QQR
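
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG can serve as an initiator codon and is then read as methionine. The sketch below illustrates this on the first 30 bases and the last 12 bases of the gene. The names `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical, and the start-codon set is a deliberate simplification that lists only three of the initiators table 11 allows.

```python
# Codon table built in TCAG order. Table 11 uses the same amino-acid
# assignments as the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Simplification: only three of the table 11 initiator codons are listed.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, is_gene_start: bool = True) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        # An initiator codon at the very start of the gene is read as Met.
        if pos == 0 and is_gene_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGACCGAA CCGAATCAGA TCGCCGATTC"))       # MDRTESDRRF
# Last 12 bases of the gene sequence, not a gene start.
print(translate("CAGCAAAGGT GA", is_gene_start=False))     # QQR
```

Both outputs match the corresponding ends of the protein sequence listed above.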