Gene RPB_0225 details


Gene Information

Locus tag: RPB_0225
Symbol:
ID: 3909467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 253123
End bp: 254784
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 65%
IMG OID: 637882107
Product: extracellular solute-binding protein
Protein accession: YP_483847
Protein GI: 86747351
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
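
The listed lengths follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the 1662 bp figure) and that the final codon is a stop codon that is not counted in the protein length:

```python
# Values copied from the Gene Information record.
start_bp = 253123
end_bp = 254784

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the terminal stop codon adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1662 553"
```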


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCATC TTCATTCCTT CAGCCGCACT GCGGTGCTGC TCGCTCTGAC GGCCGTCGGC 
TCTGTCGCCG TGCCGCAGAT CGCTTCATCC GAAACCGTGC TGCGGATCGG CATGACGGCC
GCGGACATTC CGCGCACGCT CGGCCAGCCG GATCAGGGTT TCGAAGGCAA CCGCTTCACC
GGCCTGACGA TGTATGACGG GCTGACGATG TGGGACCTGT CGTCCGCCAC CAAGGCGAGC
GTGGTGATCC CGGGGCTCGC GACCGAATGG AAGGTCGAGG ACGCCGACAA GACCAAATGG
ATCTTCAAGC TGCGCCCCGG CGTCACCTTC CACGACGGCA CGCCTTTCAA TGCCGACGCC
GTGGTCTGGA ATGTCGACAA GGTGCTGAAC AAGGAGGCTG TTCAGTTCGA CGCCAGCCAG
GTCGGCGTCA CCGCGTCGCG GATGCCGACG CTGGTCTCGG CCAAGAAGAT CGACGACATG
ACGGTGGAGC TCACCGCCAA GGAGCCCGAC AGCTTCCTGC CGATCAACCT CACCAATCTG
TTCATCGTCA GCCCGTCGAA ATGGCAGGCG CTGTACGAGA AGGCCGAGGG CGCCGACGCC
AAGGCGCGGT CGCAGGCCGC CTGGGCCGCG TTCGCCAAGG ACGCCGCCGG CACCGGGCCG
TGGAAGATGG CGAAGTTCAC GCCGCGCGAG CGGCTCGAAC TGGTGAAGAA CGACAATTAC
TGGGACAAGG CGCGGGTGCC GAAGACCGAC CGCATGGTGC TGCTGCCGAT GCCCGAAGCC
AACGCGCGCA CCGCGGCGCT GCTGTCCGGC CAGGTCGACT GGATCGAGGC GCCTGCGCCC
GACGCGGTGA AGGAGATCAC CGCGCGCGGC TTCAAGATCG AGAAGAACGA ACAGCCGCAT
GTCTGGCCGT GGCAGTTCTC GCGGATCGAG GGCTCGCCCT GGAACGACAT CCGCGTCCGC
CGCGCCGCCA ATCTGTGCAT CGACCGCGAA GGCCTGCGTG ACGGCCTGCT CGCCGGCCTG
ATGGTGCCGG CGACCGGCAC CTTCGAGCCC GGCCATCCGT GGCGCGGCAA TCCGTCGTTC
CAGATCAAGT ACGATCTGCC GGCGGCGCAG AAGCTGATGA AGGAGGCCGG CTTCGGCCCC
GACAAGAAGC TCAGCGTCAA GGTGCAGACC TCGGCGTCCG GCTCCGGCCA GATGCTGCCG
CTGCCGATGA ACGAATATCT GCAGCAGGCT CTCGCCGAGT GTTACTTCGA CGTCAAGCTC
GATGTCATCG AATGGAACAC GCTGTTCACC AATTGGCGGC GTGGCGTCAA GGATCCCTCG
GCCAACGGCA GCAACGCCAC CAATGTCACC TACGCGGCGA TGGATCCGTT CTTCGCGCTG
GTCCGCTTCC TGCAATCGTC GATGGCGCCG CCGGTGTCGA ACAATTGGGG CTACATCAAC
AACCCCAAGT TCGACGAGTT GGTGAAGAAG GCACGGACCA GCTTCGACGC CGCCGAGCGC
GACAAGGCGC TGGCCGAGCT GAACGCCGCC TCGATCGACG ACGCGGCCTT CCTCTACGTC
GCCCACGACG TCGGCCCGCG CGCGATGAGT CCGAAGGTCA AGGGCTTCGT GCAGCCCAAG
AGCTGGTTCG TCGACTTCTC GCCGGTGTCG ATGGCGCCGT AA
 
Protein sequence
MRHLHSFSRT AVLLALTAVG SVAVPQIASS ETVLRIGMTA ADIPRTLGQP DQGFEGNRFT 
GLTMYDGLTM WDLSSATKAS VVIPGLATEW KVEDADKTKW IFKLRPGVTF HDGTPFNADA
VVWNVDKVLN KEAVQFDASQ VGVTASRMPT LVSAKKIDDM TVELTAKEPD SFLPINLTNL
FIVSPSKWQA LYEKAEGADA KARSQAAWAA FAKDAAGTGP WKMAKFTPRE RLELVKNDNY
WDKARVPKTD RMVLLPMPEA NARTAALLSG QVDWIEAPAP DAVKEITARG FKIEKNEQPH
VWPWQFSRIE GSPWNDIRVR RAANLCIDRE GLRDGLLAGL MVPATGTFEP GHPWRGNPSF
QIKYDLPAAQ KLMKEAGFGP DKKLSVKVQT SASGSGQMLP LPMNEYLQQA LAECYFDVKL
DVIEWNTLFT NWRRGVKDPS ANGSNATNVT YAAMDPFFAL VRFLQSSMAP PVSNNWGYIN
NPKFDELVKK ARTSFDAAER DKALAELNAA SIDDAAFLYV AHDVGPRAMS PKVKGFVQPK
SWFVDFSPVS MAP
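
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: the names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers, not part of any tool that produced this record. It uses the standard codon assignments, which translation table 11 shares with the standard code for elongation (table 11 differs in which start codons it permits; this gene starts with ATG). Only the first and last lines of the gene sequence are used as input.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last lines of the gene sequence; 1620 bp precede the last line,
# so it starts in frame.
first_line = "ATGCGCCATC TTCATTCCTT CAGCCGCACT GCGGTGCTGC TCGCTCTGAC GGCCGTCGGC"
last_line = "AGCTGGTTCG TCGACTTCTC GCCGGTGTCG ATGGCGCCGT AA"

print(translate(first_line))  # prints "MRHLHSFSRTAVLLALTAVG"
print(translate(last_line))   # prints "SWFVDFSPVSMAP"
```

Both results match the start and end of the listed protein sequence, and the final TAA codon is the stop that is excluded from the 553 aa length.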