Gene RPB_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0234 
Symbol 
ID3907861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp267473 
End bp268528 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content70% 
IMG OID637882116 
Productoligopeptide/dipeptide ABC transporter, ATP-binding protein-like 
Protein accessionYP_483856 
Protein GI86747360 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATGA TCGAGGCGTC CGCATCGACC GGCACCGGCA CGCACGGCGC CGTGCGTTCC 
GGCATCGCGC CGCAACCTTT GCTCGAGGTG CGCAATCTGT CGCTGTCGTT CGCGACCGCC
GCCGGCGCGT TGCCGGTCAC CCGCAACGTG AGTTTCACGG TCGCGGCCGG CGAGCGTGTC
GGCCTGGTCG GCGAGAGCGG CTGCGGCAAG ACCGTCACCG GGCTCTCCTT GCTTCGATTG
CTGCCGGCGC ATTCCGCACG GATCGAAGGC GACGTGCTGT TCGACGGCAC CGATCTCTTG
AAGCTGTCGC CGCGGCGGAT GCGCGCGGTG CGCGGCCGTG ACATCGCGAT GATCTTTCAG
GAGCCGATGA GCGCGCTCGA TCCGGTGTTC ACCGTCGGCG ATCAGATCAG CGAGGCCTAT
CGCATCCATT TTCCCGCCGG CAAGGCGGAG GGGCGCGAAC GCGCCATCGC GGCGCTGCGC
GAGGTCGGGA TTCCGGCGCC GGAGCGGCGC TGCGACGAGT ATCCGCATCA GCTTTCCGGC
GGCATGCGCC AGCGCGTGAT GATCGCGATG GCGTTGATCT GCAAGCCGAA GCTGCTGATC
GCCGACGAGC CGACCACCGC GCTCGACGTC ACCGTGCAGG CGCAGATCAC CGATCTGTTG
CGCTCGCTGA GCGAGCGCAC CGGCACGGCG CTGATCTTCA TCACCCACGA TCTCGGCGTC
GTCGCCGAGA CCTGCACCCG GATGATCACG ATGTATGCGG GCGAAGTCGT CGAGGACGCG
CCGGTCGACG ATGTGCTGAC CCGGCCGCGC CATCCCTACA CGTCCGGACT GCTGCGATCT
CTGCCGGGGC TGAGCAAGCG CCGCGGCGTG CTGGCCTCGA TCCCCGGCCG CGTGCCGTCG
CCACAAGCGA TGCCCGCCGG CTGCCGCTTC CGCTCGCGCT GCGCCCATGC GGCGCCGGGC
TGCGAGCAGG ACCAGCGCAT GATCGCGATC GAGGCTGGCC ACGCCGCGCG ATGCCGGCGC
GCGCCGGATC TCGCTTTGCC TGGGGCGGTG AACTGA
 
Protein sequence
MGMIEASAST GTGTHGAVRS GIAPQPLLEV RNLSLSFATA AGALPVTRNV SFTVAAGERV 
GLVGESGCGK TVTGLSLLRL LPAHSARIEG DVLFDGTDLL KLSPRRMRAV RGRDIAMIFQ
EPMSALDPVF TVGDQISEAY RIHFPAGKAE GRERAIAALR EVGIPAPERR CDEYPHQLSG
GMRQRVMIAM ALICKPKLLI ADEPTTALDV TVQAQITDLL RSLSERTGTA LIFITHDLGV
VAETCTRMIT MYAGEVVEDA PVDDVLTRPR HPYTSGLLRS LPGLSKRRGV LASIPGRVPS
PQAMPAGCRF RSRCAHAAPG CEQDQRMIAI EAGHAARCRR APDLALPGAV N