Gene RPB_3079 details

Gene Information

Locus tag: RPB_3079
Symbol:
ID: 3910880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3510369
End bp: 3511373
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 64%
IMG OID: 637884984
Product: oligopeptide/dipeptide ABC transporter, ATP-binding protein-like
Protein accession: YP_486689
Protein GI: 86750193
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
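
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS that ends with one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3510369, 3511373)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1005 bp) and Protein Length (334 aa).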


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.41088
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCGC TGCTCGAAAT TCGCGATCTC GAAGTTGATC TGTTCACGCG GCGCGGCGTG 
ATGCGCGCGA TCGACCGCTT TTCCCTCACC GTCGAACACG GGCAGACGCT TGGCCTGGTC
GGTGAGTCCG GCTGCGGCAA ATCCATGACT TCGCTGGCGA TCATGGGGCT GCTACCGATG
CCGCCGGCAA AGGTGACGGG CGGCGCCATT CTGCTCGAAG GCCGCGATCT CATCGGCTTG
AGCGACACCA GGATGCAAGC GGTGCGGGGA CGCGAGATCG GGATCATCTT CCAGGACCCG
ATGAGCTCGC TGAACCCGGT CTACACCGTC GGCTATCAGC TCACCGAGGT GCTCCGCCGG
CATTTCAAGC TGGATCGACG CGCCGCGGAC AGGCGGGCGT TGGAATTGCT GGACCGGGTG
CACATTCCGG ATGCCCGGCG TCGCTTCGAT GCCTATCCGC ACGAATTGTC CGGCGGCATG
AATCAGCGCG TCGTCATCGC CATGGCGATC GCGTGCGAGC CGAAGTTGCT GATCGCCGAC
GAACCGACCA CGGCGCTCGA CGTCACCATC CAGGCACAGA TCATCGAACT GCTCAAGGAC
ATCCAGCGCG AATCCCGGAT GGGGATGATC TTCATCACCC ACGATCTCGG GGTGATCGCC
GATGTCGCCG ACCGCGTCAC CGTGATGTAC GCGGGCAAGA AGGCCGAGGA GGCACCGGTT
GGCGTACTCT TCGACGATCC ACGCCATCCC TATACGCGGG GTCTGATCGG CGCCACGCCG
AAGCCGGGCG AGGAACGGCG GCGGCGGCTC GTCGAAATTC CCGGGACCGT GCCGGGCTTA
TCGGACCGTC CGAAAGGCTG CGCTTTCGCC AACCGCTGTC CGCTGGTGTT CGAGCGTTGC
CGCGTCGAAC ATCCGGCTTT GATCAATCAG GGTGCGGGCC ACGAGGCTGC CTGCTTCCTG
GCTACCGAAA CGGAGGACTA CAATGTCGCT TCTGTCGGTG CGTAG
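
The GC content reported for this gene can in principle be recomputed from the gene sequence above. The sketch below assumes GC content is the percentage of G and C bases over the full sequence length, and that the spaces and line breaks in the listing are only formatting. The helper name `gc_content` is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing and newlines
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first 10-base block of the gene sequence only.
print(gc_content("ATGGCGGCGC"))
```

Passing the complete 1005 bp sequence instead of the first block gives the value to compare with the GC content field of the record.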
 
Protein sequence
MAALLEIRDL EVDLFTRRGV MRAIDRFSLT VEHGQTLGLV GESGCGKSMT SLAIMGLLPM 
PPAKVTGGAI LLEGRDLIGL SDTRMQAVRG REIGIIFQDP MSSLNPVYTV GYQLTEVLRR
HFKLDRRAAD RRALELLDRV HIPDARRRFD AYPHELSGGM NQRVVIAMAI ACEPKLLIAD
EPTTALDVTI QAQIIELLKD IQRESRMGMI FITHDLGVIA DVADRVTVMY AGKKAEEAPV
GVLFDDPRHP YTRGLIGATP KPGEERRRRL VEIPGTVPGL SDRPKGCAFA NRCPLVFERC
RVEHPALINQ GAGHEAACFL ATETEDYNVA SVGA
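
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this without external libraries, under stated assumptions: the gene is read in frame from its first base, and the codon-to-amino-acid assignments of translation table 11 are the same as in the standard code (the two tables differ in the set of permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 up to, but not including, the first stop codon."""
    seq = "".join(dna.split()).upper()  # ignore block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCGGCGC TGCTCGAAAT TCGCGATCTC"))
```

Translating the first 30 bases yields the first ten residues of the listed protein, MAALLEIRDL. The full gene sequence can be passed in the same way to compare against the complete 334 aa protein.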