Gene RPB_4051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4051 
Symbol 
ID3911858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4621428 
End bp4622348 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content67% 
IMG OID637885955 
Productbinding-protein dependent transport system inner membrane protein 
Protein accessionYP_487655 
Protein GI86751159 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.722195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCG TCGCGCCCCC GATCACCGAG CCCGCCAAGC CCTCAACGGC CGCAAAGCCC 
GTGCGGCGCT CCGGCCTGGT CGAAATGATC GCGCACACCC GCTACGTCCT CGGCGACAAC
CGCGTCACCG CCTTCGCCTT CGGGCTGCTG GTGGTGATCG TTTTCGCCGC ATTGTTCGGC
CCGTATATCG TTCCCTACGA CCCGCTCGCC AGCAATACCG CGCAGGCGCT GAAGCCGCCG
TCCGCCGCCA ACTGGTTCGG CACCGACCAG CTCGGCCGCG ATATCTTCAG CCGCGTCGTC
GTCGCCACCC GGCTCGATCT GTTCATCGCC GTCGCCTCGG TGGTGCTGGT GTTCCTGATG
GGCGGCCTCG CCGGCATCGC AGCCGGTTAT TTCGGCGGAT GGACCGACCG CATCGTCGGC
CGCATCGCCG ACACCATCAT GGCGTTTCCG CTGTTCGTGC TGGCGATGGG CATCGTCGCG
GCGCTCGGCA ACACCGTGCA GAACATCATC ATCGCCACCG CGATCGTCAA CTTCCCGCTT
TACGCCCGGG TCGCCCGCGC CGAGGCCAAT GTCCGGCGCG AGGCCGGCTT CGTCATGGCA
GCAAGGCTTT CGGGCAACAG CGAGATGCGC ATCCTGCTGG TGCACATCCT GCCGAACATC
ATGCCGATCA TGATCGTGCA GATGTCGCTG ACGATGGGCT ACGCCATCCT CAACGCCGCC
GGGCTGTCGT TCATCGGCCT CGGCGTCCGC CCGCCCACCG CCGAATGGGG CATCATGGTC
GCCGAGGGCG CCTCGTTCAT GGTCTCGGGC GAGTGGTGGA TCGCGCTGTT CCCCGGCCTC
GCGCTGATGA CCGCCGTGTT CTGCTTCAAC CTGCTCGGCG ACGGCCTGCG CGACATCTTC
GACCCGCAGC GGAGGACGTG A
 
Protein sequence
MSSVAPPITE PAKPSTAAKP VRRSGLVEMI AHTRYVLGDN RVTAFAFGLL VVIVFAALFG 
PYIVPYDPLA SNTAQALKPP SAANWFGTDQ LGRDIFSRVV VATRLDLFIA VASVVLVFLM
GGLAGIAAGY FGGWTDRIVG RIADTIMAFP LFVLAMGIVA ALGNTVQNII IATAIVNFPL
YARVARAEAN VRREAGFVMA ARLSGNSEMR ILLVHILPNI MPIMIVQMSL TMGYAILNAA
GLSFIGLGVR PPTAEWGIMV AEGASFMVSG EWWIALFPGL ALMTAVFCFN LLGDGLRDIF
DPQRRT