Gene RPB_4050 details


Gene Information

Locus tag: RPB_4050
Symbol:
ID: 3911857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4620409
End bp: 4621422
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 66%
IMG OID: 637885954
Product: binding-protein dependent transport system inner membrane protein
Protein accession: YP_487654
Protein GI: 86751158
COG category: [E] Amino acid transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
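
The Start bp, End bp, Gene Length and Protein Length values above are consistent with one another. The short Python sketch below shows the arithmetic. It assumes that the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither is stated in the record, although both agree with the reported numbers. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4620409, 4621422)
print(length, protein_length_aa(length))  # 1014 337
```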


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.208624
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCACCA TGATCGGCAA GCGTCTGATG TTCGCGATTC CGAGCCTGAT CGGCGTCGTG 
ATCGTCACCT TTCTGCTGAC CCGGGCGCTG CCCGGCGATC CGGCCGCTTA CTTCGCCGGG
CCCGCTGCCA GCAAGGAGGC GATCGAGCAG ATCCGGCAGA AGCTCGGCCT CGACAAGACG
CTGGCCGAGC AATTCGTGCG CTACACCACC GAACTCGCCC AGGGCGATCT CGGCCAGTCG
CTGACCACCG GACAGCCGGT CGCCACCGAG ATCCGCAACC GGCTGCCGGC CTCCGCCGAG
CTCACCTTGC TCGGCCTGAT GCTGGCGATC GTGATCGCGA TCCCGCTCGG CATCATGGCG
GCGACGCGAC CGGGCTCGTG GATCGATCAC ATCTGCCGCG TCACCACCAC GGCCGGCGTC
TCGCTGCCGG TGTTCTTCAC CGGGCTGCTG CTGGTCTATG TGTTTTACTT CAAGCTCGGC
TGGTCGCCGG CGCCGCTCGG CCGGCTCGAC GTGTTCTACT CGGCGCCGCC GAACGTCACC
GGCTTCTATC TGATCGACAG CCTGATCGCG CGCGAATTCG AGACCTTCCG ATCGGCGTTG
AGCCAATTGC TGCTTCCGGC ATTGACGCTG GCGATCTTCT CGCTGGCGCC GATCGCGCGC
ATGACGCGGG CCTCGATGCT GGCGATCCTG TCGTCCGATT TCGTCCGCAC CGCGCGCGCC
TCCGGCCTGT CGCCCGGCAA GGTGACGATG ACCTACGCCT TCCGCAACGC GATGCTGCCG
GTGATCACCA CGCTCAGCAT GGTGTTCTCG TTCCTGCTCG GCGCCAATGT GCTGGTCGAG
AAGGTGTTCG CCTGGCCGGG CATCGGCTCC TACGCGGTCG AGGCGCTGAT CTCGTCGGAC
TTCGCCCCGG TGCAGGGCTT CGTGCTCACC ATGGCGATCA TGTATGTGCT GCTGAACCTG
GTGATCGACA TCCTCTACGG CGTCATCGAT CCGCGCGTGC GGCTGGAAGG ATAG
 
Protein sequence
MLTMIGKRLM FAIPSLIGVV IVTFLLTRAL PGDPAAYFAG PAASKEAIEQ IRQKLGLDKT 
LAEQFVRYTT ELAQGDLGQS LTTGQPVATE IRNRLPASAE LTLLGLMLAI VIAIPLGIMA
ATRPGSWIDH ICRVTTTAGV SLPVFFTGLL LVYVFYFKLG WSPAPLGRLD VFYSAPPNVT
GFYLIDSLIA REFETFRSAL SQLLLPALTL AIFSLAPIAR MTRASMLAIL SSDFVRTARA
SGLSPGKVTM TYAFRNAMLP VITTLSMVFS FLLGANVLVE KVFAWPGIGS YAVEALISSD
FAPVQGFVLT MAIMYVLLNL VIDILYGVID PRVRLEG
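
The gene and protein sequences above are printed in 10-character blocks, so they need whitespace stripped before any analysis. The following is a minimal Python sketch, not the database's own pipeline, of how the GC content and the translation could be re-derived from the gene sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which codons may act as start codons). The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard code; table 11
# uses the same codon-to-amino-acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed above.
first_line = clean("ATGCTCACCA TGATCGGCAA GCGTCTGATG")
print(translate(first_line))  # MLTMIGKRLM
```

Passing the full gene sequence through `clean` and then `gc_content` or `translate` gives values that can be compared with the GC content and protein sequence reported in this record.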