Gene RPC_3726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3726 
Symbol 
ID3971471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4145792 
End bp4146970 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content63% 
IMG OID637926836 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_533580 
Protein GI90425210 
COG category[R] General function prediction only 
COG ID[COG4239] ABC-type uncharacterized transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTCA TCGCGCGCCA GCCGATCGAA TCGACCACCA CCGCGCCGCT CGGCGAGGCG 
GTGCCGCCGG CGCGCCGGCT GTTGGCGCCG TCGCCGCTCA ACCGCCGGCG TTGGCAGAAC
TTCAAATCCA ACCGCCGCGG CTATTGGTCG TTCTGGATCT TTCTGTTGTT GTTCTTCGTT
TCGCTGTTCG CCGAACTGAT CGCCAACGAT CGGCCGTTCC TGATCAAGTT CGACGGCAAA
TTGTATTTCC CGGCCTTTGT CAGCTATTCG GAGACGACGT TCGGCGGCGA TTTCGAGACC
GCGGCGGATT ATCGCGACCC GTTCCTGCAG AAGCTGATCG CGGAGAAAGG CGGCACCACG
ATCTGGCCGC TGATTCGCTA TTCCTACGAC ACCCACAATC TCGACCTGCC GACGCCGGCG
CCGTCGAAGC CGACCTGGAT GCTGACCGAG GCCGAATGCA AACCGGTGGT GCAGAAGAAA
GGCCTCAATA GCTGCCGCGA CCTCGAATAC AACTGGCTCG GCACCGACGA CCAGGGCCGC
GACGTGGTGG CGCGGCTGAT CTACGGCTTC CGCATCTCGG TGCTGTTCGG CCTCAGCCTG
ACCATCATCT CCTCGGTGAT CGGCGTCGCC GCCGGCGGCA TCCAGGGCTA TTTCGGCGGC
TGGGTCGATC TCGGTTTCCA GCGTTTCATC GAGGTGTGGA GCGCTATTCC GTCGCTATAT
CTGTTGCTGA TCCTGTCCTC GGTGCTGGTG CCGGGCTTCT TCGTGCTGCT CGGCATTCTC
TTGTTGTTCT CCTGGGTGTC GCTGGTCGGC CTGGTGCGCG CCGAGTTTCT GCGCGGGCGC
AATTTCGAAT ACATCATGGC GGCGCGCGCG CTCGGCGTCT CCAACGCCAA GATCATGATC
CGGCATCTTT TGCCGAACGC CATGGTCGCC ACCATGACGT TCCTGCCGTT CATCGTGTCG
TCCTCGGTGA TGACGCTGAC CGCGCTGGAT TTCCTCGGCT TCGGACTGCC GCCGGGATCG
CCCTCGCTCG GCGAGTTGCT GTCGCAAGGC AAGGCCAACG CCCAGGCGCC GTGGCTCGGC
TTCACCGGCT TCTTCGCGGT GGCGATCATG CTGTCGCTAC TGATCTTCAT CGGCGAGGGC
GTCCGCGACG CCTTCGACCC GCGCAAGACG TTCAGGTGA
 
Protein sequence
MTLIARQPIE STTTAPLGEA VPPARRLLAP SPLNRRRWQN FKSNRRGYWS FWIFLLLFFV 
SLFAELIAND RPFLIKFDGK LYFPAFVSYS ETTFGGDFET AADYRDPFLQ KLIAEKGGTT
IWPLIRYSYD THNLDLPTPA PSKPTWMLTE AECKPVVQKK GLNSCRDLEY NWLGTDDQGR
DVVARLIYGF RISVLFGLSL TIISSVIGVA AGGIQGYFGG WVDLGFQRFI EVWSAIPSLY
LLLILSSVLV PGFFVLLGIL LLFSWVSLVG LVRAEFLRGR NFEYIMAARA LGVSNAKIMI
RHLLPNAMVA TMTFLPFIVS SSVMTLTALD FLGFGLPPGS PSLGELLSQG KANAQAPWLG
FTGFFAVAIM LSLLIFIGEG VRDAFDPRKT FR