Gene RPC_1218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1218 
Symbol 
ID3969095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1332450 
End bp1334117 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content65% 
IMG OID637924329 
Productextracellular solute-binding protein 
Protein accessionYP_531100 
Protein GI90422730 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0270957 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATGA TTGGTTTGGT GGCTCGGCGC ACAGCCGCAG CCGTGTTCGT GCTGACGACC 
GCGAGTGCGA TCGCGCTGCC GCAATCTGCC GGCGCGGAAG CCGTGCTGCG CATCGGCATG
ACAGCCGCAG ACGTGCCGCG CACGCTCGGC CAACCCGATC AGGGCTTCGA AGGCAATCGC
TTCACTGGCC TCACCATGTA CGACGCGCTG ACGATGTGGG ACCTGTCGTC TGCAACCAAA
GCCAGCGTGG TGATCCCGGG GCTTGCCACC GAATGGGCGG TGAATGAGAG CGATAAGACC
AAATGGACCT TCAAGCTGCG TCCCGGCGTC AGCTTTCATG ACGGCTCGCC GTTCAACGCC
GATGCGGTGG TCTGGAATGT GGAGAAGGTG CTGAAGCAGG ACGCGCCGCA ATTCGACGCC
AGCCAGGTCG GCGTCACCGC ATCGCGGATG CCGACATTGG TCTCGGCGAA GAAGATCGAC
GACATGACGG TGGAACTGAC CACCAAGGAG CCGGACAGCT TCCTGCCGAT CAACCTCACC
AACCTGTTCA TGGTCAGCCC GAGCAAGTGG CAGGCGTTGT ATGAGAAGGC CGAAGGCGCC
GACGCCAAGG CGAAGTCGCA GGCCGCCTGG GCGTCGTTCG CCAAGGACGC CTCCGGCACC
GGGCCGTGGA AGATGTCGAA GTTCACGCCG CGCGAACGGC TCGAACTGGT GAAGAACGAC
AAATATTGGG ACGCCACACG CGTGCCGCAT GTCGACCGCC TAGTGCTGTT GCCGATGCCG
GAAGCCAACG CCCGCACCGC GGCGCTGTTG TCCGGCCAGG TCGACTGGAT CGAGGCGCCC
GCCCCCGACG CGGTCAAGGA AATCACCGCG CGCGGTTTCA AGATCGAGAA GAACGAGCAG
CCGCACGTCT GGCCCTGGCA GTTCTCCCGC GTCGAAGGCT CGCCGTGGAA CGACATCCGG
GTGCGCCGCG CCGCCAATCT GTGCATCGAT CGCGAAGGCT TGCGCGACGG CCTGCTCGCA
GGATTGATGG TGCCGGCGAC CGGCACCTTC GAGCCCGGCC ATCCGTGGCG CGGCAAGCCG
GCATTCCAGA TCAAATACGA TCTGCCGGCG GCACAGAAGC TGATGAAGGA AGCCGGCTAC
GGCCCGACCA AGAAGCTCAG CGTCAAGGTG CAGACCTCGG CGTCGGGCTC CGGCCAGATG
CTGCCGCTGC CGATGAACGA ATATCTGCAG CAGGCGCTCG CGGAGTGCTA CTTCGACGTC
AAGCTCGACG TCATCGAGTG GAACACGCTG TTCACCAATT GGCGCCGCGG CACTAAGGAT
CCCTCCGCCA ACGGCGCCAA CGCCACCAAC GTCACCTATG CGGCGATGGA CCCGTTCTTT
GCGATGGTGC GCTTCCTGCA GTCGTCGATG GCGCCGCCGG TGTCGAACAA TTGGGGCTTC
ATCAACAACC CGAAGTTCGA CGCGCTGGTG ACCAAGGCGC GCACCACCTT CGATGCCTCG
CTGCGCGACG AAGCCTTGGC CGAACTGCAC GCCGCCTCGG TCGACGACGC CGCCTTCCTC
TACGTCGCCC ACGACGTCGG CCCGCGCGCG CTGAGCCCGA AGGTCAAGGG CTTCGTGCAG
CCGAAGAGCT GGTTCGTCGA CTTCTCGCCG GTGACGCTGG CGCCGTAA
 
Protein sequence
MRMIGLVARR TAAAVFVLTT ASAIALPQSA GAEAVLRIGM TAADVPRTLG QPDQGFEGNR 
FTGLTMYDAL TMWDLSSATK ASVVIPGLAT EWAVNESDKT KWTFKLRPGV SFHDGSPFNA
DAVVWNVEKV LKQDAPQFDA SQVGVTASRM PTLVSAKKID DMTVELTTKE PDSFLPINLT
NLFMVSPSKW QALYEKAEGA DAKAKSQAAW ASFAKDASGT GPWKMSKFTP RERLELVKND
KYWDATRVPH VDRLVLLPMP EANARTAALL SGQVDWIEAP APDAVKEITA RGFKIEKNEQ
PHVWPWQFSR VEGSPWNDIR VRRAANLCID REGLRDGLLA GLMVPATGTF EPGHPWRGKP
AFQIKYDLPA AQKLMKEAGY GPTKKLSVKV QTSASGSGQM LPLPMNEYLQ QALAECYFDV
KLDVIEWNTL FTNWRRGTKD PSANGANATN VTYAAMDPFF AMVRFLQSSM APPVSNNWGF
INNPKFDALV TKARTTFDAS LRDEALAELH AASVDDAAFL YVAHDVGPRA LSPKVKGFVQ
PKSWFVDFSP VTLAP