Gene RPD_1815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1815 
Symbol 
ID4022297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2032173 
End bp2033771 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content62% 
IMG OID637962009 
Productextracellular solute-binding protein 
Protein accessionYP_568952 
Protein GI91976293 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATTT TGAACAAAAC GGGGTTGCAT GCGGCCGCCG CTTGCATCCT CGCCCTCGCC 
ACTCCCGCCT CCGCGCAGAC ATTGCGCTAC GCAAATCAGG GCGACCTCAA TTCGCTCGAT
CCCTACACGC TGAAAGAGAC CACGACGATT GCGCATCACG CGCACATCTA TGAAGGGCTC
ATCACCCGCG ACAAGGATCT CAAGATCGTT CCGGCGCTGG CGGAAAGCTG GGAAACGCTC
GACCCGAAGC ACTGGCGCTT CCATCTGCGC AAGGGTGTGA AATTCCACAA CGGCAATCCG
TTCACCGCGG ACGACGTGAT CTTCTCCGCC GAACGGGTGC GCGCCAAGGG CTCGAACTAT
CTCTCCAACG TCCCGTCCGA CGCCAAATTC GTCAAGATCG ACGACCACAC CGTCGACGTG
TTGCTCGATA CGCCAAATCC CATTCTGGTC TCGCAATGGG ACAACTGGTT CATCATGGAC
AAGGAGTGGT GCACCGAGAA CAATTCGGTA GCGCCGACGC CGGCCGCCGC GACGACGCCG
AGCTATGCAT CGCTGCATGA GAACGGCACC GGTCCGTTCG TGATCGACCT TCATCAACCC
GGCGTGAAGA CCGTGTTCAA GGCCAATCCG AACTGGTGGG GCAAGCCCGA GCATAATCTC
AAGGAAATCG TGTTCACGCC GATCGGCTCG GCCGCGACGC GCGTCGCCGC GCTGCTGTCC
GGCGAGGTCG ACCTTATCGA GCCGGTGCCG ATCCAGGACA TCGAGCGCGT CAATGGCAGC
GGCACGGCGA CGGTGCTGAC CGGGCCGGAA CTGCGCACGA TCTTCCTCGG GATGGATCAG
AACCGCGACG AACTGCTGTA CTCCAACGTC AAGGGCAAGA ACCCGTTCAA GGACATCCGC
GTCCGCGAGG CATTCTACAA GGCGATCGAC GTCGATCTGA TCAAAACGCG GGTGATGCGC
GGGCTGTCGA CGCCCACGGC GCTGATGATC GCGCCGCAGT TGTTCTCGCT GTCGAACGAC
TTCACCCGGC CGAAGTCCGA CGCCGCGGCG GCGAAGAAGC TGTTGGCCGA GGCCGGCTAT
CCCGACGGCT TCGAAGTCAC GATGGATTGC CCCAACGATC GCTACGTCAA CGACGCGGCG
ATATGCCAGG CGGCGGTCAG CATGCTGGCG CGGATCGGCG TCAAGGTCGA ACTGCTGGCG
CAGCCGAAGG CGCAATATTT CGCCAAGGTT CTGAAACCCG GCGGCTTCAA GACCTCGTTC
TTCATGCTGG GCTGGACTCC CGCGTCGCTC GACTCGCACG GCGTGCTGCA CGACGTCATG
GGCTGCGGCA ACGACCCGAC CGACGCGACC CGCGGCGAAG CCAATCTCGG CGGCTATTGC
AACAAGGAGC TCGACGCACT CACCGACAAG GTTCTGATCG AGACCGACCC CGGCAAGCGC
GACCAGTTGA TCAAGCAGGC ATACGAGATC GGCATCAAGG ACTATTCGTA CATCCCGCTG
CACCAGCAGG CGCTGGCCTG GGGCGTGTCG AAAAAGGTGA AGCTGACCCA GCGCGCCGAC
AATCGGGTGC TGCTGCATTG GGCGACCAAG CAAGATTGA
 
Protein sequence
MTILNKTGLH AAAACILALA TPASAQTLRY ANQGDLNSLD PYTLKETTTI AHHAHIYEGL 
ITRDKDLKIV PALAESWETL DPKHWRFHLR KGVKFHNGNP FTADDVIFSA ERVRAKGSNY
LSNVPSDAKF VKIDDHTVDV LLDTPNPILV SQWDNWFIMD KEWCTENNSV APTPAAATTP
SYASLHENGT GPFVIDLHQP GVKTVFKANP NWWGKPEHNL KEIVFTPIGS AATRVAALLS
GEVDLIEPVP IQDIERVNGS GTATVLTGPE LRTIFLGMDQ NRDELLYSNV KGKNPFKDIR
VREAFYKAID VDLIKTRVMR GLSTPTALMI APQLFSLSND FTRPKSDAAA AKKLLAEAGY
PDGFEVTMDC PNDRYVNDAA ICQAAVSMLA RIGVKVELLA QPKAQYFAKV LKPGGFKTSF
FMLGWTPASL DSHGVLHDVM GCGNDPTDAT RGEANLGGYC NKELDALTDK VLIETDPGKR
DQLIKQAYEI GIKDYSYIPL HQQALAWGVS KKVKLTQRAD NRVLLHWATK QD