Gene RPD_0186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0186 
Symbol 
ID4020643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp209323 
End bp210648 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content63% 
IMG OID637960364 
Productextracellular solute-binding protein 
Protein accessionYP_567327 
Protein GI91974668 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTTC GTTTCTTTGG TCCGACTGCG GCCGCAACCG CGATCGCTGC GACTCTGGCG 
ATCGCCACGC CGGCGCATGC GGCGACCGAG ATCCAGTGGT GGCACGCCAT GACCGGCGGC
AACAACGACG TCGTCGTCAA ACTTGCCAAT GACTTCAACG CAGCGCAGAG CGACTACAAG
GTCGTCCCGA GTTACAAAGG CGGCTACGCC GACACGATGA ACGCCGGCAT CGCCGCGTTC
CGCGCCGGCA ACGCGCCGCA TATCATGCAG GTGTTCGAGG TCGGAACCGC CACCATGATG
GCGGCGACCG GCGCGGTGAA GCCGGTCTAC AAATTGATGC AGGAGACCGG CGAACCGTTC
GACGCCAAGG CCTATTTGCC GGCGATCACC GGTTACTACT CGACCTCGAA GGGCGAGATG
CTGTCGTTCC CCTTCAACTC GTCGTCGATG GTGATGTGGG TCAATCTCGA CGCGCTGAAG
AAGGCCGACA TCGCCGAGAT CCCGAAGACA TGGCCCGAAG TGTTCGAGGG CGCCAAGAAG
CTGAAAGCGG CCGGCTATGC CACCTGCGGC TTCTCCACCG CCTGGGTGAC CTGGGCCCAT
GTCGAGCAGT TGTCCGCCTG GCACAACGTG CCGCTGGCGA GCAAGGCGAA CGGTCTCGAC
GGCTTCGACA CCAAGCTTGA ATTCAACGGA CCGGTGCAGG TCAAGCATCT CGACAAGCTG
ATCGAGTTGC AGAAGGACAA GACCTTCGAC TATTCCGGCC GCACCAACAC CGGCGAGGGA
CGCTTCACAT CGGGCGAGTG CCCGCTGTTC CTGACCTCGT CGGGCTTCTT CGGCAACGTC
AAGTCGCAGG CCAAGTTCAA TTGGACCAGC GCGCCGATGC CGTACTACCC GGACGTGAAG
GGCGCCCCGC AGAATTCGAT CATCGGCGGC GCATCGCTGT GGGTGATGGG CGGCAAGACA
CCCGAGGAAT ACAAGGGCGT CGCCAAATTC CTGGCATTCC TGTCCGACAC CGACCGCCAG
GTCGCGGTCC ACAAGGCCTC GGGTTATCTG CCGATCACCA TGGCGGCCTA TGAGAAGGCC
AAGGCCGAGG GCTTCTACAA GGAAGCGCCC TATCTCGAGA CGCCGATCAA GGAACTGACC
AACAAGCCTC CGACCGAGAA TTCGCGCGGC CTGCGGCTCG GCAACATGGT GCAACTGCGC
GACCTGTGGG CCGAGGAGAT CGAACAGGCC CTAGCCGGCA AGAAGACCGC GAAGGAAGCG
CTCGACGCTG CCGTCACCCG CGGCAACACC ATGCTGCGTC AGTTCGAAAA GACCGCGGTG
AAGTAG
 
Protein sequence
MAFRFFGPTA AATAIAATLA IATPAHAATE IQWWHAMTGG NNDVVVKLAN DFNAAQSDYK 
VVPSYKGGYA DTMNAGIAAF RAGNAPHIMQ VFEVGTATMM AATGAVKPVY KLMQETGEPF
DAKAYLPAIT GYYSTSKGEM LSFPFNSSSM VMWVNLDALK KADIAEIPKT WPEVFEGAKK
LKAAGYATCG FSTAWVTWAH VEQLSAWHNV PLASKANGLD GFDTKLEFNG PVQVKHLDKL
IELQKDKTFD YSGRTNTGEG RFTSGECPLF LTSSGFFGNV KSQAKFNWTS APMPYYPDVK
GAPQNSIIGG ASLWVMGGKT PEEYKGVAKF LAFLSDTDRQ VAVHKASGYL PITMAAYEKA
KAEGFYKEAP YLETPIKELT NKPPTENSRG LRLGNMVQLR DLWAEEIEQA LAGKKTAKEA
LDAAVTRGNT MLRQFEKTAV K