Gene RPD_4373 details

Gene Information

Locus tag: RPD_4373
Symbol:
ID: 4024898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4836584
End bp: 4837897
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 64%
IMG OID: 637964583
Product: extracellular solute-binding protein
Protein accession: YP_571491
Protein GI: 91978832
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
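
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are made up for the example.

```python
# Values copied from the Gene Information fields above.
start_bp = 4836584
end_bp = 4837897
reported_gene_length = 1314     # bp
reported_protein_length = 437   # aa

# Assumption: Start/End are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is an untranslated stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1314 437
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length, and the gene length is a whole number of codons.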


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.104197
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGGA CCCCGCTGAT ATTCGCCCTT GTGTTGCTAG TCGTCGCCGG CGCAGCGCCC 
GCGCGGGCCG CGACGGAGAT CGCGTGGTGG CACGCGATGT CCGGAGAACT CGGCCGCCGT
CTCGAAAAGC TCGCAGCCGA TTTCAATGCG TCGCAATCCG ACTACCGTGT GGTGCCGACC
TACAAAGGCA ACTACACCGA GACGGTCACC GCGTCGATCT TCGCGTTCCG CTCGTCGACT
CAGCCGGCGA TCGTTCAGGT CAACGAGATC GCCACCGCAA CAATGATGGC CGCCAAGGGC
GCGGTTTATC CGGTCTACGA GCTGATGCGC GACGAGAAGG AGGCGTTCTC GCCGTCCGAC
TATCTGCCCG CGGTCGCCGG CTACTATGTC GATCTCGCCG GCAATATGCT GTCGTTTCCG
TTCAACGCCT CGACGCCGAT TCTGTACTAC AACAAGACGC TGTTCAAAAA GGCGGGGCTC
GATCCGGAGA CGCCGCCGGG CACTTGGCCG GACGTCGGCG CCGCGGCGAA GCGGCTGATC
GACGCGGGCG TGCCCTGCGG ATTCACCACC TCCTGGCCCT CCTGGGTCAA TGTCGAGAAT
TTCTCCGCCT ATCACAATCT TCCGCTCGCG ACTCGGGCCA ACGGCCTCGG CGGGCTGGAC
GCGGTGCTGG TGTTCAACAA TCCGCTGGTG ATCAGGCACG TCGCCACGCT CGCGGAATGG
CAGAAAACCA AGGTGTTCGA CTATGCCGGC CGCGCCACCG CCGCAGAGCC GCGCTTTCAG
CAGGGCGACT GCGGCATCTT CATCGGCTCG TCCGCCACCC GTGCCGATAT CATCGCCAAT
TCCAATTTCG AGGTCGGCTA CGGCAGGCTG CCGTATTGGC CGGAGGTTCC CGGGGCGCCG
CAGAATACGA TCATCGGCGG GGCGACGCTG TGGGTGCTGC GCGGCCGGCC GGCGACGGAC
TATCACGGCG TCGCCAAGTT CTTCACCTAT CTGTCGCGGC CCGAAGTGCA GGCCGCCTGG
CACCAGAACA CGGGCTATCT TCCGGTGACA CGGGCCGCCT ATCAGCTGAC CCGTGCGCAG
GGCTTTTACG ACCGCAATCC GGGCACCGCG ATCTCGATCG AGCAGATCAT CTCGAAGCCG
CCCACCGAAA ACTCCCGCGG GCTCCGGCTC GGTTCTTTCG TTCTGATCCG CGACGTCATC
GACGACGAGC TCGAACAGGC ATTCAGGGGC AAGAAACCCG CGCAGGCCGC GATGAATTCC
GCGGTCGAGC GCGGCAACAA GTTGCTGCGC CAGTTCGAAC GGACGCAGCC ATGA
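
A GC content figure like the one in the Gene Information section is the share of G and C bases in a nucleotide sequence. The following is a minimal sketch of that calculation, not the database's own method. The function name `gc_content` is hypothetical, and the example call uses only the first 10-base block of the gene sequence, so its result is not the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence above (4 G + 2 C out of 10 bases).
print(gc_content("ATGGCCAGGA"))  # prints: 60.0
```

Because the function strips whitespace, the space-separated blocks of the sequence listing can be pasted in as they are.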
 
Protein sequence
MARTPLIFAL VLLVVAGAAP ARAATEIAWW HAMSGELGRR LEKLAADFNA SQSDYRVVPT 
YKGNYTETVT ASIFAFRSST QPAIVQVNEI ATATMMAAKG AVYPVYELMR DEKEAFSPSD
YLPAVAGYYV DLAGNMLSFP FNASTPILYY NKTLFKKAGL DPETPPGTWP DVGAAAKRLI
DAGVPCGFTT SWPSWVNVEN FSAYHNLPLA TRANGLGGLD AVLVFNNPLV IRHVATLAEW
QKTKVFDYAG RATAAEPRFQ QGDCGIFIGS SATRADIIAN SNFEVGYGRL PYWPEVPGAP
QNTIIGGATL WVLRGRPATD YHGVAKFFTY LSRPEVQAAW HQNTGYLPVT RAAYQLTRAQ
GFYDRNPGTA ISIEQIISKP PTENSRGLRL GSFVLIRDVI DDELEQAFRG KKPAQAAMNS
AVERGNKLLR QFERTQP
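
The protein sequence is the codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table rather than any particular library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not come into play. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCCAGGA CCCCGCTGAT ATTCGCCCTT"))  # prints: MARTPLIFAL
# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CAGCCATGA"))  # prints: QP
```

The two outputs match the first ten and the last two residues of the protein sequence listed above.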