Gene RPB_0174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0174 
Symbol 
ID3907779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp188622 
End bp189902 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content63% 
IMG OID637882056 
Productextracellular solute-binding protein 
Protein accessionYP_483797 
Protein GI86747301 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.396862 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCGAA CCTGGATGGC AGCGGCGGCT TTCACGCTCG CCGCCGGCTG CGCTCACGCG 
CAGACGCAGA CCGAAGTCGT GCTGCAATAT CCCTATCCGG AGCTGTTCAC CGAGACCCAC
AAGCAGATCG CGGCCGAATT CGCCAAGGTG CATCCGGAAA TCAAGGTGAC GTTCCGCGCG
CCTTACGAAT CCTATGAAGA AGGCACCCAG AAGGTGCTGC GCGAGGCGGT CACCAATCAG
GTCCCCGACG TCACCTTCCA GGGCCTGAAC CGCGTCCGCG TGCTGGTCGA CAAGAACATT
CCGGCCGAAC TCGACGGCTA CATCGCCGCC GAAAAGGATT TCGACAAGCA GGGCTTCCAC
CAGGCGATGT ACGACATCGG CACCGCCAGC GGAAAGGTCT ACGCGCTGCC GTTCGCGATC
TCGCTGCCGA TCGTCTACGT CAATGTCGAT CTGGTGAAAC AGGTCGGCGG CGATCCGAAC
AATCTGCCGA CCAGCTGGGA CGGCCTGATC GACCTCGCCA AGAAGGTCAA GGCGCTCGGC
CCGGACTATA ACGGCATCAC CTATGCGTGG GACATCACCG GCAACTGGCT GTGGCAGGCG
CCGGTGTTCG CCCGCGGCGG CACCATGCTG AACGCGGACG AAACCAAGGT GGCGTTCGAT
GGTCCCGAAG GCCAGTTCGC CATGAAGCAG ATCGCCCGCC TCGTCACCGA GGGCGGCATG
CCGAATCTCG ACCAGCCGTC GATGCGCGCC GCCTTCGCGG CGGGCAAGAC CGGCATCCAC
ATCACCTCGA CCTCCGATCT CAACAAGACC ACGCAGATGA TCGGCGGCAA GTTCACGCTG
AAGACCCACA TCTTCCCGGA CGTGGTCAAG CCGAACGGCC GTCTGCCGGC CGGCGGCAAC
GTGGTGCTGA TCACCGCCAA GGACAAGGCC AAGCGTGACG CGGCCTGGGA AGTGGTGAAG
TTCTGGACCG GCCCGAAGGG CGCCGCGATC ATGGCGGAGA CCACCGGCTA CATGCCGCCC
AACAAGGTCG CCAACGACGT CTATCTGAAG GACTTCTACG AGAAGAACCC GAACAACTAC
ACCGCGGTGA GCCAGCTCGC GCTGCTGACC AAATGGTACG CGTTCCCGGG CGACAACGGC
CTCAAGATCA CCGACGTGAT CAAGGATCAT CTCAACTCGA TCGTGACCGG AACCCGGGCC
AAGGAGCCGG ACGCGGTGCT CGCCGACATG ACCAAGGACG TGCAGAAGCT GCTGCCGAAA
TCGGTCGGCG CGGCGCGCTG A
 
Protein sequence
MLRTWMAAAA FTLAAGCAHA QTQTEVVLQY PYPELFTETH KQIAAEFAKV HPEIKVTFRA 
PYESYEEGTQ KVLREAVTNQ VPDVTFQGLN RVRVLVDKNI PAELDGYIAA EKDFDKQGFH
QAMYDIGTAS GKVYALPFAI SLPIVYVNVD LVKQVGGDPN NLPTSWDGLI DLAKKVKALG
PDYNGITYAW DITGNWLWQA PVFARGGTML NADETKVAFD GPEGQFAMKQ IARLVTEGGM
PNLDQPSMRA AFAAGKTGIH ITSTSDLNKT TQMIGGKFTL KTHIFPDVVK PNGRLPAGGN
VVLITAKDKA KRDAAWEVVK FWTGPKGAAI MAETTGYMPP NKVANDVYLK DFYEKNPNNY
TAVSQLALLT KWYAFPGDNG LKITDVIKDH LNSIVTGTRA KEPDAVLADM TKDVQKLLPK
SVGAAR