Gene RPB_0646 details


Gene Information

Locus tag: RPB_0646
Symbol:
ID: 3908571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 731907
End bp: 733232
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 64%
IMG OID: 637882536
Product: extracellular solute-binding protein
Protein accession: YP_484268
Protein GI: 86747772
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
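
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 731907
end_bp = 733232
reported_gene_length = 1326      # bp
reported_protein_length = 441    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed lengths agree with the reported Gene Length and Protein Length.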


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.206924
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTTC GTTTGAAAGC TCTGACCGCT GCGATGGCGA CGACGATCGC CGCCACGCTG 
CTGATCGCGC CGGCGCAGGC GGCGACCGAG ATCCAGTGGT GGCACGCCAT GACCGGCGGC
AACAACGATG TCGTGGTCAA GCTCGCCAAC GACTTCAACG CCGCGCAGAG CGACTACAAG
GTCGTCCCGA CCTACAAGGG CAGCTACGCC GACACCATGA ACGCCGGCAT CGCGGCGTTC
CGCGCCGGCA ACGCCCCGCA CATCATGCAG GTGTTCGAGG TCGGCACCGC CACCATGATG
GCGGCGACCG GCGCGGTGAA GCCGGTCTAC AAGCTGATGC AGGAGACCGG CGAGACATTC
GATCCCAACG CCTACCTGCC GGCGATCACC GGCTACTACT CGACCTCCAA GGGCGAGATG
CTGTCGTTCC CGTTCAACTC GTCCTCGACG GTGATGTGGG TCAATCTCGA CGCGCTCAAG
AAGGCCGGGA TCGCCGAAGT TCCGAAGACC TGGCCGCAGG TGTTCGAGGA TGCCAAGAAG
CTGAAGGCGG CCGGCTACGC CACCTGCGGC TTCTCCACCG CCTGGGTCAC TTGGGTCAAT
CTCGAGCAGC TCTCCGCCTG GCACAATGTA CCGCTGGCCA GCAAGGCCAA CGGCCTCGAC
GGCTTCGACA CCAAGCTGGA GTTCAACGGC CCGGTGCAGG TCAAGCATCT GGAGACGCTG
ATCGAGCTGC AGAAGGACAA GACCTACGAT TATTCCGGCC GCACCAACAC CGGCGAGGGC
CGCTTCACCT CCGGCGAGTG CCCGATCTTC CTGACCTCGT CGGGTTTCTT CGGCAACGTC
AAGTCTCAGG CCAAGTTCGC CTGGACCAAC GCGCCGATGC CGTACTACCC GGACGTTGCC
GGCGCGCCGC AGAATTCGAT CATCGGCGGC GCCTCGCTGT GGGTGATGGG CGGCAAGAGC
GCCGACGAAT ACAAGGGCGT CGCCAAGTTC CTCGCCTTCC TGTCCGACAC CGACCGTCAG
GTCGCGGTGC ACAAGGCGTC GGGCTATCTG CCGATCACCA AGGCGGCCTA CGAGAAGGCC
AAGGCCGACG GCTTCTACAA CGACCAGCCC TATCTCGAGA CCCCGATCAA GGAACTGACC
AACAAGCCGC CGACCGAGAA TTCCCGCGGC CTGCGGCTCG GCAACATGGT TCAGCTCCGC
GACGTCTGGG CCGAGGAAAT CGAGCAGGCG CTGGCCGGCA AGAAGACCGC CAAGGAAGCG
TTGGATGCAG CCGTCACCCG CGGCAACGTG ATGCTGCGGC AGTTCGAAAA GACCGCGGTG
AAGTAA
 
Protein sequence
MTFRLKALTA AMATTIAATL LIAPAQAATE IQWWHAMTGG NNDVVVKLAN DFNAAQSDYK 
VVPTYKGSYA DTMNAGIAAF RAGNAPHIMQ VFEVGTATMM AATGAVKPVY KLMQETGETF
DPNAYLPAIT GYYSTSKGEM LSFPFNSSST VMWVNLDALK KAGIAEVPKT WPQVFEDAKK
LKAAGYATCG FSTAWVTWVN LEQLSAWHNV PLASKANGLD GFDTKLEFNG PVQVKHLETL
IELQKDKTYD YSGRTNTGEG RFTSGECPIF LTSSGFFGNV KSQAKFAWTN APMPYYPDVA
GAPQNSIIGG ASLWVMGGKS ADEYKGVAKF LAFLSDTDRQ VAVHKASGYL PITKAAYEKA
KADGFYNDQP YLETPIKELT NKPPTENSRG LRLGNMVQLR DVWAEEIEQA LAGKKTAKEA
LDAAVTRGNV MLRQFEKTAV K
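
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The `translate` helper and the constant names are hypothetical, written for this illustration only, and only the first 30 bases and the last 6 bases of the gene sequence are used.

```python
from itertools import product

# Standard codon table in TCAG order; assumed to match the
# codon assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 6 bases of the gene sequence above.
gene_start = "ATGACTTTTC GTTTGAAAGC TCTGACCGCT"
gene_end = "AAGTAA"

print(translate(gene_start))  # MTFRLKALTA
print(translate(gene_end))    # K
```

The two results correspond to the first ten residues and the final residue of the protein sequence listed above.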