Gene Rsph17025_2890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2890 
Symbol 
ID5083238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2945202 
End bp2946791 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content66% 
IMG OID640484460 
Productextracellular solute-binding protein 
Protein accessionYP_001169081 
Protein GI146278922 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.271762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGCT TGCCGCTCCG CACCGGGGCC TGTCTTCTCG CGATGATCGT TGCAGGCGCC 
GGCGCTGCCG CCGCGCAGGT GACCCTTGTC CGGGGCAACG ACACCGATCC CGCGACGCTC
GACCATCACC TGACCTCGAC CGTGGCCGAG AGCCGGATCC TGAACGATCT CTACGAGGGG
CTCGTGGTGC AGGATGCGCG GGGCGAGGTG GTGCCGGGGG TGGCCGAGAG CTGGGAGATC
TCGGAGGATG GTCTGACCTA CAGCTTCAAG CTGCGCGACG ATGCGAAATG GTCGAATGGC
GATCCGGTGG TGGCCGGAGA CTTCGTCTTC GCGCTGCGCC GGATCGTGAC GCCCGCGACG
GCGGCGGTCT ATGCCAACAT CCTCTACCCG ATCCTGAACG CCGAAGCCGT CGCCTCGGGC
CAGATGACCC CGGAAGAGCT GGGGGTCGAG GCGGTGGATG ATCATACGCT GCAAATCACC
CTGAACGCGC CCACCCCCTA CTTCCTCGAA CTGCTCACGC ACCAGTCCTC GCTGCCGCTG
CACCCGGCGA CGGTCGAGGC GGAAGGCGCG AACTTCACGC GGCCCGGCGT GATGGTCACG
AACGGCGCCT ACAAGCTGGT CAGCTTCGTG CCGAACGACC GCATCGTCAT GGAGAAGAAT
GAGCATTTCC ACGGCGCGCA GGACATCGCC GTCGATCGCG TCGAATGGGT GCCCTTCGAG
GATCGCTCGG CCTGCCTGCG GCGCTTCGAG GCGCAGGAGG TGCAGATCTG CACCGACGTG
CCCGCCGAAC AGATGAGCTA CATGCGCCAG AACCTCGGCG AGCAGCTGCG CATCGCGCCC
TACCTCGGCA CCTACTACCT GCCGGTGAAG GGCGCCGACG GCAGCCCGCT CAAGGACAAG
CGCGTGCGTC AGGCGATCTC GCTCGTGCTC GACCGCGACT TCATCGCCGA GCAGGTCTGG
CAGGAGACGA TGCTGCCCGG CTACTCGATC GTCCCGCCGG GCATCTCGAA CTATGTCGAG
ACGCCCCCCT CGCTCGATTA TGCCGAAGAG GATCTGCTTG ACCGCGAGGA CCGGGCCAAG
GCGCTCCTTG AGGAAGCGGG CGTGGCCGAG GGCAGCCTGA CCGTGCAGCT CTCCTACAAC
TCGTCCGAGA ACCATCGCAA TACGATGACC GCCATCGCCG ACATGCTGAA GAACATCGGC
ATCAACGCGA CGCTGAACGA GATGGAGGGG ACGAACTACT TCAACTACCT CAAGGAAGGC
GGCGCCTTCG ACATCGTGCG CGCGGGCTGG ATCGGCGACT ATTCCGACCC GCAGAACTTC
CTGTTCCTGT TCGAGGGCGG CGTGCCCTTC AACTATCCGC GCTGGGAGAA CGCCGATTAC
GACGCGCTGA TGGACAGGGC CGCCCAGACC CAGGATCTCG ACGAGCGGGC ACAGATCATG
GCCGAGGCCG AGACGATCCT GCTCGACGAG GTGCCGGCGA TCCCGCTGCT CACCTACTCC
TCGCGCGCGC TCGTTTCGGA CCGGGTGCAG GGCTACGAGG ACAACCTGCC CGACGTCCAC
CAGACCCGCT GGCTCTCGCT GTCCCAGTAA
 
Protein sequence
MTRLPLRTGA CLLAMIVAGA GAAAAQVTLV RGNDTDPATL DHHLTSTVAE SRILNDLYEG 
LVVQDARGEV VPGVAESWEI SEDGLTYSFK LRDDAKWSNG DPVVAGDFVF ALRRIVTPAT
AAVYANILYP ILNAEAVASG QMTPEELGVE AVDDHTLQIT LNAPTPYFLE LLTHQSSLPL
HPATVEAEGA NFTRPGVMVT NGAYKLVSFV PNDRIVMEKN EHFHGAQDIA VDRVEWVPFE
DRSACLRRFE AQEVQICTDV PAEQMSYMRQ NLGEQLRIAP YLGTYYLPVK GADGSPLKDK
RVRQAISLVL DRDFIAEQVW QETMLPGYSI VPPGISNYVE TPPSLDYAEE DLLDREDRAK
ALLEEAGVAE GSLTVQLSYN SSENHRNTMT AIADMLKNIG INATLNEMEG TNYFNYLKEG
GAFDIVRAGW IGDYSDPQNF LFLFEGGVPF NYPRWENADY DALMDRAAQT QDLDERAQIM
AEAETILLDE VPAIPLLTYS SRALVSDRVQ GYEDNLPDVH QTRWLSLSQ