Gene Rsph17029_3168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3168 
Symbol 
ID4899167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp192839 
End bp194443 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content69% 
IMG OID640113770 
Productextracellular solute-binding protein 
Protein accessionYP_001045040 
Protein GI126463927 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.114782 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTACC CCACGGGCCC CATCGGCGGG CAGATCCTTG CGCAGGCCAT GAGCCTCGCC 
CCCTCGCGAC GGGCCTTTCT GGGGGGCGCG GCCGCCGTAG CCGGCGCCTT CTGCCTGCCC
GCCTCGCTGC GAGCCGAGGA GGGGCCGAAG CAGGGCGGCC GGCTCCGCTA CGGCGTCAAC
GACGGCTCGC AGCAGGATTC GCTCGAGCCC GGCAGCTGGG CCACCGTCAT GTGCGGTGCG
GCCTTCAACG GCGCGCTCTG CAACAACCTC GTCGAGCTTC TGCCGGACGG GTCGCTGGCG
GGCGATCTCG CCGAAAGCTG GGAGGAGGCC GAGGGTGCCA CCCGCTGGAC CTTCACGCTC
CGCAAGGGTG TCCTGTTCCA CGACGGCCGC CCCTTCACCC CGGAGGATGC CCGGCAGTCG
CTGATGCATC ACATGGGCGA GGGCAGCACC TCGGGCGCGC TCGCCATCGT CAGCCAGATC
AGGGAGATCG CCGTCGAGGG TGAGGACCGG CTGATCGTGA CCCTCACGCA GGGCAATGCC
GACTTCCCCT ATCTGCTGTC GGATTATCAC CTCTCGATCT TCCCGGCGAA GGAGGGCGGC
GGCATCGACT GGGAGAGCGG CATCGGCACC GGCGCCTTCA AGCTCGACAG TTTCGAGCCG
GGCGTCGCGG TCCGACTGCT CCGCAATCCG AACTATCACA AGCCCGGCCT GCCGCATTTC
GACGAGGTCG AATTCATCGC GATCCCCGAC CGGTCCGCGC GGCTGAATGC GCTGCTGACC
GGCGAGGTCG ATGTGATCGA GGATGTCGAC ATCCGCAACG TCCCCCTGAT CGAGCGCAAT
CCCGATCTGG CGCTGCACCG CACGCCGAGC CTGCGGCACC TGACCTTCGA CATGAACTGC
CAAACGGCGC CCTTCGACAA TCCGGTCGTG CGCAAGGCCC TGAAGCTCAG CCTCGACCGC
GAGGATGTGA TCGCCAAGGT GTTCCTCGGC GAGGCCGAGA CGGGGAACGA CAACCCGGTG
GCGCGCATCA TGCCCTTCTG GGCCGAGACG CCGCCCGAGC ACCGCTACGA TCCCGAGGCC
GCGCGGGCGC TTCTGGCCGA GGCCGGGATC GAGGGGCTGA CGGTCGATCT CTCGGTGGCC
GAATCCGCCT TCCCCGGCGC GGTCGAAGCG GGGGTGCTCT TCCGCGAACA TGCCGCCAAG
GCCGGCATCA CGATCAACCT CGTGCAGGAG GCCGATGACG GCTACTGGGA CAATGTCTGG
CTGGTGAAGC CCTTCAACGC CGCGGACTGG TACGGGCGGG TCACGCTCGA CTGGCTGTTC
GCCACCTCCT ACACCTCCGA CGCGCCCTGG AACAACACGG GGTTCAAGAA CGCCCGCTTC
GACGAGCTGC ATGCGGCGGC GCGGTCGGAG ACCGATCCCG CCACGCGGGG CGGGCACTAT
GCCGAGATGC AGCAGATCCT GCATGACGAC GGCGGCGTGA TCACGGTGGC CTTCGTGTCG
TGGCTGCTCG CCATGTCGCG CGCCATCGGC CATGGTGAGA CCGGAGGCAT CCTGCCCGCC
GACAATCACC GCTGCGCCGA GCGGTGGTGG CGCACCGACG TCTGA
 
Protein sequence
MRYPTGPIGG QILAQAMSLA PSRRAFLGGA AAVAGAFCLP ASLRAEEGPK QGGRLRYGVN 
DGSQQDSLEP GSWATVMCGA AFNGALCNNL VELLPDGSLA GDLAESWEEA EGATRWTFTL
RKGVLFHDGR PFTPEDARQS LMHHMGEGST SGALAIVSQI REIAVEGEDR LIVTLTQGNA
DFPYLLSDYH LSIFPAKEGG GIDWESGIGT GAFKLDSFEP GVAVRLLRNP NYHKPGLPHF
DEVEFIAIPD RSARLNALLT GEVDVIEDVD IRNVPLIERN PDLALHRTPS LRHLTFDMNC
QTAPFDNPVV RKALKLSLDR EDVIAKVFLG EAETGNDNPV ARIMPFWAET PPEHRYDPEA
ARALLAEAGI EGLTVDLSVA ESAFPGAVEA GVLFREHAAK AGITINLVQE ADDGYWDNVW
LVKPFNAADW YGRVTLDWLF ATSYTSDAPW NNTGFKNARF DELHAAARSE TDPATRGGHY
AEMQQILHDD GGVITVAFVS WLLAMSRAIG HGETGGILPA DNHRCAERWW RTDV