Gene Rsph17025_2844 details

Gene Information

Locus tag: Rsph17025_2844
Symbol:
ID: 5084222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2899302
End bp: 2900906
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 69%
IMG OID: 640484414
Product: extracellular solute-binding protein
Protein accession: YP_001169035
Protein GI: 146278876
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
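
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2899302, 2900906)
print(length)                  # 1605
print(protein_length(length))  # 534
```

Both values agree with the Gene Length (1605 bp) and Protein Length (534 aa) fields.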


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.922612
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.415189
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTACC CCACGGGCCC CATCGGCGGG CCGATCCCGG TGCGGGCCCT GGGCCTCGCA 
CCCTCGCGGC GGGCCTTTCT CGGCGGCGCG GCCGTCCTGG CCGGCGCCTT CTGCCTGCCC
GCCTCGCTCC GGGCCGAGGA AGGGCCGAAG CGGGGCGGCC GGCTGCGCTA CGGCGTCAAT
GACGGCTCGC AGCAGGATTC GCTCGAGCCC GGAAGCTGGG CGACGGTGAT GTGCGGCGCG
GCCTTCAACG GGGCGCTCTG CAACAACCTC GTCGAGCTTT TGCCGGACGG GTCGCTGGCG
GGCGATCTGG CCGAACGCTG GGAGGAGGCG GAGGGGGCGA CCCGCTGGAC CTTCACGCTG
CGCAAGGGCG TGACCTTCCA CGACGGACGC CCCTTCACGC CCGAGGATGC GCGGCAATCG
CTGCTTCACC ACATGGGCGA AGACAGCACC TCGGGCGCGC TCGCCATCGT CAGCCAGATC
AAGGAGATCG CGGTCGAGGG CGAGGATCGG CTGATCGTGA CACTCACGCA GGGCAATGCC
GACTTCCCCT ATCTGCTGTC AGACTATCAC CTCTCGATCT TCCCGGCGAA GGAGGGGGGC
GGCATCGACT GGGAAAGCGG CATCGGCACC GGCGCCTTCC GGCTGGACAG TTTCGAGCCC
GGCGTCGCGG TCCGGCTGGT CCGCAATCCC CGCTATCACA AGCCCGGCCT GCCGCATTTC
GACGAGGTCG AGTTCATCGC GATCCCCGAC CGGGCGGCCC GGCTGAATGC GCTGCTGACC
GGCGAGGTCG ATGTGATCGA GGATCTCGAC ATCCGCAACG TCCCCCTCAT CGAGCGCAAC
CCCGATCTGG TGCTGCACCG CACGCCCAGC CTGCGGCACC TGACCTTTGA CATGAACTGC
CAGACGGCGC CCTTCGACCA TCCGGCGGTG CGTCAGGCCC TGAAGCTCAG CCTCGACCGC
GAGGATGTGA TCGCCAAGGT CTTCCTCGGC GAGGGCGAGA CCGGCAACGA CAATCCGGTG
GCGCGGATCA TGCCCTTCTG GGCCGAGACG CCGCCCGAGC ACCGCTACGA CCCCGAGGCC
GCGCGGGCGC TTCTGGCCGA GGCCGGGATC GAGGGGCTCA CGGTCGATCT CTCGGTCGCG
GAGTCGGCCT TCCCCGGCGC GGTCGAGGCG GGGGTGCTGT TCCGCGAGCA TGCGGCGCGC
GCCGGCATCA CCATCAACCT CGTGCAGGAG GCCGATGACG GCTACTGGGA CAATGTCTGG
CTGGTGAAGC CCTTCAACGC CGCGGACTGG TACGGGCGGG TCACGCTCGA CTGGCTGTTC
GCGACCTCCT ACACGTCCGA CGCACCCTGG AACAACACGG GGTTCAGGAA CGCCCGCTTC
GACGAGCTGC ACGCCAAGGC CCGGTCGGAG ACCGAACCCG CAAAGCGCGG CGCGCAGTAT
GCCGAGATGC AGCAGATCCT GCATAACGAG GGCGGCGTGA TCACGGTGGC CTTCGTCTCC
TGGCTCCTCG CCATGTCCCG TGCCATCGGC CATGGCGAGA CCGGAGGGAT CCTGCCCGCC
GACAATCACC GCTGCGCCGA GCGATGGTGG CGCACCGACA TCTGA
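
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks must be stripped before any computation. As a minimal sketch, the helpers below (hypothetical names) clean a blocked listing and compute the GC fraction, the quantity reported in the GC content field. Only the first line of the listing is used here for brevity; pasting in the full listing would give the value for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGCGCTACC CCACGGGCCC CATCGGCGGG CCGATCCCGG TGCGGGCCCT GGGCCTCGCA"
seq = clean_sequence(first_line)
print(len(seq))                  # 60
print(f"{gc_content(seq):.0%}")  # GC percentage of this first line only
```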
 
Protein sequence
MRYPTGPIGG PIPVRALGLA PSRRAFLGGA AVLAGAFCLP ASLRAEEGPK RGGRLRYGVN 
DGSQQDSLEP GSWATVMCGA AFNGALCNNL VELLPDGSLA GDLAERWEEA EGATRWTFTL
RKGVTFHDGR PFTPEDARQS LLHHMGEDST SGALAIVSQI KEIAVEGEDR LIVTLTQGNA
DFPYLLSDYH LSIFPAKEGG GIDWESGIGT GAFRLDSFEP GVAVRLVRNP RYHKPGLPHF
DEVEFIAIPD RAARLNALLT GEVDVIEDLD IRNVPLIERN PDLVLHRTPS LRHLTFDMNC
QTAPFDHPAV RQALKLSLDR EDVIAKVFLG EGETGNDNPV ARIMPFWAET PPEHRYDPEA
ARALLAEAGI EGLTVDLSVA ESAFPGAVEA GVLFREHAAR AGITINLVQE ADDGYWDNVW
LVKPFNAADW YGRVTLDWLF ATSYTSDAPW NNTGFRNARF DELHAKARSE TEPAKRGAQY
AEMQQILHNE GGVITVAFVS WLLAMSRAIG HGETGGILPA DNHRCAERWW RTDI
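
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The `translate` function is an illustrative helper, not the database's own method.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # tolerate blocked input
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases and last 15 bases of the gene sequence above.
print(translate("ATGCGCTACC CCACGGGC"))  # MRYPTG
print(translate("CGCACCGACA TCTGA"))     # RTDI
```

The two fragments match the first six and last four residues of the protein sequence listed above.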