Gene Rsph17025_1506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1506 
Symbol 
ID5084755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1543569 
End bp1545383 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content68% 
IMG OID640483063 
Productextracellular solute-binding protein 
Protein accessionYP_001167705 
Protein GI146277546 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0138783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACAGG ATTATCGGCC CGCCCTCAGG GGTCTCATGG CAGCCGCGGG CCTTGCAGTC 
TGGGGATTCG CAGCCGCCTC GGAACCGCGC CACGGCATAG CTATGTATGG CGAACCTGCG
CTTCCACCGG ATTTTGTGTC TCTGCCATAT GCCAATCCCG ACGCCCCCAG GGGCGGGCGG
ATCGTGCTGG GCGAGACGGG CGGATTCGAT TCCCTCAACC CCTACATCGT GAAGGGCCGC
GCGCCCTACT CGCTCGCCCC GCTCACCGTC GAGAGCCTCA TGGGGCGCTC GATCGACGAG
CCCTTCACCC TTTACGGCCT TCTTGCCGAA TCGGTAGAGA CCGACCCGGA GCGGACGTGG
GTCGAATTCA CCCTGCGCGA GGGCGCGCAA TTCTCGGACG GATCGCCCGT GACGGTCGAG
GATGTCCTGT GGTCGTTCGA GGAACTGGGC ACCCGCGGCA TGCCGCGCTA CTGGATCGCC
TGGCAGAAGA TCGCCTCGGC CGAGCAGACC GGCCCGCGCT CCGTCCGCTT CACCTTCACC
GAACAGGACC GCGAACTGCC GCTGGTGCTG GGCCTGCGTC CCGTCCTGAA GAAGGCCCAG
TGGGCGGGGC GCGACTTCGC GGCCTCGGGG TTCGAGGCGC CGATCGGCTC GGGGCCCTAC
ATCCTCGAGA GCTTCGAGCC GGGCCGCTCG CTGCGCTACC GGCGCAACCC CGACTGGTGG
GGGCAGGACC TGCCCTTCAA CCGCGGCCTT CACAATTTCG ACGAGGTGAC GGTCGAATAT
TTCGGCGACA GCAGCGTCGC GTTCGAGGCC TTCAAGGCCG GCGCCCTCTC GGTCTTCCGC
GAGACCAGCG CCGCCAGATG GGCCACGCAT TACGACTTTC CCGCCGTGCG AAGCGGCGCG
ATCGTCCGCT CCGAGATCCC GCACGGGCGC CCGTCGGGCA TGGAGGGGCT GGTGATGAAC
ACGCGCCGGC CGATCTTCGC CGACTGGCGC GTGCGCGAGG CGCTGATCCA GGCCTTCAAC
TTCGAACTGA TCAACCGCAC CCTCACCGGC GGGGCCGAGC CGCGCATCGC CTCCTACTTC
TCGAACTCCG ACCTCGGGAT GGCGGTCGGC GCGCCGGCCG AGGGGCGCGA ACGCGCGCTG
CTCGAACCCT ACGCGGCCGA TCTGCTGCCC GGCACGCTCG AGGGCTACGC GCTGCCCGTC
TCGGATGGCA GCGAGGCCAA CCGGCAGGGC CTGAGGGTCG CGACACGGCT TCTGGCCGAG
GCCGGATGGC GGGTCGAGGA CGGGGTGCTG ACAGGCGCCG ACGGGCGGCC CTTCCAGTTC
GAGATCCTGC TGCCGCAGGG GGCGGACGCG ATGATCGCGG CGGCCAACAT CTACCGGCAG
GCGCTGGCGC GGCTCGGGAT CTCGGTCAGC ATCGCGGCGG TGGATCCGGC GCAATACAAG
CAGCGGGTGG ACAACCAGCA GTTCGACATG ACGAGCTTCC TGCGCTCGCT CTCGCTCTCG
CCCGGCAACG AGCAGATGCT CTACTGGTCG GCGGAGGGCA AGGACGCCCC CGGCTCGCGC
AACCTGATGG GGATGGAGAG CCCGGCGGCC GAAGCGATGA TCCGCGGGAT GCTGGCCACC
GATGACCCGG CCGAGTTTCA AGCCTCCGTG AGGGCGCTCG ACCGGATCCT GACGGCCGGT
AGATATGTCA TTCCGATGTG GTATCCTCGG GTTTCGCGGC TGGCGCATGA CAGGCACCTG
CGCTATCCGG CGACCACACC CATCTATGGC GACTGGCCGG GGTTCCTGCC GGACGTCTGG
TGGCAAGAAA ACTGA
 
Protein sequence
MIQDYRPALR GLMAAAGLAV WGFAAASEPR HGIAMYGEPA LPPDFVSLPY ANPDAPRGGR 
IVLGETGGFD SLNPYIVKGR APYSLAPLTV ESLMGRSIDE PFTLYGLLAE SVETDPERTW
VEFTLREGAQ FSDGSPVTVE DVLWSFEELG TRGMPRYWIA WQKIASAEQT GPRSVRFTFT
EQDRELPLVL GLRPVLKKAQ WAGRDFAASG FEAPIGSGPY ILESFEPGRS LRYRRNPDWW
GQDLPFNRGL HNFDEVTVEY FGDSSVAFEA FKAGALSVFR ETSAARWATH YDFPAVRSGA
IVRSEIPHGR PSGMEGLVMN TRRPIFADWR VREALIQAFN FELINRTLTG GAEPRIASYF
SNSDLGMAVG APAEGRERAL LEPYAADLLP GTLEGYALPV SDGSEANRQG LRVATRLLAE
AGWRVEDGVL TGADGRPFQF EILLPQGADA MIAAANIYRQ ALARLGISVS IAAVDPAQYK
QRVDNQQFDM TSFLRSLSLS PGNEQMLYWS AEGKDAPGSR NLMGMESPAA EAMIRGMLAT
DDPAEFQASV RALDRILTAG RYVIPMWYPR VSRLAHDRHL RYPATTPIYG DWPGFLPDVW
WQEN