Gene Rsph17029_3982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3982 
Symbol 
ID4899129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1124354 
End bp1125865 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content63% 
IMG OID640114585 
Productextracellular solute-binding protein 
Protein accessionYP_001045832 
Protein GI126464719 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGGAA AATGGGCCCG CGTCGGCCTC ATCGCGCTGA GCGTGACCCT CGGCCTCGGC 
AGCATTGCCG AAGCCGCAGG CGTGCTGACC ATCGGCCGGC GCGAGGACGG CACCACCTTC
GACCCGATCG CCACGGCGCA GAACGTGGAT TTCTGGGTCT TCTCGAACGT CTATGACGTG
CTCGTGCGCG TGGACAAGAC CGGGACCAAG CTCGAGCCCG GTCTGGCCGA AAGCTGGGAG
ATCTCGGAGG ACGGGCTCAC CTACACCTTC CATCTGCGCG ACGCGAACTT CTCCGACGGC
TCGCCCATCA CCGCCGAGGA CGCGGCCTTC ACGCTGCTGC GCATCCGCGA CAGCGAACTG
TCGCTCTGGT CGGACAGCTA TGCGGTGATC GAGACCGCCG AGGCGACCGA TCCGAAGACT
CTCGTCGTCA AGCTGAAGAC CCCCTCCGCG CCCTTCCTGT CCACCATGGC CATGCCCGCC
GTCTCGATCC TGTCCAAGGC AGGCGTCGAA GCCATGGGCG AGGAAGCCTA CGCCGAGAAG
CCCGTGGCCT CCGGCGCCTT CACCGTCGAG GAATGGCGCC GCGGCGACCG GGTGATCCTG
AAGAAGAACC CGGAATTCTG GCAAGCCGAC CGGGTGAGCC TCGACGGGGT CGAATGGATC
TCGATCCCCG ACGACAACAG CCGGATGCTG AGCGTGCAGG CGGGCGAACT CGACGCCGCG
ATCTTCGTGC CCTTCTCGCG GGTGGCGGAA CTGAAGAAGG ATACCAACCT CAAGGTCATG
GTCGAGCCCT CGACGCGCGA GGACCATCTG CTCCTCAACC ACGAGCACGA GCCACTGAAC
GATCCCAAGG TGCGCGAGGC GATCGACCTC GCCATCGACA AGCAGGCGAT CGTCGACACG
GTGACCTTCG GGCAGGCGCA GATTGCCAAT TCCTACATCC CGGCGGGCGC GCTCTATCAC
AACGACAACA ACCTGCTGCG GCCCCACGAC CCCGAGAAGG CCAAGGCGCT TCTGGCCGAG
GCGGGCGTGT CCGACGTGAG CCTCGATTAT GTGGTGAACG CGGGCAACGA GGTGGACGAG
CAGATCGCGG TCCTTCTCCA GCAGCAGCTG GGTCAGGCCG GGGTCACGGT CAATCTGCAG
AAGATGGACC CGAGCATGAC CTGGGACATG CTGGTGAATG GCGAATACGA CCTGTCGGTC
ATGTATTGGA CGAACGACAT CCTCGATCCC GACCAGAAGA CCACCTTCGT TCTGGGTCAC
GACGTCAACA TGAACTACAT GACGCGCTAC GAGAACGAGA CGGTGAAGCA GCTCGTGGCC
GATGCCCGGC TCGAGATGGA CCCGGCCAAG CGCGAGGCGA TGTATACCGA GATCCAGGAA
CTGTCGAAGG CCGATACCCA CTGGATCGAC CTCTATTACA GCCCCTTCAT CAACGTGACG
CGCGCCAATA TCGAGAACTT CAACCAGAAC CCGCTGGGCC GCTTCTTCCT CGAGGATACG
GTGAAGAACT GA
 
Protein sequence
MTGKWARVGL IALSVTLGLG SIAEAAGVLT IGRREDGTTF DPIATAQNVD FWVFSNVYDV 
LVRVDKTGTK LEPGLAESWE ISEDGLTYTF HLRDANFSDG SPITAEDAAF TLLRIRDSEL
SLWSDSYAVI ETAEATDPKT LVVKLKTPSA PFLSTMAMPA VSILSKAGVE AMGEEAYAEK
PVASGAFTVE EWRRGDRVIL KKNPEFWQAD RVSLDGVEWI SIPDDNSRML SVQAGELDAA
IFVPFSRVAE LKKDTNLKVM VEPSTREDHL LLNHEHEPLN DPKVREAIDL AIDKQAIVDT
VTFGQAQIAN SYIPAGALYH NDNNLLRPHD PEKAKALLAE AGVSDVSLDY VVNAGNEVDE
QIAVLLQQQL GQAGVTVNLQ KMDPSMTWDM LVNGEYDLSV MYWTNDILDP DQKTTFVLGH
DVNMNYMTRY ENETVKQLVA DARLEMDPAK REAMYTEIQE LSKADTHWID LYYSPFINVT
RANIENFNQN PLGRFFLEDT VKN