Gene Rsph17029_3988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3988 
Symbol 
ID4898698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1132302 
End bp1133924 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content66% 
IMG OID640114591 
Productextracellular solute-binding protein 
Protein accessionYP_001045838 
Protein GI126464725 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAC TGCTCGCCTC CACCGCCGTG ATGCTGGCGC TCGCCCTGCC CGCCGCAGCC 
CAGGACTATA CGCCCGACCC GAACGCCAGG CCCGGCGGCA CGATCACCAT CACCTACAAG
GACGATGTGG CGACGCTCGA TCCGGCCATC GGCTACGACT GGCAGAACTG GTCGATGATC
AAATCGATCT TCGACGGGCT GATGGATTAC GTCCCCGGCA CGACCGAGCT GCGCCCGGGT
CTCGCCGAGA GCTACGAGAT CTCGGAGGAC GGGCTCACCT ATACGTTCAA GCTGCGCCCG
GGCGTGACAT TCCACAACGG CCGCGAGATG ACGGCCGAGG ATGTGAAATA TTCGCTCGAC
CGCGTGACGC TGCCCGCGAC CCAGTCGCCG GGAGCGGGCT TCTTCGGCTC GATCAAGGGC
TTCGATGCGA TGGCCGACGG CTCGGCCACC ACGCTCGAGG GCGTGACGGT GGTCGATCCC
TCGACCGTGA AGATCGAGCT CTCGCGTCCC GACGCCACCT TCCTGCATGT GATGGCGCTG
AACTTCGCCT CGGTGGTGCC GAAGGAGGCC GTCGAGGCGG CGGGCGCCGA CTTCGGCAAG
CAGCCGGTCG GCACCGGGGC CTTCAAGCTC GCCGAATGGA CCCTCGGCCA ACGGCTCGTC
TTCGAGAAGA ACGCCGATTA CTGGCGCGAG GGCGTGCCCT ATCTCGACAG CATCGTCTTC
GAAGTGGGAC AGGAGCCGAT TGTGGCGCTG CTGCGGCTGC AGAACGGCGA GGTGGACGTG
CCCGGCGACG GCATTCCGCC TGCGAAATTC ACCGAAGTGA TGGCCGATCC GGCGCAGGCC
GAGCGCGTGG TCGAGGGCGG CCAGCTGCAC ACGGGCTACA TCACGATGAA CGTGACCCAG
CCGCCCTTCG ACAATCTGCA GGTCCGTCAG GCCGTCAACA TGGCGATCAA CAAGCAGCGG
ATCACCCAGA TCATCAACGG CCGCGCGATC CCCGCGACCC AGCCGCTGCC GCCCTCGATG
CCGGGCTATA CCGAAGGCTA CGAGGGCTAT CCGCACGATG TCGAGAAGGC CAAGGCGCTG
CTCTCCGAGG CGGGCTTCGC CGACGGGTTC GAGACCGAGC TCTATGTGAT GAACACCGAC
CCGAACCCGC GCATCGCGCA GGCGATCCAG CAGGATCTGT CGCAGATCGG CATCAAGGCC
GCGATCCAGA GCCTCGCGCA GGCCAATGTG ATCGAGGCCG GCGGCAATGG CTCGGCGCCG
ATGATCTGGT CGGGCGGCAT GGCCTGGATC GCGGATTTCC CCGATCCGTC CAACTTCTAC
GGCCCGATCC TCGGCTGCGC GGGCGCGGCT GACGGCGGCT GGAACTGGTC GAAATTCTGC
GACGAGGCGC TCGACGCCAA GGCCACCGAG GCCGACAGCC TCGCCGATCC GGCCCGTGCC
GAGGAGCGGC TGAAGCTCTG GTCCGACGTC TATATGGGCG TGATGGAGAA GGCGCCGTGG
GTGCCCGTCT TCAACGAACA GCGCTACACG ATGAAATCCG CGCGCATGGG CGGCGACGAC
AGCCTCTATG TCGATCCCGT CTCGATCCCC GTGAACTACG ACTATGTCTT CGTGACCGAG
TAA
 
Protein sequence
MKRLLASTAV MLALALPAAA QDYTPDPNAR PGGTITITYK DDVATLDPAI GYDWQNWSMI 
KSIFDGLMDY VPGTTELRPG LAESYEISED GLTYTFKLRP GVTFHNGREM TAEDVKYSLD
RVTLPATQSP GAGFFGSIKG FDAMADGSAT TLEGVTVVDP STVKIELSRP DATFLHVMAL
NFASVVPKEA VEAAGADFGK QPVGTGAFKL AEWTLGQRLV FEKNADYWRE GVPYLDSIVF
EVGQEPIVAL LRLQNGEVDV PGDGIPPAKF TEVMADPAQA ERVVEGGQLH TGYITMNVTQ
PPFDNLQVRQ AVNMAINKQR ITQIINGRAI PATQPLPPSM PGYTEGYEGY PHDVEKAKAL
LSEAGFADGF ETELYVMNTD PNPRIAQAIQ QDLSQIGIKA AIQSLAQANV IEAGGNGSAP
MIWSGGMAWI ADFPDPSNFY GPILGCAGAA DGGWNWSKFC DEALDAKATE ADSLADPARA
EERLKLWSDV YMGVMEKAPW VPVFNEQRYT MKSARMGGDD SLYVDPVSIP VNYDYVFVTE