Gene Rsph17025_0372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0372 
Symbol 
ID5082208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp367423 
End bp369012 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content65% 
IMG OID640481924 
Productextracellular solute-binding protein 
Protein accessionYP_001166583 
Protein GI146276424 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.132597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.149219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATCC GAGCGCGCTT GCTGGCGACG GCGGCCGTTG CCGCATTCGC CCTGACCACG 
GCACCGCTTT CCGCCGAGAC GCCGCCCGAC ACCTTCATCC AAGCCTGGGC GATCGACGAC
ATGATCACCC TCGACCCGGC CGAGGTGTTC GAGTTCACCG CCTCCGAAAT CATCGGCAAC
AGCTACGAGA CGATCATCGG CTACGATGTG AACGACGTCT CGAACATCTT CGGCCGCGTG
GCGGAAAGCT GGGAGCTGTC CGAAGACGGC AAGACCATGA GCTTCACGGT CCGTCAGGGC
AAGAAGTTCG CCTCGGGCAA CGACCTGACC GCCGAGGATG TGGTCTACAG TCTCGTCCGC
GCGGTCAAGC TCGACAAGTC GCCGGCCTTC ATCCTGGGCC AGTTCGGCCT GACCCCCGAC
AACGTCGAGG AGAAGATCGT CCAGACCGGC GACCATTCCT TCACCTTCGA GATGGACAAG
GCCTATGCGC CGACGTTCCT GCTCTACTGC CTGACGGCGA CGGTCGCCGC GGTGGTGGAC
AAGGATCTCG TCCAGTCGAA CGAGGTGGAC GGCGACTGGG GCTACAACTG GCTCAAGACC
AACTACGCCG GCTCGGGCCC GTTCACGATC CGCGAGTGGC GCGCCAACGA GGCCGTCGTG
ATGGAGCGGA ACGACAACTG GGACGGCGAG ACGCCCGCGA TGGCACGCGC GATCTACCGC
CACATCCCCG AAGCCGCGAC CGAGCGGCTG CTGCTCGAGC AGGGCGACAT CGACATCGCG
CGCAAGCTGC TGCCCGAAGA GATCGAGGCG CTGAGCCAGA ACCCCGACAT CAAGATCCAG
AGCGGGGTGA AGGGCACGAT CTTCTACCTC GGCCTGAACC AGAAGAACGA GAACCTGGCC
AAGCCCGAGG TGCGCGAGGC GATGAAATGG CTCGTGGACT ACGACGCCAT CGCCGAGACG
CTGGTCAAGG GCATGAAGAA GAAGCACCAG ACCTTCCTGC CGGAGGGCTT CCTCGGCGCG
CTGGACGAAA ACCCCTACAG CTTCGATCCC GCCAAGGCCA AGGAGCTTCT GGCCGCGGCC
GGTCTGCCCG ACGGCTTCAC CGTCACGATG GACACCCGCA ACACGCCCGA AGTGACCTCG
ATCGCGCAGG CGATCCAGCA GACCATGGCT GAGGCCGGCA TCCGCATCGA GATCATCCCC
GGCGACGGCG GCCAGACGCT CGAGAAATAC CGCGCCCGCA CGCATGACAT CTACATCGGC
CAGTGGGGCC CCGACTATCA GGACCCGCAC ACCAACGCGA CCTTCGCGCA GAACCCCGAC
AATTCGGACA CGGCGGCGTC GAAGCCGCTC GCCTGGCGCA ACGCCTGGGA GATCCCCGAG
CTGACCGCCA AGGCCGATGC CGCGGTGCTC GAACGCGACA CCGACAAGCG CGCCGAGATG
TATCGCGAGA TGCAGCGCGA GGTGCTCGAA ACCTCGCCCT TCGTGATCAT GTTCCAGGAA
TCCGAGGTCG TCGCCATGCG CAAGAACGTC GAGGGCTACA TCATCGGCCC CTCGTTCAAC
GACAACTCGT TCCGGGCCGT GACGAAGTAA
 
Protein sequence
MFIRARLLAT AAVAAFALTT APLSAETPPD TFIQAWAIDD MITLDPAEVF EFTASEIIGN 
SYETIIGYDV NDVSNIFGRV AESWELSEDG KTMSFTVRQG KKFASGNDLT AEDVVYSLVR
AVKLDKSPAF ILGQFGLTPD NVEEKIVQTG DHSFTFEMDK AYAPTFLLYC LTATVAAVVD
KDLVQSNEVD GDWGYNWLKT NYAGSGPFTI REWRANEAVV MERNDNWDGE TPAMARAIYR
HIPEAATERL LLEQGDIDIA RKLLPEEIEA LSQNPDIKIQ SGVKGTIFYL GLNQKNENLA
KPEVREAMKW LVDYDAIAET LVKGMKKKHQ TFLPEGFLGA LDENPYSFDP AKAKELLAAA
GLPDGFTVTM DTRNTPEVTS IAQAIQQTMA EAGIRIEIIP GDGGQTLEKY RARTHDIYIG
QWGPDYQDPH TNATFAQNPD NSDTAASKPL AWRNAWEIPE LTAKADAAVL ERDTDKRAEM
YREMQREVLE TSPFVIMFQE SEVVAMRKNV EGYIIGPSFN DNSFRAVTK