Gene Rsph17025_2156 details


Gene Information

Locus tag: Rsph17025_2156
Symbol:
ID: 5082934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2184746
End bp: 2186377
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 69%
IMG OID: 640483719
Product: extracellular solute-binding protein
Protein accession: YP_001168351
Protein GI: 146278192
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
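
The start, end, gene length and protein length fields are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive (which matches the listed length) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 2184746
END_BP = 2186377


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1632 543"
```

Both results agree with the Gene Length (1632 bp) and Protein Length (543 aa) fields.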


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.470572
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGCG AGAGACTGCA TCCCGCGGCC CTGATGCATC AGGTCGAAGT TGCGGCGGGC 
CGGATGAGCC GACGCGAGTT CCTGAGCCGG GCCACCGCGC TCGGCGTTTC GGCGGGCGCG
GCCTACGGCC TGCTGGGGCT GGCATCCCCC GCCCGCGCGC AAGAGGCGCC GCGGCCCGGC
GGCACCCTGC GCATGGAGAT GGAGACCCGG GCGCTCAAGG ATCCGCGGAC GGCCGACTGG
TCGCAGATCG CGAACTTCAC CCGCGGCTGG CTGGAGTATC TGGTGGAATA CCAGGCCGAC
GGAACCTTCC GGCCGATGCT CCTCGAAAGC TGGGAGGCCA ACGACAACGC CACGGAATAC
CGGCTGAACG TCCGCCCCGG CGTGCGCTGG AGCAACGGCG ACCCCTTCAC CGCCGAGGAT
GTCCGGCACA ATTTCGAGCG CTGGTGCGAC GCCTCGGTCG AGGGGAACGC GATGGCGGCC
CAGATGGTGG CCCTGCAGGC CGAGGGCAAG CTGCGGCCGG ACGCGATCGA AGTGGTGGAC
GAGACGACAC TCCGGCTGAC ACTCTCGCAG CCCGACATCG CGCTGATTGC GAACCTCGCC
GACTATCCGG CTGCGATCGT CCACCCCTCC TACGAGGGCG GCGACCCGGC GGCCAATCCC
GTCGGAACCG GGCCCTATCG GCCGGAAACG GTGGAGGTGG GCATCCGCAT GGTGCTGGTC
CGCAACGAGG CGCATCCCTG GTGGGGCAGC GACGTCCACG GCGGCCCCTG GCTCGACCGG
ATCGAATATC TCGATTTCGG CAGCGATCCC TCGGCCGCCG TGGCCGCGGC GGGATCGGGC
GAGATCGACG CGACCTATCA GACCGTGGGC GAGTTCATCG AGGTGCTGGA CGGGCTTGGC
TGGGACAAGT CCGAAGCCCG CACCGCGACC ACCCTTGCGA TCCGGTTCAA CCAGCAGTCC
GAGGAGTACC GCGACGTCCG CGTGCGCCGG GCGCTGCAGA TGGCGGTGGA CAATGCGGTG
GTGCTGGAAC TGGGCTACTC GGGCCACGGG CATGTGGCCG AGAACCACCA CGTCAGCCCG
ATCCATCCCG AATATGCCGA GCTGCCGCCG CTGACCGTCG ATCGCGCGCA GGCCCGGGCC
CTGCTGGAAG AGGCCGGGAT GGACGGCCAC GAGTTCGAGC TGGTCTCGCT CGACGACGCC
TGGCAGGCGG CCTCGTGCGA CGCCGTGGCG GCGCAGTTGC GGGACGCCGG CGTGTCGATC
CGGCGGACGG TCCTGCCCGG CGCGACCTAC TGGAACGACT GGCTGAAGTT CCCCTTCTCG
GCGACCGAGT GGAACATGCG CCCGCTGGGG GTGCAGGTTC TGGCGCTCGC CTATCGCTCG
GGGGTGCCCT GGAACGAATC CGGCTTTGCC AACAAGGAGT TCGACGCCAA GCTTGATGAG
GCCATGTCGA TCGTCGATCC CGACGGGCGC CGCACCCTCA TGGCCGATCT CGAGCGGATC
CTGCAGGAGG AGGGCGTGCT GATCCAGCCC TACTGGCGCT CGATCTTCCG CCATGTCGAT
CCGAAGGTGA AGGGTGCCGA GGCCCACCCG ACCTTCGAGC ATCACCATTA CAAGTGGTGG
ATCGACGCCT GA
 
Protein sequence
MTSERLHPAA LMHQVEVAAG RMSRREFLSR ATALGVSAGA AYGLLGLASP ARAQEAPRPG 
GTLRMEMETR ALKDPRTADW SQIANFTRGW LEYLVEYQAD GTFRPMLLES WEANDNATEY
RLNVRPGVRW SNGDPFTAED VRHNFERWCD ASVEGNAMAA QMVALQAEGK LRPDAIEVVD
ETTLRLTLSQ PDIALIANLA DYPAAIVHPS YEGGDPAANP VGTGPYRPET VEVGIRMVLV
RNEAHPWWGS DVHGGPWLDR IEYLDFGSDP SAAVAAAGSG EIDATYQTVG EFIEVLDGLG
WDKSEARTAT TLAIRFNQQS EEYRDVRVRR ALQMAVDNAV VLELGYSGHG HVAENHHVSP
IHPEYAELPP LTVDRAQARA LLEEAGMDGH EFELVSLDDA WQAASCDAVA AQLRDAGVSI
RRTVLPGATY WNDWLKFPFS ATEWNMRPLG VQVLALAYRS GVPWNESGFA NKEFDAKLDE
AMSIVDPDGR RTLMADLERI LQEEGVLIQP YWRSIFRHVD PKVKGAEAHP TFEHHHYKWW
IDA
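
The sequences in this record are displayed in space-separated blocks of ten characters, so they need to be normalised before any downstream use. The following is a minimal sketch of how that might be done, along with a GC-content calculation and a basic completeness check. The helper names are hypothetical. The stop codon set (TAA, TAG, TGA) is the one used by translation table 11, the table listed for this gene.

```python
def clean_sequence(text: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Stop codons in translation table 11 (bacterial, archaeal and plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop_in_frame(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# Example using only the last displayed line of the gene sequence.
last_line = clean_sequence("ATCGACGCCT GA")
print(last_line, ends_with_stop_in_frame(last_line))
```

To check the whole gene, paste the full Gene sequence block into `clean_sequence` and compare the output of `gc_percent` with the GC content field.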