Gene Rsph17025_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1201 
Symbol 
ID5084485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1243317 
End bp1245035 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content66% 
IMG OID640482759 
Productextracellular solute-binding protein 
Protein accessionYP_001167407 
Protein GI146277248 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.487559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.842714 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATACG GCACGACAGC CATGGCGCTG GCGCTGATGG CGCTGGGGGC ACCGGCCTTC 
GCCGACATCG AGGCTGCGCG GCAGTTCCTC GATGCCGAGA TCGGCGACAT GTCCTCGCTC
ACGCGCGAGG AGCAGGAAGC CGAGATGCAA TGGTTCATCG ACGCGGCGCA GCCCTTCGCC
GGCATGGACA TCAAGGTGGT CTCGGAGACC ATCACCACGC ATGAATATGA ATCCAAGGTG
CTGGCGCCCG CCTTCACCGC GATCACGGGC ATCCGGGTCA GCCACGACCT GATCGGCGAA
GGCGACGTGG TCGAGAAGCT GCAGACGCAG ATGCAGTCGG GCGAGAACAT CTATGACGCC
TACATCAACG ACAGCGACCT GATCGGCACC CACTGGCGCT ACCAGCAGGC CCGCAGCCTG
ACCGACTGGA TGGCGAACGA GGGCCAGGAC GTCACCAACC CCGGCCTCGA TCTCGACGAT
TACATCGGCC TGAAGTTCAC CACCGCACCG GATGGCGAGC TTTACCAGCT GCCCGACCAG
CAGTTCGCCA ACCTCTACTG GTTCCGCGCC GACTGGTTCG ACGATCCAGA GACCAAGGCC
GACTTCCAGG AGAAGTACGG CTACGAGCTG GGCGTGCCGC TGAACTGGTC GGCCTACGAG
GACATCGCCG AGTTCTTCAC GGGGCGCGAC ATGAGCGCGC TCGGCGGGCC GACGAGCGCC
TATGGCAGCA TGGATTACGG CAAGAAGGAC CCGAGCCTCG GCTGGCGCTA CACCGACGCC
TGGATGTCGA TGGCGGGCAT GGGCGACAAG GGCGATCCGA ACGGTCTGCC GGTCGATGAA
TGGGGCATCC GGGTGGACGA GAACTCGCGC CCCGTGGGCT CCTGCGTGGC GCGCGGCGGC
GCGACCAACG ACGCGGCGGC GGTCTATGCG ATCACCAAGG CGATCGAATG GCTGCAGAAA
TACGCCCCGC CGCAGGCCGC CGGCATGACC TTCTCGGAAT CCGGGCCGGT GCCCGCGCAG
GGCGAGGTCG CCCAGCAGAT CTTCTGGTAC ACCGCTTTCA CCGCCGACAT GGTCAAGGAG
GGCCTGCCGG TGATGAACGA GGATGGCACG CCCAAGTGGC GCATGGCCCC CTCGCCGCAT
GGCGCCTACT GGACCGAAGG CACCAAGGTC GGCTACCAGG ACGTGGGCTC GTGGACGCTG
CTGAAATCCA CCCCCGACGA CCGCGCCAAG GCCGCCTGGC TCTACGCCCA GTTCGTCTCG
TCCAAGACCG TGGACGTGAA GAAGAGCCAC GTCGGCCTGA CCTTCGTGCG CGAATCCACC
ATCCAGCACC AGAGCTTCAC CGACCGCGCG CCCAATCTTG GCGGTCTGGT CGAGTTCTAC
CGCTCGCCCG CCCGCGTCCA GTGGTCGCCC ACGGGGACGA ACGTGCCGGA TTACCCGAAG
CTCGCGCAGC TCTGGTGGCA GAACATCGGC GATGCGATGT CGGGCGCCAA GTCGCCGCAG
GAGGCTCTGG ACGCGCTCTG CGCCGAGCAG GAGCGGGTGC TGGCGCGGCT GGAACGGGCC
GGCGTGCAGG GTGATCTCGG GCCGAAGCTG AACGAGGAGA AGGACCCGCA GGAATGGCTC
GACGCGCCCG GCGCGCCGGT GGGCAAGCTC GAGAACGAGA AACCCGCGGG TGAGACGATC
CCCTACGACG AACTCATCAA GTCCTGGCAG CAGGGCTGA
 
Protein sequence
MRYGTTAMAL ALMALGAPAF ADIEAARQFL DAEIGDMSSL TREEQEAEMQ WFIDAAQPFA 
GMDIKVVSET ITTHEYESKV LAPAFTAITG IRVSHDLIGE GDVVEKLQTQ MQSGENIYDA
YINDSDLIGT HWRYQQARSL TDWMANEGQD VTNPGLDLDD YIGLKFTTAP DGELYQLPDQ
QFANLYWFRA DWFDDPETKA DFQEKYGYEL GVPLNWSAYE DIAEFFTGRD MSALGGPTSA
YGSMDYGKKD PSLGWRYTDA WMSMAGMGDK GDPNGLPVDE WGIRVDENSR PVGSCVARGG
ATNDAAAVYA ITKAIEWLQK YAPPQAAGMT FSESGPVPAQ GEVAQQIFWY TAFTADMVKE
GLPVMNEDGT PKWRMAPSPH GAYWTEGTKV GYQDVGSWTL LKSTPDDRAK AAWLYAQFVS
SKTVDVKKSH VGLTFVREST IQHQSFTDRA PNLGGLVEFY RSPARVQWSP TGTNVPDYPK
LAQLWWQNIG DAMSGAKSPQ EALDALCAEQ ERVLARLERA GVQGDLGPKL NEEKDPQEWL
DAPGAPVGKL ENEKPAGETI PYDELIKSWQ QG