Gene Rsph17025_1681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1681 
Symbol 
ID5083413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1724797 
End bp1726107 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content66% 
IMG OID640483239 
Productextracellular solute-binding protein 
Protein accessionYP_001167879 
Protein GI146277720 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.901925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAA GATTTCGCGC CCTGATGGGG GCGTGCGCCG TGGCTGCGCT TTCGACCGCC 
GCAGGCGCCG AAACCATCAC CGTGGCGACC GTCAACAACG GCGACATGAT CCGCATGCAG
GGGCTCATGT CCGAGTTCAA CGCGCAGCAT CCCGACATCA CGGTCGAGTG GGTCACGCTC
GAGGAAAACG TGCTGCGCCA GAAGGTCACG ACCGACATCG CCACCCGGGG CGGGCAGTTC
GACGTGCTGA CCATCGGCAC CTACGAGGTG CCGATCTGGG GCAAGCAGGG CTGGCTCGTG
AGCCTGAACG ACCTGCCGCC CGAATATGAC GCCGACGACA TCCTGCCCGC GATCCGCAAC
GGCCTGACCG TCGATGGCGA GCTCTATGCC GCGCCCTTCT ACGGCGAAAG CTCGATGATC
ATGTACCGGA CGGACCTGAT GGAGAAGGCC GGGCTGACCA TGCCCGACGC CCCCACCTGG
GAATTCGTCA AGGAAGCCGC CGCCAAGATG ACCGACAAGG ATGCCGAGAT CTACGGCATC
TGCCTGCGCG GCAAGGCCGG CTGGGGCGAG AACATGGCGT TCCTGACCGC CATGGCCAAC
AGCTACGGCG CGCGCTGGTT CGACGAGAAC TGGCAGCCGC AGTTCGACGG CGAGGCCTGG
AAGGCCGCGC TGACCGATTA TCTCGACCTG ATGACGAACC ACGGGCCTCC GGGCGCCTCG
AACAACGGCT TCAACGAGAA CCTCGCGCTG TTCCAGCAAG GCAAGTGCGG CATGTGGATC
GACGCGACGG TTGCGGCCTC GTTCGTGACC AACCCCGCGG AATCGACCGT GGCCGACCAG
GTGGGCTTCG CGCTGGCGCC CGACACCGGC AAGGGCAAGC GGTCCAACTG GCTCTGGGCC
TGGAACCTCG CGGTGCCGGC GGGGTCGCAG AAGGTGGATG CCGCCAAGCA GTTCATCGCC
TGGGCAACCT CGAAGGACTA CGCCGAGCTG GTCGCCTCGA AGGAGGGCTG GGCCAACGTG
CCTCCGGGGA CGCGAGCCTC GCTCTACGAG AACCCGGAAT ACCAGAAGGT GCCCTTCGCG
CAGATGACGC TGGAGAGCAT CAACGCGGCT GATCCGACCA ACCCGGCCGT CGATCCGGTG
CCTTACGTCG GTATCCAGTT CGTGGCGATC CCCGAGTTCC AGGGCATCGG CACGGCTGTC
GGCCAGCAGT TCTCGGCGGC GCTTGCCGGG TCGATGTCGG CCGAACAGGC GCTGGCCGCG
GCACAAGCCT TCACAACGCG CGAGATGACC CGCGCCGGCT ACATCAAGTA A
 
Protein sequence
MTARFRALMG ACAVAALSTA AGAETITVAT VNNGDMIRMQ GLMSEFNAQH PDITVEWVTL 
EENVLRQKVT TDIATRGGQF DVLTIGTYEV PIWGKQGWLV SLNDLPPEYD ADDILPAIRN
GLTVDGELYA APFYGESSMI MYRTDLMEKA GLTMPDAPTW EFVKEAAAKM TDKDAEIYGI
CLRGKAGWGE NMAFLTAMAN SYGARWFDEN WQPQFDGEAW KAALTDYLDL MTNHGPPGAS
NNGFNENLAL FQQGKCGMWI DATVAASFVT NPAESTVADQ VGFALAPDTG KGKRSNWLWA
WNLAVPAGSQ KVDAAKQFIA WATSKDYAEL VASKEGWANV PPGTRASLYE NPEYQKVPFA
QMTLESINAA DPTNPAVDPV PYVGIQFVAI PEFQGIGTAV GQQFSAALAG SMSAEQALAA
AQAFTTREMT RAGYIK