Gene Rsph17029_1727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1727 
Symbol 
ID4898068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1821242 
End bp1822552 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content66% 
IMG OID640112320 
Productextracellular solute-binding protein 
Protein accessionYP_001043609 
Protein GI126462495 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAA GATTTCGCGC CCTGATGGGC GCGTGCGCCG TGGCTGCGCT CTCGTCCGCC 
GCCGGCGCCG AAACCATCAC CGTGGCGACT GTCAACAACG GCGACATGAT CCGCATGCAG
GGGCTCATGT CCGAGTTCAA CGCGCAGCAC CCCGACATCA CCGTCGAGTG GGTGACGCTC
GAGGAAAACG TACTGCGCCA GAAGGTCACG ACCGACATCG CCACCAAGGG CGGGCAGTTC
GACGTGCTGA CCATCGGCAC CTACGAGGTT CCGATCTGGG GCAAGCAGGG CTGGCTCGTG
AGCCTGAACG ACCTGCCGCC GGAGTATGAT GCCGACGACA TCCTGCCCGC GATCCGCAAC
GGCCTCACCG TCGACGGCGA GCTCTATGCC GCGCCCTTCT ACGGCGAGAG CTCGATGATC
ATGTATCGCA AGGACCTGAT GGAGAAGGCG GGGCTGACCA TGCCCGACGC CCCCACCTGG
GACTTCGTGA AGGAAGCGGC GCAGAAGATG ACCGACAAGG ATGCCGAGGT CTACGGCATC
TGCCTGCGCG GCAAGGCGGG CTGGGGCGAG AACATGGCCT TCCTCACCGC CATGGCCAAC
AGCTACGGCG CGCGCTGGTT CGACGAGAAC TGGCAGCCGC AGTTCGATGG CGAGGCCTGG
AAGGCCACGC TGACCGACTA TCTCGACATG ATGACGAACT ACGGCCCGCC CGGCGCCTCG
AACAACGGCT TCAACGAGAA CCTCGCGCTG TTCCAGCAGG GCAAGTGCGG CATGTGGATC
GACGCGACGG TGGCCGCCTC CTTCGTGACC AACCCCGAGG AATCCACGGT GGCCGACAAG
GTGGGCTTCG CGCTCGCCCC CGATACCGGC AAGGGCAAGC GGGCCAACTG GCTCTGGGCC
TGGAACCTCG CGATCCCGGC GGGCTCGCAG AAGGTCGATG CCGCCAAGCA GTTCATCGCC
TGGGCGACCT CGAAGGACTA TGCCGAGCTG GTGGCTTCGA AGGAAGGCTG GGCCAATGTG
CCTCCGGGGA CGCGGACCTC GCTCTACGAG AATCCGGAAT ATCAGAAGGT GCCGTTCGCG
AAGATGACGC TCGACAGCAT CAACGCGGCT GACCCGACCC ACCCGGCCGT CGATCCGGTG
CCTTATGTCG GTGTGCAGTT CGTGGCGATC CCCGAGTTCC AGGGCATCGG CACCGCCGTG
GGCCAGCAGT TCTCGGCGGC TCTCGCGGGC TCGATGTCGG CCGCGCAGGC GCTTCAGGCG
GCCCAGCAGT TCACGACGCG CGAAATGACC CGCGCGGGCT ACATCAAGTA A
 
Protein sequence
MTARFRALMG ACAVAALSSA AGAETITVAT VNNGDMIRMQ GLMSEFNAQH PDITVEWVTL 
EENVLRQKVT TDIATKGGQF DVLTIGTYEV PIWGKQGWLV SLNDLPPEYD ADDILPAIRN
GLTVDGELYA APFYGESSMI MYRKDLMEKA GLTMPDAPTW DFVKEAAQKM TDKDAEVYGI
CLRGKAGWGE NMAFLTAMAN SYGARWFDEN WQPQFDGEAW KATLTDYLDM MTNYGPPGAS
NNGFNENLAL FQQGKCGMWI DATVAASFVT NPEESTVADK VGFALAPDTG KGKRANWLWA
WNLAIPAGSQ KVDAAKQFIA WATSKDYAEL VASKEGWANV PPGTRTSLYE NPEYQKVPFA
KMTLDSINAA DPTHPAVDPV PYVGVQFVAI PEFQGIGTAV GQQFSAALAG SMSAAQALQA
AQQFTTREMT RAGYIK