Gene Rsph17025_1147 details


Gene Information

Locus tag: Rsph17025_1147
Symbol:
ID: 5084579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 1181348
End bp: 1182982
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 68%
IMG OID: 640482705
Product: extracellular solute-binding protein
Protein accession: YP_001167353
Protein GI: 146277194
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
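
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names gene_length and protein_length are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1181348, 1182982)
print(length)                  # 1635
print(protein_length(length))  # 544
```

Both results agree with the Gene Length and Protein Length values listed in this record.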


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGGGC CCGCCGCTGC GGGTCCGCGA TTGCAGCGGG CGCCGGATCC CGGCCGCCCA 
TGGGAAGGAG CCGGCGCGTC GCCCCGGCCG CCTCCCTCGC CTGACGGAAC GGGAGAGACG
GTCGCCAGTT CCCCGGCGGG CCGGGAACCA TCGGGCGCGA GGCGGCGTTG CGGGTCAGGA
AGCGGGCATC TGCCGCATGC GCTTGTGCTT TCCAGCCACG GGAGGGGTGG GAGGGCCGGA
GGAGCGCCGC ATCCGGGTTC GTTTCGGTGG GGGGAGGACG CAATGAAGGC GATGGTTCCC
GCGATGGCGG CCACGCTCGC GCTGGCATCC GGCCCGCTCG CGGGGCAGGA GCTGACGTTT
CCTCCCGGCG AGGACGACCG CTTTCACTGG AACAGTTTCG AGACCCTCAG GGAGCAGGAT
TTCTCTGGGC AGCGGCTCAC GATCCTCGGG CCGTGGCTCG GCCCGGACCG GACGCTGTTC
AATTCCGTCA TCGCCTATTT CGAGGCCGCG ACCGGAGCGG CCGTCACCTA CAACGGCTCG
GACAATTTCG AGCAGCAGAT CGTGATCGAC GCGAGCGCGG GCTCGCCGCC GGACATCGCG
ATCTTTCCCC AGCCCGGCCT TGCGCGGGAC CTGGCCTCGA AGGGGCAGCT GGCGCCCCTC
GATCCGTCGC TCGGCGAGTG GCTGCGCGAG AATTACGCGG CCGGCGACAG CTGGGTGAAT
CTTGGCACAT TTCCCGGCCG GGACGGGCAG GAGGCTCTCT ACGGGTTCTT CTACAAGATC
GATGTGAAGT CGCTCGTCTG GTATGTGCCC GAGAATTTTG CCGACTTCGG CTATGAGGTT
CCGGGCACGA TGGAGGAGCT TCTGGCGCTG TCCGAGCGGA TGGTGGAGGA TGGGGTGACG
CCCTGGTGCA TCGGCCTCGC CTCGGGCGGG GCCACCGGCT GGCCCGCGAC CGACTGGGTC
GAGGACATGA TGCTGCGCAT CAACCCGCCC GAAGTCTATG ACCAGTGGAC CCTGAACGAG
ATCCCGTTCG ACGATCCGCA GGTGGTGGCG GCGATCGAGG AGTTCGGCCG GTTCGCGCGC
GACGGACGCT TCGTGGCCGG TGGGCCCAAT GCGGTGGCCG CGACCGATTT CCGCGACAGC
CCCAAGGGTC TCTTCGCCGC CCCGACGCAA TGCTTCATGC ACAAGCAGGC AAGCTTCATC
CCCTCCTTCT TTCCCGAGGG GACGGTGATC GGCGAGGATG CCGACTTCTT CTACCTTCCC
GCCTACGAGA GCCGCGACCT GGGCCAGCCG GTGCTGGGTG CGGGAACGGT CTTCGGCATC
ACCCGCGACA CGCCGGTGGC GCGCGCCTTC ATCGACTTTC TCAAGACGCC GATCGCGCAC
GAGGTCTGGA TGGCCCAGAC CGGCTTTCTC ACGCCGCACA CGGGCGTGAA CACCGATGTC
TATGGCGATC CCACGCTGCG CAAGATGGGC GACATCCTGC TCGAGGCCAC GACCTTCCGC
TTCGACGGAT CCGACCTGAT GCCGGGCGCG GTGGGCGCAG GCGCCTTCTG GACCGGAATG
ATCGACTACA TGGGCGGACA GCCGGCCGAG ACCGTGGCGG CCGGCATCCA GCGCACCTGG
GACACGTTCA AGTGA
 
Protein sequence
MEGPAAAGPR LQRAPDPGRP WEGAGASPRP PPSPDGTGET VASSPAGREP SGARRRCGSG 
SGHLPHALVL SSHGRGGRAG GAPHPGSFRW GEDAMKAMVP AMAATLALAS GPLAGQELTF
PPGEDDRFHW NSFETLREQD FSGQRLTILG PWLGPDRTLF NSVIAYFEAA TGAAVTYNGS
DNFEQQIVID ASAGSPPDIA IFPQPGLARD LASKGQLAPL DPSLGEWLRE NYAAGDSWVN
LGTFPGRDGQ EALYGFFYKI DVKSLVWYVP ENFADFGYEV PGTMEELLAL SERMVEDGVT
PWCIGLASGG ATGWPATDWV EDMMLRINPP EVYDQWTLNE IPFDDPQVVA AIEEFGRFAR
DGRFVAGGPN AVAATDFRDS PKGLFAAPTQ CFMHKQASFI PSFFPEGTVI GEDADFFYLP
AYESRDLGQP VLGAGTVFGI TRDTPVARAF IDFLKTPIAH EVWMAQTGFL TPHTGVNTDV
YGDPTLRKMG DILLEATTFR FDGSDLMPGA VGAGAFWTGM IDYMGGQPAE TVAAGIQRTW
DTFK