Gene Smed_5747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5747 
Symbol 
ID5320049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp714631 
End bp715782 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content65% 
IMG OID640777460 
Productputative integral membrane protein 
Protein accessionYP_001314392 
Protein GI150377797 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.514564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGCT TAAAAGCGCT GACTTGGGAC CATCCGCGCG GCTATAACGC GCTGGCCGCC 
GCGGCTCGCC GATTGGATTT GGCCGAAAGC GGCCTGGCAA TCGACTGGGA CAAGCAGCCG
CTTGAAGGAT TCGAGTCCTA TCCGATCGCC GATCTCTGCG CCCGTTACGA CCTGGTCGTG
CTCGATCATC CCCATGTCGG CGAGGCGGTG GATGGCGATT GCCTGCAGCC GCTGGAGAGC
ATCTTCGAGG AGGCGACGAT CAACCTGCTG AGAGCCGAGA GCATCGGCCC CTCCTTGCGC
AGTTACCACT TCACTGGCCA ACACTGGGCC CTTCCGCTCG ATGCGGCGAC CCAGGTGATG
GCTGCGCGCG CCGACCTGCT CGTCGGCCCC GCCCCCGTTC TCTGGGACGA GGTGCTGTTG
CTGTCGCAGA AAACCGGCAA GGTGGCGCTG TCGCTGGCCG GACCGCATGC CGCACTCTCC
TTCCTGTCGA TAGCCACGGC GCTCGGCGAG CCGCCGGCCG AGCGGGATCC AGACATTCTG
GTCTCGGAGC AAGTCGGCAC CGAAGTCTAC GACCTAATGA ACGAGCTTGC GGCTCGCAGC
CCGCATGTGG TTCGCCAAAA GAACCCGATC AGTATCCTCG AGCACATGGC GGCCCACGAC
GACGTCGCCC TCGTGCCGCT GGTCTACGGC TACGTGAACT ATGCCGCGCC GGTAAGCGGC
CGGCCGATCA CCTTCCACAA TGCGCCGCGG CTAGAACCTG GCGACCGTCC CGGCTCCACT
CTCGGCGGCA CCGGAATCGG CATATCCCGC CGCTGCGAGG TGACGCCGGC ACTGAAGCGC
CACCTGCTTT GGCTGATGAG CGCCGACGCG CAGATCGGCT TCATACCGTG CCATGAAGGT
CAGCCGTCCC GGCGGGAAGC CTGGCATGAT GCAGGCGTGA ATGCCCGCTG GGGCAGGTTC
TATTCGAACA CCGTCGCCAC GCTGGAGCAG GCCTATGTGC GTCCGCGCCA CAACGGCTAC
ATCGCGTTCC AAAGCAGGGC TTCCGCCCTG CTGCGCGAGT CATTCCTCGA GAATGCGCCG
GCCAGGGGCG TGATCAACCG ACTCCAGACA CTTTATGCAG ATCATCGCGG CAGCAAGGGC
GGCGAAAGGT AG
 
Protein sequence
MQRLKALTWD HPRGYNALAA AARRLDLAES GLAIDWDKQP LEGFESYPIA DLCARYDLVV 
LDHPHVGEAV DGDCLQPLES IFEEATINLL RAESIGPSLR SYHFTGQHWA LPLDAATQVM
AARADLLVGP APVLWDEVLL LSQKTGKVAL SLAGPHAALS FLSIATALGE PPAERDPDIL
VSEQVGTEVY DLMNELAARS PHVVRQKNPI SILEHMAAHD DVALVPLVYG YVNYAAPVSG
RPITFHNAPR LEPGDRPGST LGGTGIGISR RCEVTPALKR HLLWLMSADA QIGFIPCHEG
QPSRREAWHD AGVNARWGRF YSNTVATLEQ AYVRPRHNGY IAFQSRASAL LRESFLENAP
ARGVINRLQT LYADHRGSKG GER