Gene Smed_4467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4467 
Symbol 
ID5318169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp947971 
End bp949008 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content61% 
IMG OID640776268 
ProductD-xylose ABC transporter, periplasmic substrate-binding protein 
Protein accessionYP_001313200 
Protein GI150376604 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID[TIGR02634] D-xylose ABC transporter, substrate-binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0159284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCCA TTTTGAAACT GATGGCCGGG GCTGCCATCA TAGCGTCCAT GCATTCCGCG 
GCGATCGCCA AGGATCTCGT CATCGGCGTT TCCTGGTCAA ACTTCCAGGA GGAGCGCTGG
AAGACCGATG AGGCCGCCAT CAAGACCGCG CTGGAAGCCT CGGGCGACAA ATACATTTCG
GCCGATGCAC AGTCTTCAGC AGCAAAGCAG CTCACCGACA TCGAATCGCT AATCGCCCAG
GGCGCCAACG CACTGATCGT GCTTGCGCAG GACTCCGATG CGATCGGTCC GGCTATCGAG
AAGGCCGCTG CCGAGGGGAT CCCGGTCGTC GGCTATGACC GCCTGATCGA AAACCCGGCC
GCCTTCTACA TCACCTTTGA CAACAAGGAA GTCGGCCGCC TGCAAGCGAG CGAGGTGTTC
AAGCAGAAGC CGGAAGGCAA CTACGTCTTC ATCAAGGGCT CCTCCGCCGA TCCGAACGCC
GACTTCCTTT TCTCAGGACA GATGGAAGTC CTGAAGGATG CCATCGATGC GGGCAAGATC
AAGAATGTCG GCGAGGCCTA TACCGATGGC TGGAAGCCGG AAAACGCCCA GAAGAACATG
GAACAGTTCC TGACGGCTAA CGACAACAAG GTCGATGCGA TCGTGGCCTC GAACGACGGG
ACCGCCGGCG GCGCGATCGC GGCACTCGAC GCCCAGGGCC TTGCCGGTTC GGTTCCTGTG
TCCGGCCAAG ATGCCGACAA GGCAGCGCTG AACCGCGTCG CTCGCGGCAC GCAGACGGTT
TCGGTGTGGA AGGACTCCCG CGAACTCGGT AAGAAAGCGG CAGAGATTGC CGCGGCGCTT
GCCGCCGGCA AGACCATGGA TGAAATCGAA GGCGTCCAGA CCTTTGACGG CGGCCCCAAG
GGCGTGGCCA TGAAATCCGT TTTCCTGGCA CCGCTGGCGA TCACCAGGGA CAATCTCAAT
GTCGTCATCG ATGCCGGCTG GATTGCCAAG GAAGAGACCT GCCAGGGCGC CAAGGACGAC
GTGGCTGCGT GCAAGTAA
 
Protein sequence
MKSILKLMAG AAIIASMHSA AIAKDLVIGV SWSNFQEERW KTDEAAIKTA LEASGDKYIS 
ADAQSSAAKQ LTDIESLIAQ GANALIVLAQ DSDAIGPAIE KAAAEGIPVV GYDRLIENPA
AFYITFDNKE VGRLQASEVF KQKPEGNYVF IKGSSADPNA DFLFSGQMEV LKDAIDAGKI
KNVGEAYTDG WKPENAQKNM EQFLTANDNK VDAIVASNDG TAGGAIAALD AQGLAGSVPV
SGQDADKAAL NRVARGTQTV SVWKDSRELG KKAAEIAAAL AAGKTMDEIE GVQTFDGGPK
GVAMKSVFLA PLAITRDNLN VVIDAGWIAK EETCQGAKDD VAACK