Gene Smed_0226 details


Gene Information

Locus tag: Smed_0226
Symbol:
ID: 5321058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 250372
End bp: 251883
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 64%
IMG OID: 640789161
Product: ABC transporter related
Protein accession: YP_001325920
Protein GI: 150395453
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
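
The gene and protein lengths listed above are consistent with the start and end coordinates. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both are common conventions, but assumptions here, not statements from the record). The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the record above.
print(cds_lengths(250372, 251883))  # (1512, 503)
```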


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.973654
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.104566
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCCG CCATCGCGCT TGAGGGCATA TCGAAGTCCT TTCCGGGCGT GCGCGCGCTT 
TCCGATGTCT CACTCGCGCT CTATCCGGGC TCGGTGACGG CGCTCGTCGG CGAGAACGGC
GCCGGCAAGT CGACCCTCGT GAAGATACTG ACCGGCATTT ATCAACCCGA TGCGGGCGCA
ATCCGGCTCG CTGACAGGGA GACGACATTT CCGACAGCCC TTGCCGCGTC CCGCGCCGGC
GTGACCGCGA TCCACCAGGA GACGGTCCTC TTCGACGAGC TCTCCGTCGC GGAGAATATT
TTCCTCGGTC ATGCTCCGCG CAATCGCTTC GGCCTCATCG ACTGGAAGAA GCTCAACGCC
GACGCCAAGA CCCTGTTGAA CCGGGCAGGC GCCGATTTCG ACCCGACGAT CCGCCTCCGC
GACCTCGGCA TCGCCAAGAA GCACCTGGTC GCGATCGCCC GGGCGCTCTC GGTCGATGCG
CGCGTCGTCA TCATGGACGA GCCGACGGCC GCTCTGTCGC ACAAGGAAAT TCACGAGCTC
TACGCGCTGA TCGAACGGCT CAAGGCCAAC GGCAAGGCCA TCCTCTTCAT CAGTCACAAA
TTCGATGAGA TCTTCCGCAT CGCCGACCGC TACACCGTCT TCCGCGACGG AGCGATGATC
GGCGAAGGGC TGATCGCCGA TGTCAGCCAG GACGATCTCG TCCGCATGAT GGTCGGCCGC
GCGGTCGGCT CCGTGTACCC GAAGAAGGAG GTGGCCATCG GTCAGCCGGT GCTTACCGTT
TCCGGTTATC GCCACCCGAC CGAATTCGAG GACATCAACT TCGAGCTCAG GCGCGGCGAG
ATTCTCGGCT TCTATGGCCT CGTCGGCGCG GGACGTTCGG AGTTCATGCA GTCGCTGATC
GGCATCACCC GGCCGTCGGC CGGTGCGGTC AAGCTCGATG GGGAGGTGCT GGTTATCCGC
AGCCCGGCGG AGGCGATCCG CGCCGGCATC GTCTATGTGC CGGAAGAGCG CGGGCGGCAG
GGGGCGATCA TCGGCATGCC GATCTTTCAG AACATCACGC TGCCATCGCT CTCGCAGACC
TCGCGTTCGG GGTTCCTGAG GCTCGCCCAG GAATTCGCCT TGGCACGCGA ATATACCTCG
CGTCTCGACC TGCGCGCCGC CTCGCTCGAT CAGGATGTCG GCACGCTGTC CGGTGGAAAC
CAGCAGAAGG TGGTGATCGC CAAGTGGCTC GCCACCCGGC CGAAGGTCAT CATCCTCGAC
GAGCCGACCA AAGGCATAGA CATCGGATCC AAGGCTGCCG TCCATGCCTT CATGAGCGAA
CTCGCCGCCC AGGGCCTGAG CGTCATCATG GTATCCTCGG AAATCCCGGA GATCATGGGA
ATGTCCGACC GCGTCATCGT CATGCGCGAG GGCCGCGTCG CCGGAAGGTT CGAACGATCG
GAACTGACTG CCGAGAAGCT GGTGCGGGCT GCCGCGGGCA TCGAAACGCA AGCCGGCGGG
GGAGCAGCAT GA
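
The GC content reported for this gene can be recomputed from the sequence above. The sketch below shows one way to do that in Python; `gc_percent` is a hypothetical helper, and it assumes the sequence is pasted as shown, in space-separated blocks of 10 bases, so whitespace is stripped before counting.

```python
def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block spacing and line breaks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Small illustrative input: 6 of the 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence to the same function gives a value to compare with the GC content listed in the record.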
 
Protein sequence
MKPAIALEGI SKSFPGVRAL SDVSLALYPG SVTALVGENG AGKSTLVKIL TGIYQPDAGA 
IRLADRETTF PTALAASRAG VTAIHQETVL FDELSVAENI FLGHAPRNRF GLIDWKKLNA
DAKTLLNRAG ADFDPTIRLR DLGIAKKHLV AIARALSVDA RVVIMDEPTA ALSHKEIHEL
YALIERLKAN GKAILFISHK FDEIFRIADR YTVFRDGAMI GEGLIADVSQ DDLVRMMVGR
AVGSVYPKKE VAIGQPVLTV SGYRHPTEFE DINFELRRGE ILGFYGLVGA GRSEFMQSLI
GITRPSAGAV KLDGEVLVIR SPAEAIRAGI VYVPEERGRQ GAIIGMPIFQ NITLPSLSQT
SRSGFLRLAQ EFALAREYTS RLDLRAASLD QDVGTLSGGN QQKVVIAKWL ATRPKVIILD
EPTKGIDIGS KAAVHAFMSE LAAQGLSVIM VSSEIPEIMG MSDRVIVMRE GRVAGRFERS
ELTAEKLVRA AAGIETQAGG GAA
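
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical, and the function assumes the input contains only A, C, G and T.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGCCCG CCATCGCGCT TGAGGGCATA"))  # MKPAIALEGI
```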