Gene Smed_4317 details

Gene Information

Locus tag: Smed_4317
Symbol:
ID: 5318892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 814420
End bp: 815742
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 62%
IMG OID: 640776122
Product: monosaccharide-transporting ATPase
Protein accession: YP_001313055
Protein GI: 150376459
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components
TIGRFAM ID:
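
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of the record.
START_BP = 814420
END_BP = 815742


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1323 440
```

Under these assumptions the computed values agree with the Gene Length (1323 bp) and Protein Length (440 aa) fields.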


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.769006
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0636141
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTTTC TGAAGGCTCT CTCCCGCACG AAACTCTATT GGGGGCTGAT CGCCATTTTC 
CTGATCGGGG TCCTGTTCTC GCCGGTGACT TCGTCAGGCA GGAACATCTT TCTCTCCTCC
GGCAATCTCC TCGACGTCCT TCGGCAGGTA TCGACCACCG GGCTGATCGC CACCGGCATG
ACCGCGGTCA TTCTGACCGG CGGCATCGAC CTTTCGGTCG GCTCGCTGAT GGCGATCTGC
ACGGTGGTCT GTGCGATGCT GCTGACCGTC CCCGGCGACA CGGCAGCCAT CTATCTGGGG
CTGCCGGCCG TCGGGCTTGC CGTCCTAATC CTCGGTGCGG CGGTCGCCCG CTTCATCTTC
CTCAATATCG AGAAGTCGCG ATCCGGCGAG GCGCATATCC GGGATATCAA GCTTGGCGGG
GCGCGCGGTA CCGTCCTTCC CGCCATAGCA GGCGTCGTCC TCTGTGCACT CGTCTTGAGC
TTCCTCTTGC CTCAGATGCA GACGAAATTC GGCGTGTTCG GCGTTCTGCT CGTGGCGCCG
GCGGTGGGGC TGCTGTTCGG CGCCGTCAAC GGCGTTATCA TCGTGGCCGG ACGATTGCAG
CCCTTCATCG TCACACTTGC CATGATGGTA ACGGCGCTCG GCATTGCCCG GCTCACTGCC
GGACAGAACA ACGCCGTCCT GCCGGTCTAT ACCGGCAGCA ATGCCACGGC CGACTTCGAC
GTGCTGCGGC AGCTCTTATT CGGCATCGTG CCGATGCCCG GCATATTCTT CATCGTCGCG
ATTCTTCTCT ACGGCGCGGT GCTGCGCTTC ACGCCCTTCG GCCGCTACGT CTATGCGATC
GGCGGAAACG AGGAGGCGGC GCGCCTCTCC GGCATCAATG CCGGCCGGGT GAAGATCGTC
ACCTATGCGG TCTCGGGCCT TCTCGCGGGC ATCGCCGCGG TGCTCTATGT GGCGCAGTAC
CGCCAGGGCA AGCCGGATGC GGGCGCGGGG CTCGAACTGG ATGCGATCGC TGCAGTGGTC
ATTGGCGGAA CAAGTCTGAT GGGAGGGCGC GGGAGCCTTG CCGGAACGTT CTGCGGGGTC
CTGATCTTCG GTCTGCTCTC CAACATCCTG CAGCTTCACA ACATCAATTC CAATCTTCAG
CTGGTACTGA AAGGCGTCAT CATCATCGGC ACCGTGCTCG TTCAGGAGCG CAATGCCTAC
GATCTTTTCG CGCAGTTGCG GCTGCCGGGC GCAAGCCGGC GGCCGGTTCA GGGAGACACG
TCCGTGGAGG AGCGGACGTC GAGAGAGACC TTGTCTCTAA CAATGGGAGG AAAGAAAGAA
TGA
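
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before it can be measured. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its own GC content figure was derived.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# The first two blocks of the gene sequence, used as a small example.
first_blocks = clean_sequence("ATGACGTTTC TGAAGGCTCT")
print(len(first_blocks), gc_fraction(first_blocks))
```

Pasting the full gene sequence into `clean_sequence` in place of the two example blocks would give a value to compare against the GC content field.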
 
Protein sequence
MTFLKALSRT KLYWGLIAIF LIGVLFSPVT SSGRNIFLSS GNLLDVLRQV STTGLIATGM 
TAVILTGGID LSVGSLMAIC TVVCAMLLTV PGDTAAIYLG LPAVGLAVLI LGAAVARFIF
LNIEKSRSGE AHIRDIKLGG ARGTVLPAIA GVVLCALVLS FLLPQMQTKF GVFGVLLVAP
AVGLLFGAVN GVIIVAGRLQ PFIVTLAMMV TALGIARLTA GQNNAVLPVY TGSNATADFD
VLRQLLFGIV PMPGIFFIVA ILLYGAVLRF TPFGRYVYAI GGNEEAARLS GINAGRVKIV
TYAVSGLLAG IAAVLYVAQY RQGKPDAGAG LELDAIAAVV IGGTSLMGGR GSLAGTFCGV
LIFGLLSNIL QLHNINSNLQ LVLKGVIIIG TVLVQERNAY DLFAQLRLPG ASRRPVQGDT
SVEERTSRET LSLTMGGKKE
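
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation is given below, assuming the amino-acid assignments of NCBI table 11 (the same as the standard code), listed in TCAG codon order. It stops at the first stop codon and does not handle alternative start codons or ambiguous bases; `translate` is an illustrative helper, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first two blocks of the gene sequence give the first six residues.
print(translate("ATGACGTTTC TGAAGGCTCT"))  # prints: MTFLKA
```

The six residues produced here match the start of the protein sequence listed above (MTFLKALSRT ...).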