Gene Smed_4738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4738 
Symbol 
ID5319106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1259709 
End bp1260713 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content62% 
IMG OID640776536 
Productmonosaccharide-transporting ATPase 
Protein accessionYP_001313468 
Protein GI150376872 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.419907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00245002 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGACGA TCGAGACACA GGCCACCGCG GGCACCGTCC CCTCCGGCAA ACGCTTCTTC 
ACACTGGCCG GACGATACGG CACTGTCGCC GCGTTCCTCG CACTGATCCT CTTCAACGTC
CTCTTCACTC CAAACTTCCT GTCTTTGCAG ACGCTCAACG TCAACCTGAC CCAGGTCGCC
ACGATCGTGA TCGTCGCCAC CGGCATGACA CTGGTGATTG CTACCGGCGG CATCGACCTT
TCGGTGGGTT CACTGATGGC GATAGGCGGT GCGCTTGCAC CGATGATCTT TATGGGCGCG
CTGTTTCCGG TTTCGTCCAT GCCCGTCGCC GTGGCACTTG CCTTTATTCT GCCGGTCATC
GCCACGGCGC TGCTCGGTTT GTTCAACGGG CTGCTGGTGA CTCGTTTCGC CATCCAGCCG
ATCATCGCCA CCCTCGTCCT GTTCATTGCC GGCCGCGGCA TCGCCCAGGT CATGACCAAC
GGCAACCTGC AGGTCTTCCG CAACGAAGGC TTCCAATTCA TAGCTCTCGG GCGCATTGCC
GGCATTCCCG CCCAGGTAAT TTTGATGATT GTGATTGCGG CGATCGCATG GGCGGCAGTT
CGCCACACGG TTTTTGGACG CCAGGTCATC GCGGTCGGGG GCAACGAGAA GGCAGCCCGG
CTGACCGGTA TCCCCGTGCA CCGCGTCAAA CTGCTCGTCT ATATGATCAG CGGCGCGCTT
GCCGGCGTGG CGGGCCTCAT CGTCGTCGCG CGGAATTCCG CAAGCGATGC AAACCTTGTC
GGCCTCGGCA TGGAACTAGA CGCAATCGCC GCCGTCGCCG TAGGCGGCAC GCTTCTGACC
GGCGGGCGCG CGAACATCAT GGGCACCTTG ATCGGCGCCC TGGTTATCCA GCTGGTGCGC
TACACCCTGC TTGCAAATGG TGTGCCCGAC GCGGCTGCGC TGATCGTCAA GGCTGCCCTG
ATCCTGCTTG CGGTATTCAT CCAGCAGCGT GCCGGAAAAC CGTGA
 
Protein sequence
MTTIETQATA GTVPSGKRFF TLAGRYGTVA AFLALILFNV LFTPNFLSLQ TLNVNLTQVA 
TIVIVATGMT LVIATGGIDL SVGSLMAIGG ALAPMIFMGA LFPVSSMPVA VALAFILPVI
ATALLGLFNG LLVTRFAIQP IIATLVLFIA GRGIAQVMTN GNLQVFRNEG FQFIALGRIA
GIPAQVILMI VIAAIAWAAV RHTVFGRQVI AVGGNEKAAR LTGIPVHRVK LLVYMISGAL
AGVAGLIVVA RNSASDANLV GLGMELDAIA AVAVGGTLLT GGRANIMGTL IGALVIQLVR
YTLLANGVPD AAALIVKAAL ILLAVFIQQR AGKP