Gene Smed_4266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4266 
Symbol 
ID5319017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp754573 
End bp755598 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content61% 
IMG OID640776071 
Productmonosaccharide-transporting ATPase 
Protein accessionYP_001313004 
Protein GI150376408 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.525278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGCA ATGTCGCAGC GCAGGGCACG AATTCGCCCC TCGTTCGCTC AAGACGGCGC 
ATGCCGCCCG AACTCAGCAT CTTTCTGGTG CTGATCGGTA TCGCGCTCGT CTACGAGATT
CTTGGCTGGC TCTTCGTCGG CCAGAGCTTT CTGATGAATA CGCAGCGTCT GACGATCATG
ATTCTGCAGG TCTCGGTTAT CGGCATTATC GCCGTGGGAG TCACCCAGGT CATCATCACT
GGCGGCATCG ACCTTTCGTC GGGCTCGGTC GTCGGCATGA CGGCGATGAT CTCGGCAAGC
GTCGCCCAGG CCTCCACATG GCCGAGGGCG CTTTATCCGT CGCTGACGGA CCTGCCGGCT
ATCATACCGA TCGGCCTCGG CGTCGGGATC GGCCTTCTCG CCGGCTTCAT TAATGGTCAG
CTGATCGCCA GAACCAAGAT CCCGCCCTTC ATTGCCACGC TGGGAATGAT GGTATCGGCT
CGCGGCGTCT CCAAGTGGTA CACGAAGGGC CAGCCGGTCT CCGGCCTCAC CGAGCAGTTC
AACTTCATCG GCACAGGCAT CTGGCCGGTT ATCGTCTTCC TCGTCGTCGC CCTTATATTT
CACATCGCGT TGCGCTACAC CCGTTACGGC AAGTTTACCT ATGCGATCGG CGCCAATGTG
CAGGCCGCGC GAGTCTCCGG CATCAATGTC GAAGCGCATC TGGTGAAGGT CTATGCGATC
GCCGGCATGC TCGCCGGTCT GGCTGGCGTG GTCACCGCCG CGCGCGCCCA GACGGCGCAG
GCCGGAATGG GGGTCATGTA TGAGCTCGAT GCGATCGCCG CGACCGTCAT CGGCGGCACT
TCGCTGACCG GGGGCGTCGG CCGCATCACC GGGACGGTGA TCGGCACGGT GATCCTCGGC
GTGATGACGT CCGGCTTCAC TTTCCTCAGG GTCGACGCCT ACTACCAGGA AATCGTCAAA
GGCATCATCA TCGTCGCTGC GGTGGTCGTC GACGTGTATC GTCAGAAAAG CCGGAAAAAA
GCGTAA
 
Protein sequence
MNSNVAAQGT NSPLVRSRRR MPPELSIFLV LIGIALVYEI LGWLFVGQSF LMNTQRLTIM 
ILQVSVIGII AVGVTQVIIT GGIDLSSGSV VGMTAMISAS VAQASTWPRA LYPSLTDLPA
IIPIGLGVGI GLLAGFINGQ LIARTKIPPF IATLGMMVSA RGVSKWYTKG QPVSGLTEQF
NFIGTGIWPV IVFLVVALIF HIALRYTRYG KFTYAIGANV QAARVSGINV EAHLVKVYAI
AGMLAGLAGV VTAARAQTAQ AGMGVMYELD AIAATVIGGT SLTGGVGRIT GTVIGTVILG
VMTSGFTFLR VDAYYQEIVK GIIIVAAVVV DVYRQKSRKK A