Gene Smed_5887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5887 
Symbol 
ID5320189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp851017 
End bp852063 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content62% 
IMG OID640777582 
Productmonosaccharide-transporting ATPase 
Protein accessionYP_001314514 
Protein GI150377919 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0742654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.67986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATTG AACAGAGCAA ACCGGCGGCG AGCCAGGCGA TTCCGTCGTC AGCTGGCGTG 
CGGGAAGCAG CGATAAGATA CGGCTTTCTC GTGCTTTTGG CCGGGATGAT CCTCTATTTC
TCATTGGTCA CCGGTGGATT CGCCTCTCCG CAAAGCGCAG TTTTCATTCT GCAATCGGTC
TCGATCACAG GCATCCTCGC GCTCGGCGTA ACGGCGACCC TGGTCGTCGG CGGCTTCGAC
CTGTCGATAG GCTCCATCGC CACGACGGCG ATGATGGCCT CGTCCTACGT CATGGTCGTG
CTGGGTGGGG ATGCTTTGAC GGCGACCCTC GTGTGCTTCT CGATCGGGGT TCTCATCGGG
CTGATCAATG GCATCATTAT CGTCTACATG CGCGTGCCCG ACCTGCTCGC GACGCTCGGC
ATGATGTTCC TGCTGCTCGG CCTTCAGCGC ATCCCGACAG AGGGACGCTC GATCGCCGCC
GGCATGACCC TGCCCGACGG CACCGTTGCG CCCGGCACTT TCAGCCCTGC CTTTCTGGCG
CTCGGGCGTC ATCGCTTCGA TTTCGTCCTG CCAAATCTCG TGCCGGTCTC TGTCGTGGTC
CTGATTATTC TTGCGGTCGT TATCTGGTTC TTCCTCGAAT ATACGCGCTT CGGCCGGATG
ATGTACGCCG TGGGCTCGAA CGAACGTGCC GCCAGCCTCG CGGGCGCGCC GGTCAATGCT
TACAAAATCT GGGCCTATAT CATTTCCGGC GTCTTTGCCT CGATCGGCGG TATCCTGCTC
GCGGCCCGCC TCGGCCGCGG GGATATCGCC TCCGGCAACA ACCTGCTGCT GGACGCCGTC
GCCGCTGCGC TGATCGGTTT CGCCGTACTC GGTGCCACTA AGCCGAACGC CTTCGGCACG
GCCGTCGGCG CGCTCTTCGT CGGCATCCTG CTGCAGGGCC TGACGATGAT GAACGCGCCC
TACTACACCC AGGATTTCGT CAAGGGCGCG GTGCTGGTCA TTGCCCTGAT TTTCACCTTT
GCGCTCTCGA AAAGAGGCAG ACGCTGA
 
Protein sequence
MSIEQSKPAA SQAIPSSAGV REAAIRYGFL VLLAGMILYF SLVTGGFASP QSAVFILQSV 
SITGILALGV TATLVVGGFD LSIGSIATTA MMASSYVMVV LGGDALTATL VCFSIGVLIG
LINGIIIVYM RVPDLLATLG MMFLLLGLQR IPTEGRSIAA GMTLPDGTVA PGTFSPAFLA
LGRHRFDFVL PNLVPVSVVV LIILAVVIWF FLEYTRFGRM MYAVGSNERA ASLAGAPVNA
YKIWAYIISG VFASIGGILL AARLGRGDIA SGNNLLLDAV AAALIGFAVL GATKPNAFGT
AVGALFVGIL LQGLTMMNAP YYTQDFVKGA VLVIALIFTF ALSKRGRR