Gene EcSMS35_1583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1583 
SymboluidB 
ID6143046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1568407 
End bp1569780 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content52% 
IMG OID641616460 
Productglucuronide transporter 
Protein accessionYP_001743638 
Protein GI170683190 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID[TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAC AACTCTCCTG GCGCACCATC GTCGGCTACA GCCTCGGTGA CGTCGCCAAT 
AACTTCGCCT TCGCAATGGG GGCGCTCTTC CTGTTGAGTT ACTACACCGA CGTCGCTGGC
GTCGGTGCCG CTGCGGCGGG CACCATGCTG TTACTGGTGC GGGTATTCGA TGCCTTCGCC
GATGTCTTTG CCGGACGAGT GGTGGACAGT GTGAATACCC GTTGGGGAAA ATTCCGCCCG
TTTTTACTCT TCGGTACTGC GCCGTTAATG ATCTTCAGCG TGCTGGTATT CTGGGTACCG
ACCGACTGGA GCCATAGCAG CAAAGTTGTG TATGCATATT TGACCTACAT GGGACTCGGG
CTTTGCTACA GCCTGGTGAA TATTCCTTAT GGTTCACTTG CTACCGCGAT GACCCAACAA
CCACAATCCC GCGCCCGTCT GGGCGCGGCT CGCGGGATTG CCGCGTCATT GACCTTTGTC
TGCCTGGCAT TTCTGATAGG ACCGAGCATT AAGAACTCCA GCCCGGAAGA GATGGTGTCG
GTATACCATT TCTGGACGAT TGTGCTGGCG ATTGCCGGAA TGGTGCTTTA CTTCATCTGC
TTCAAATCGA CGCGTGAGAA TGTGGTACGT ATCGTGGCGC AGCCGTCATT GAAGATCAGT
CTGCAAACCC TGAAACGGAA TCGCCCGCTG TTTATGTTGT GCATCGGTGC GCTGTGTGTG
CTGATTTCGA CCTTCGCGGT CAGCGCCTCG TCGTTGTTCT ACGTGCGCTA TGTGTTAAAT
GATACCGGGC TGTTCACTGT GCTGGTACTG GTGCAAAACC TGGTCGGTAC TGTGGCATCG
GCACCGCTGG TGCCTGGGAT GGTCGCGAGG ATCGGTAAAA AGAATACCTT CCTGATTGGC
GCTTTGCTGG GAACCTGCGG TTATCTGCTG TTCTTCTGGG TTTCCGTCTG GTCACTGCCG
GTGGCGTTGG TTGCGTTGGC CATCGCATCA ATTGGCCAGG GCGTTACCAT GACCGTGATG
TGGGCACTGG AAGCTGATAC CGTAGAATAC GGTGAATACC TGACCGGCGT GCGAATTGAA
GGGCTCACCT ATTCACTATT CTCATTTACC CGTAAATGCG GTCAGGCAAT CGGAGGTTCA
ATTCCTGCCT TTATTTTGGG ATTAAGCGGA TATATCGCCA ATCAGGCGCA AACGACGGAA
GTTATTATGG GCATCCGCAC ATCAATTGCC TTAGTACCTT GCGGATTTAT GCTGCTGGCA
TTCGTCATTA TCTGGTTTTA TCCGCTCACG GATAAAAAAT TCAAAGAAAT CGTGGTTGAA
ATTGATAATC GTAAAAAAAT GCAGCAGCAA TTAATCAGCG ATATCATTAA TTAA
 
Protein sequence
MNQQLSWRTI VGYSLGDVAN NFAFAMGALF LLSYYTDVAG VGAAAAGTML LLVRVFDAFA 
DVFAGRVVDS VNTRWGKFRP FLLFGTAPLM IFSVLVFWVP TDWSHSSKVV YAYLTYMGLG
LCYSLVNIPY GSLATAMTQQ PQSRARLGAA RGIAASLTFV CLAFLIGPSI KNSSPEEMVS
VYHFWTIVLA IAGMVLYFIC FKSTRENVVR IVAQPSLKIS LQTLKRNRPL FMLCIGALCV
LISTFAVSAS SLFYVRYVLN DTGLFTVLVL VQNLVGTVAS APLVPGMVAR IGKKNTFLIG
ALLGTCGYLL FFWVSVWSLP VALVALAIAS IGQGVTMTVM WALEADTVEY GEYLTGVRIE
GLTYSLFSFT RKCGQAIGGS IPAFILGLSG YIANQAQTTE VIMGIRTSIA LVPCGFMLLA
FVIIWFYPLT DKKFKEIVVE IDNRKKMQQQ LISDIIN