Gene ECH74115_2327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2327 
SymboluidB 
ID6971317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2198804 
End bp2200177 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content52% 
IMG OID643386203 
Productglucuronide transporter 
Protein accessionYP_002270687 
Protein GI209396395 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID[TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAC AACTCTCCTG GCGCACCATC GTCGGCTACA GCCTTGGTGA CATCGCCAAT 
AACTTCGCCT TCGCAATGGG GGCGCTCTTC CTGTTGAGTT ACTACACCGA CGTCGCTGGC
GTCGGTGCCG CTGCGGCGGG CACCATGCTG TTACTGGTGC GGGTATTCGA TGCCTTCGCC
GACGTCTTTG CCGGACGAGT GGTGGACAGT GTGAATACCC GCTGGGGAAA ATTCCGCCCG
TTTTTACTCT TCGGTACTGC GCCGTTAATG ATCTTCAGCG TGCTGGTATT CTGGGTGCCG
ACCGACTGGA GCCATGGTAG CAAAGTGGTG TATGCATATT TGACCTACAT GGGCCTCGGG
CTTTGCTACA GCCTGGTGAA TATTCCTTAT GGTTCACTTG CTACCGCGAT GACCCAACAA
CCACAATCCC GCGCCCGTCT GGGCGCGGCT CGTGGGATTG CCGCTTCATT GACCTTTGTC
TGCCTGGCAT TTCTGATAGG GCCGAGCATT AAGAACTCCA GCCCGGAAGA GATGGTGTCG
GTATACCATT TCTGGACGAT TGTGCTGGCG ATTGCCGGAA TGGTGCTTTA CTTCATCTGC
TTCAAATCGA CGCGTGAGAA TGTGGTACGT ATCGTGGCGC AGCCGTCATT GAAGATCAGT
CTGCAAACCC TGAAACGGAA TCGCCCGCTG TTTATGTTGT GCATCGGTGC GCTGTGTGTG
CTGATTTCGA CCTTCGCGGT CAGCGCCTCG TCGTTGTTCT ACGTGCGCTA TGTGTTAAAT
GATACCGGGC TGTTCACTGT GCTGGTACTG GTGCAAAACC TGGTTGGTAC TGTGGCATCG
GCACCGTTGG TGCCGGGGAT GGTCGCGAGG ATCGGTAAAA AGAATACCTT CCTGATTGGC
GCTTTGCTGG GAACCTGCGG TTATCTGCTG TTCTTCTGGG TTTCCGTCTG GTCGCTGCCG
GTGGCGTTGG TTGCGTTAGC CATTGCCTCA ATTGGTCAGG GCGTTACCAT GACCGTGATG
TGGGCGCTGG AAGCTGATAC CGTAGAATAC GGTGAATACC TGACCGGCGT GCGAATTGAA
GGGCTCACCT ATTCACTATT CTCATTTACC CGTAAATGCG GTCAGGCAAT CGGTGGTTCA
ATTCCTGCCT TTATTTTGGG ATTAAGCGGA TATATCGCCA ATCAGGTGCA AACGCCGGAA
GTAATTATGG GCATCCGCAC ATCAATTGCC TTAGTACCTT GCGGATTTAT GCTACTGGCA
TTCGTTATTA TCTGGTTTTA TCCGCTCACG GATAAAAAAT TCAAAGAAAT CGTGGTTGAA
ATTGATAATC GTAAAAAAGT GCAGCAGCAA TTAATCAGCG ATATCACTAA TTAA
 
Protein sequence
MNQQLSWRTI VGYSLGDIAN NFAFAMGALF LLSYYTDVAG VGAAAAGTML LLVRVFDAFA 
DVFAGRVVDS VNTRWGKFRP FLLFGTAPLM IFSVLVFWVP TDWSHGSKVV YAYLTYMGLG
LCYSLVNIPY GSLATAMTQQ PQSRARLGAA RGIAASLTFV CLAFLIGPSI KNSSPEEMVS
VYHFWTIVLA IAGMVLYFIC FKSTRENVVR IVAQPSLKIS LQTLKRNRPL FMLCIGALCV
LISTFAVSAS SLFYVRYVLN DTGLFTVLVL VQNLVGTVAS APLVPGMVAR IGKKNTFLIG
ALLGTCGYLL FFWVSVWSLP VALVALAIAS IGQGVTMTVM WALEADTVEY GEYLTGVRIE
GLTYSLFSFT RKCGQAIGGS IPAFILGLSG YIANQVQTPE VIMGIRTSIA LVPCGFMLLA
FVIIWFYPLT DKKFKEIVVE IDNRKKVQQQ LISDITN