Gene ECH74115_0416 details


Gene Information

Locus tag: ECH74115_0416
Symbol: lacY
ID: 6970848
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 422301
End bp: 423554
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 46%
IMG OID: 643384468
Product: galactoside permease
Protein accession: YP_002268982
Protein GI: 209400635
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID: [TIGR00882] oligosaccharide:H+ symporter
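
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 422301
end_bp = 423554
gene_length_bp = 1254
protein_length_aa = 417

# Assumption: coordinates are 1-based and inclusive, so the span
# is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final
# codon is a stop codon that contributes no amino acid.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions both checks agree with the listed values: the span is 1254 bp, which is 418 codons, or 417 residues plus one stop codon.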


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.64566
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACTATT TAAAAAACAC AAACTTTTGG ATGTTCGGTT TATTCTTTTT CTTTTACTTT 
TTTATCATGG GAGCCTACTT CCCGTTTTTC CCGATTTGGC TACATGACAT CAACCATATC
AGCAAAAGTG ATACGGGTAT TATTTTTGCT GCTATTTCTC TGTTCTCGCT ATTATTCCAA
CCGCTGTTTG GTCTGCTTTC TGACAAACTC GGGCTGCGCA AATACCTGCT GTGGATTATT
ACCGGCATGT TAGTGATGTT TGCGCCGTTC TTTATTTTTA TCTTCGGGCC ACTGTTACAA
TACAACATTT TAGTAGGATC GATTGTTGGT GGTATTTATC TTGGCTTTTG TTTTAACGCC
GGTGCGCCCG CAGTAGAGGC ATTTATCGAG AAAGTCAGCC GTCGCAGTAA TTTCGAATTT
GGTCGCGCGC GGATGTTTGG CTGTGTTGGC TGGGCGCTGT GTGCCTCGAT TGTCGGCATC
ATGTTCACCA TCAATAATCA GTTCGTTTTC TGGCTGGGTT CTGGCTGTGC ACTCATCCTC
GCCATTTTAC TCTTTTTCGC CAAAACGGAT GCGCCCTCTT CCGCCACGGT TGCCAATGCG
GTAGGTGCCA ACCATTCGGC ATTTAGCCTT AAACTGGCGC TGGAACTGTT CAGACAGCCA
AAACTGTGGT TTTTGTCACT GTATGTTATT GGCGTTTCCT GCACCTACGA TGTTTTTGAC
CAACAGTTTG CTAATTTCTT TACTTCTTTC TTTGCCACCG GTGAACAGGG TACGCGGGTA
TTTGGCTACG TAACGACAAT GGGCGAATTA CTTAACGCCT CAATTATGTT CTTTGCGCCA
CTGATCATTA ATCGCATCGG TGGGAAAAAT GCCCTGCTGC TGGCTGGCAC TATTATGTCT
GTACGTATTA TTGGCTCATC GTTCGCCACC TCAGCGCTGG AAGTGGTTAT TCTGAAAACG
CTGCATATGT TTGAAGTACC GTTCCTGCTG GTGGGCTGCT TTAAATATAT TACCAGCCAG
TTTGAAGTGC GTTTTTCAGC GACGATTTAT CTGGTCTGTT TCTGCTTCTT TAAGCAACTG
GCGATGATTT TTATGTCTGT ACTAGCGGGT AATATGTATG AAAGCATCGG TTTCCAGGGC
GCTTATCTGG TGCTGGGTCT GGTGGCGCTG GGCTTCACCT TAATTTCCGT GTTCACGCTT
AGCGGCCCCG GCCCGCTTTC TCTACTGCGT CGTCAGGTGA ATGAAGTCGC TTAA
 
Protein sequence
MYYLKNTNFW MFGLFFFFYF FIMGAYFPFF PIWLHDINHI SKSDTGIIFA AISLFSLLFQ 
PLFGLLSDKL GLRKYLLWII TGMLVMFAPF FIFIFGPLLQ YNILVGSIVG GIYLGFCFNA
GAPAVEAFIE KVSRRSNFEF GRARMFGCVG WALCASIVGI MFTINNQFVF WLGSGCALIL
AILLFFAKTD APSSATVANA VGANHSAFSL KLALELFRQP KLWFLSLYVI GVSCTYDVFD
QQFANFFTSF FATGEQGTRV FGYVTTMGEL LNASIMFFAP LIINRIGGKN ALLLAGTIMS
VRIIGSSFAT SALEVVILKT LHMFEVPFLL VGCFKYITSQ FEVRFSATIY LVCFCFFKQL
AMIFMSVLAG NMYESIGFQG AYLVLGLVAL GFTLISVFTL SGPGPLSLLR RQVNEVA
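
The sequence listings above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper added for convenience. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here to keep the example short.

```python
def clean_sequence(block_text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no argument splits on any whitespace,
    # including the spaces between blocks and the line breaks.
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listing, copied as printed.
first_line = "ATGTACTATT TAAAAAACAC AAACTTTTGG ATGTTCGGTT TATTCTTTTT CTTTTACTTT"
last_line = "AGCGGCCCCG GCCCGCTTTC TCTACTGCGT CGTCAGGTGA ATGAAGTCGC TTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# A complete CDS is expected to open with a start codon and
# close with a stop codon.
print("starts with ATG:", head.startswith("ATG"))
print("ends with TAA:", tail.endswith("TAA"))
print("bases in first and last lines:", len(head), len(tail))
```

Passing the full gene sequence listing to `clean_sequence` and then to `gc_fraction` would give a value that can be compared with the GC content field in the record.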