Gene ECH74115_4394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4394 
Symbol 
ID6967597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4071502 
End bp4073853 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content55% 
IMG OID643388116 
Productputative glycosyl hydrolase 
Protein accessionYP_002272553 
Protein GI209396102 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA AAACTATTTT AACGCCAGTA ACCTGCGCTC TGCTGATAAG TTTTTCCGCC 
CATGCCACTA ACGCCGACAA TTATAAAAAC GTGATTAACC GTACTGGTGC GCCGCAGTAC
ATGAAGGATT ACGATTACGA CGATCACCAG CGTTTTAATC CGTTTTTCGA TCTCGGAGCC
TGGCATGGTC ATCTGTTGCC AGACGGCCCG AACACCATGG GCGGCTTTCC GGGCGTTGCG
CTGCTGACGG AAGAATACAT CAACTTTATG GCCAGCAATT TCGACCGCCT GACCGTCTGG
CAGGACGGCA AGAAAGTCAA CTTCACGCTG GAGGCATACA ATATTCCCGG CGCGCTGGTG
CAAAAACTGA CAGCAAAAGA TGTGCAGGTC GAAATGACTC TGCGCTTCGC CACGCCGCGC
ACGTCACTAC TGGAAACTAA AATCATCAGC AATAAACCGC TGGATCTGGT GTGGGATGGC
GAACTGCTGG AAAAACTGGA AGCGAAAGAA GGGAAACCGC TTTCCGATAA AACCATTGCT
GGCGAATACC CTGACTACCA GCGCAAAATC AGCGCCACCC GTGATGGCCT GAAAGTCACC
TTTGGCAAAG TGCGCGCCAC CTGGGATCTG CTGACCTCCG GTGAATCAGA ATATCAGGTG
CATAAATCCC TGCCGGTGCA GACTGAAATC AACGGCAATC GCTTTACCAG TAAGGCGCAT
ATCAACGGTT CGACGACGCT CTATACCACC TATTCCCATC TGCTGACCTC TCAGGAAGTT
AGCAAAGAGC AAATGCAGAT CCGCGATATT CTGGCGCGTC CGGCGTTTTA TCTCACCGCC
TCGCAGCAAC GCTGGGAAGA GTATCTGAAG AAAGGGTTAA CCAATCCGGA TGCGACGCCG
GAACAGACAC GCGTCGCGGT GAAAGCGATC GAAACGCTCA ACGGTAACTG GCGCTCGCCT
GGGGGTGCGG TGAAATATAA CACCGTCACG CCGTCGGTGA CCGGGCGCTG GTTCTCCGGC
AATCAGACCT GGCCGTGGGA TACCTGGAAG CAGGCGTTTG CGATGGCGCA TTTCAATCCG
GACATCGCCA AAGAGAATAT CCGCGCGGTC TTCTCCTGGC AGATCCAGCC TGGCGACAGC
GTGCGTCCGC AGGATGTGGG CTTTGTCCCC GACCTGATAG CGTGGAATCT TAGCCCCGAG
CGTGGTGGCG ATGGTGGCAA CTGGAACGAA CGAAATACCA AACCCAGCCT TGCCGCCTGG
TCGGTGATGG AAGTGTACAA CGTCACCCAG GATAAAACCT GGGTGGCAGA GATGTACCCG
AAACTGGTGG CCTATCACGA CTGGTGGTTA CGTAACCGCG ATCACAACGG CAACGGCGTG
CCGGAATATG GCGCGACCCG CGACAAAGCC CACAACACTG AGAGCGGCGA GATGCTGTTT
ACGGTGAAGA AAGGCAACAA AGAAGAGACG CAGTCTGGGC TGAACAACTA CGCCCGCGTG
GTGGAGAAAG GCCAGTACGA CAGTCTGGAA ATTCCGGCAC AGGTTGCTGC ATCGTGGGAA
TCGGGGCGTG ATGATGCCGC CGTCTTTGGG TTTATCGACA AAGAACAGCT GGATAAATAT
GTCGCAAGCG GCGGCAAACG TAGCGACTGG ACAGTGAAAT TCGCCGAAAA CCGCAGTCAG
GACGGAACGT TGCTGGGGTA CTCGCTATTG CAGGAGTCGG TGGATCAGGC CAGCTATATG
TACAGCGATA ACCATTATCT GGCGGAGATG GCGACGATTC TCGGTAAACC GGAAGAGGCT
AAGCGCTATC GCCAGTTGGC ACAGCAGCTC GCGGACTACA TCAACACCTG TATGTTCGAC
CCGGCTACGC AGTACTTCTA TGACGTGCGT ATTGAAGATA AACCGCTGGC GAACGGCTGC
GCGGGCAAAC CGATTGTTGA GCGCGGTAAA GGGCCGGAAG GCTGGTCGCC GCTGTTTAAC
GGTGCGGCAA CGCAGGCCAA TGCCGACGCG GTGGTGAAGG TGATGCTCGA TCCTAAAGAG
TTCAATACCT TTGTTCCGCT GGGAACGGCG GCGTTAACCA ATCCGGCCTT TGGCGCTGAT
ATCTACTGGC GCGGGCGCGT ATGGGTGGAT CAGTTCTGGT TTGGTCTGAA AGGGATGGAG
CGTTACGGTT ATCGCGATGA TGCCCTGAAA CTGGCGGATA CGTTCTTCCG GCACGCCAAA
GGGTTAACCG CCGATGGCCC AATTCAGGAA AATTACAACC CGCTGACAGG CGCACAGCAA
GGCGCACCAA ATTTCTCCTG GAGTGCCGCA CATTTTTATA TGTTGTATAA CGATTTTTTC
CGTAAGCAAT AA
 
Protein sequence
MKIKTILTPV TCALLISFSA HATNADNYKN VINRTGAPQY MKDYDYDDHQ RFNPFFDLGA 
WHGHLLPDGP NTMGGFPGVA LLTEEYINFM ASNFDRLTVW QDGKKVNFTL EAYNIPGALV
QKLTAKDVQV EMTLRFATPR TSLLETKIIS NKPLDLVWDG ELLEKLEAKE GKPLSDKTIA
GEYPDYQRKI SATRDGLKVT FGKVRATWDL LTSGESEYQV HKSLPVQTEI NGNRFTSKAH
INGSTTLYTT YSHLLTSQEV SKEQMQIRDI LARPAFYLTA SQQRWEEYLK KGLTNPDATP
EQTRVAVKAI ETLNGNWRSP GGAVKYNTVT PSVTGRWFSG NQTWPWDTWK QAFAMAHFNP
DIAKENIRAV FSWQIQPGDS VRPQDVGFVP DLIAWNLSPE RGGDGGNWNE RNTKPSLAAW
SVMEVYNVTQ DKTWVAEMYP KLVAYHDWWL RNRDHNGNGV PEYGATRDKA HNTESGEMLF
TVKKGNKEET QSGLNNYARV VEKGQYDSLE IPAQVAASWE SGRDDAAVFG FIDKEQLDKY
VASGGKRSDW TVKFAENRSQ DGTLLGYSLL QESVDQASYM YSDNHYLAEM ATILGKPEEA
KRYRQLAQQL ADYINTCMFD PATQYFYDVR IEDKPLANGC AGKPIVERGK GPEGWSPLFN
GAATQANADA VVKVMLDPKE FNTFVPLGTA ALTNPAFGAD IYWRGRVWVD QFWFGLKGME
RYGYRDDALK LADTFFRHAK GLTADGPIQE NYNPLTGAQQ GAPNFSWSAA HFYMLYNDFF
RKQ