Gene ECH74115_3362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3362 
Symbol 
ID6970636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3096931 
End bp3101367 
Gene Length4437 bp 
Protein Length1478 aa 
Translation table11 
GC content54% 
IMG OID643387171 
Productalpha-2-macroglobulin family protein 
Protein accessionYP_002271634 
Protein GI209398434 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTTGCTG ACAGCAGTTT TAGCAGCAGT GAAGAGTCGA AAGTGCGACT GGAAGCGCCG 
GGGCGTGATT ATCGGCGCTA TCAGATGGAA GAGTACGGCG GCGTGGACGT TCGCCTGTAT
CGTATTCCTG ACCCGATGGC ATTTTTGCGC CAGCAGAAAA ACCTGCATCG CATTGTGGTG
CAACCGCAAT ATCTGGGCGA CGGGCTGAAC AATACGCTGA CCTGGCTGTG GGATAACTGG
TACGGCAAAT CTCGCCGCGT GATGCAGCGT ACTTTCTCTT CTCAGTCACG GCAGAATGTG
ACTCAGGCAT TACCCGAATT ACAGCTCGGC AATGCCATTA TTAAACCTTC CCGTTATGTA
CAGAACAACC AGTTTTCCCC GCTGAAAAAA TATCCCCTGG TGGAACAGTT CCGTTATCCA
CTATGGCAGG CTAAACCGGT CGAGCCGCAG CAAGGGGTAA AACTGGAAGG CGCATCCAGC
AATTTCATCT CGCCGCAGCC GGGTAACATT TATATTCCTC TCGGCCAACA AGAGCCGGGA
CTGTACCTCG TCGAGGCGAT GGTTGGTGGG TATCGGGCGA CGACGGTGGT GTTTGTTTCC
GATACCGTGG CGCTTAGCAA AGTGTCAGGC AAAGAGCTTC TGGTGTGGAC CGCGGGTAAA
AAACAGGGTG AAGCGAAGCC CGGCTCAGAG ATCTTGTGGA CTGACGGTCT TGGCGTGATG
ACCCGCGGTG TGACCGATGA CAGCGGTACC TTGCAGTTAC AACATATATC GCCAGAACGT
TCATACATTC TGGGTAAGGA TGCTGAAGGC GGCGTTTTTG TCTCCGAGAA CTTCTTCTAC
GAAAGCGAAA TCTACAACAC CCGCTTGTAT ATTTTTACCG ATCGCCCGCT ATATCGCGCA
GGCGATCGTG TCGATGTTAA AGTGATGGGC CGCGAGTTCC ACGATCCGTT GCATTCATCC
CCCATCGTCA GCGCCCCGGC GAAGCTTTCG GTGCTGGACG CTAACGGCAG TCTGTTGCAA
ACCGTCGATG TCACGCTGGA TGCGCGCAAT GGCGGGCAGG GAAGTTTCCG CCTGCCAGAA
AATGCCGTTG CCGGAGGTTA TGAGTTACGT CTTGCTTACC GCAATCAGGT CTATAGCAGC
AGTTTTCGCG TGGCAAACTA CATCAAGCCA CATTTCGAGA TTGGTTTAGC TCTCGACAAA
AAAGAGTTCA AAACTGGCGA AGCGGTCAGC GGCAAACTGC AACTTCTCTA CCCGGATGGC
GAGCCGGTAA AAAATGCCCG CGTGCAGTTA AGTTTGCGCG CTCAGCAATT ATCAATGGTC
GGTAACGATT TGCGTTATGC CGGACGTTTC CCCGTGTCGC TGGAAGGCAG CGAAACGGTG
TCCGACGCCA GCGGTCATGT GGCGTTAAAT CTCCCCGCCG CCGATAAACC GAGCCGCTAT
TTGTTAACCG TCTCCGCCAG TGACGGCGCG GCGTATCGCG TCACCACCAC CAAAGAGATC
CTCATTGAAC GCGGCCTGGC GCATTACTCT TTAAGTACCG CCGCACAATA CAGTAATAGC
GGCGAGTCGG TTGTGTTCCG TTATGCCGCG CTGGAATCTT CAAAACAGGT TCCTGTTACG
TATGAATGGT TGCGTCTCGA AGACCGCACG AGCCATAGCG GAGATCTACC GTCAGGCGGC
AAATCCTTTA CCGTCAATTT CGATAAACCT GGCAACTACA ATCTGACGTT ACGCGATAAA
GACGGCTTAA TTCTCGCCGG GTTAAGCCAT GCCGTCAGCG GTAAGGGCAG TATGTCGCAT
ACTGGTACGG TAGATATCGT GGCAGATAAA ACGCTGTACC AGCCTGGCGA AACCGCGAAG
ATGCTGATTA CCTTCCCGGA GCCAATTGAT GAAGCATTAT TGACGCTGGA ACGCGATCGC
GTTGAACAGC AGTCGCTGCT ATCGCATCCG GCAAACTGGC TGACGTTACA ACGTTTAAAC
GATACTCAAT ATGAAGCCCG TGTTCCAGTG AGCAATTCCT TTGCGCCTAA CATCACTTTT
TCGGTGCTGT ATACCCGTAA CGGTCAGTAC AGTTTTCAGA ACGCCGGGAT CAAAGTTGCC
GTTCCCCAGC TGGATATCCG GGTGAAAACG GACAAAACCC ATTACCAGCC TGGTGAACTG
GTCAATGTCG AATTAACCTC GTCGCTGAAA GGTAAACCTG TTTCTGCGCA GCTAACGGTA
GGCGTGGTCG ATGAAATGAT CTACGCGCTG CAACCAGAAA TCGCGCCGAA TATCGGCAAA
TTTTTCTATC CGCTGGGGCG TAACAATGTG CGTACCAGCT CCAGTCTGTC GTTTATCAGC
TACGACCAGG CACTCTCCAG CGAGCCGGTT GCGCCTGGCG CAACTAACCG CAGCGAGCGG
CGAGTAAAAA TGCTTGAACG TCCACGGCGT GAAGAGGTGG ATACCGCGGC ATGGATGCCG
TCACTCACAA CCGATAAACA AGGCAAAGCG TATTTCACGT TCCTGATGCC TGATTCGTTA
ACCCGCTGGC GTATCACCGC GCGTGGGATG AACGGCGACG GGCTGGTCGG GCAGGGGCGT
GCTTATCTGC GTTCGGAAAA AAATCTCTAC ATGAAGTGGA GTATGCCAAC GGTGTATCGC
GTGGGCGACA AACCGTCGGC AGGACTGTTT ATCTTCAGTC AGCAGGATAA CGAACCGGTG
GCGCTGGTGA CTAAATTTGC AGGCGCTGAG ATGCGCCAGA CGCTGACGCT GCACAAAGGG
GCGAATTATA TTTCGCTGGC GCAGAACATT CAGCAATCTG GCTTGTTAAG CGCAGAACTG
CAACAAAATG GGCAAGTGCA GGACAGCATT AGCACAAAAC TGTCTTTTGT GGATAACAGC
TGGCCCGTTG AACAGCAGAA AAATGTCATG CTCGGCGGTG GCGATAACGC GCTGATGTTG
CCCGAGCAGG CGAGCAATAT CCGGCTACAA AGTAGTGAAA CGCCGCAGGA GATTTTCCGC
AACAATCTTG ATGCGTTAGT CGATGAACCG TGGGGGGGGG TGATCAACAC CGGTAGCCGT
CTGATCCCGC TCAGTCTCGC CTGGCGTTCG CTTGCCGATC ATCAAAGTGC CGCCGCTAAC
GACATTCGTC AGATGATTCA GGATAACCGT CTGCGGCTGA TGCAACTGGC GGGGCCCGGA
GCGCGCTTTA CCTGGTGGGG TGAAGATGGC AATGGTGACG CCTTCCTTAC GGCATGGGCA
TGGTACGCCG ACTGGCAGGC CAGCCAGGCG CTCGGCGTAA CGCAACAACC GGAATACTGG
CAGCATATGC TCGACAGTTA TGCCGAGCAG GCAGATAACA TGCCGTTATT GCATCGGGCG
CTGGTGCTGG CGTGGGCACA GGAGATGAAT CTGCCGTGCA AAACGTTGTT GAAAGGGTTG
GATGAAGCTA TCGCCCGGCG CGGAACTAAA ACTGAAGATT TCTCTGAGGA AGACACCCGC
GATATCAATG ACAGCCTGAT CCTCGATACA CCGGAATCTC CACTGGCAGA TGCGGTGGCA
AACGTCTTAA CCATGACGTT GCTGAAAAAA GCGCAGTTGA AGTCCACGGT GATGCCACAG
GTTCAGCAAT ATGCGTGGGA TAAAGCGGTA AACAGCAATC AGCCGCTGGC ACACACGGTT
GTGCTGCTCA ATAGCGGGGG CGACGCTACC CAGGCGGCTG CTATTTTAAG TGGTTTGACC
GCTGAGCAAT CTACTATTGA GCGCGCGCTG GCCATGAACT GGCTGGCGAA ATATATGGCG
ACAATGCCTT CGGTTGTGTT GCCTGCGCCT GCGGGCGCAT GGGCCAAACA TAAGTTAACT
GGAGGGGGCG AATACTGGCG TTGGGTTGGT CAGGGCGTGC CGGACATTCT CTCTTTTGGT
GACGAATTAT CGCCGCAAAA TGTGCAGGTC CGCTGGCGTG AAGCGGCAAA AACGGCTCAA
CAAAGTAACA TTCCGGTGAC CGTTGAACGC CAATTGTATC GGCTTATCCC TGGTGAAGAA
GAGATGAGCT TTACTCTGCA ACCGGTGACC AGCAATGAGA TTGACAGCGA TGCGCTGTAT
CTCGATGAAA TTACGCTTAC CAGCGAGCAG GATGCAGTTC TGCGCTATGG TCAGGTGGAA
GTACCGCTCC CGCCGGGAGC CGACGTTGAG CGCACAACAT GGGGCATTTC AGTCAATAAA
CCCAACGCTG GAAAACAGCA GGGGCAATTG CTGGAAAAAG CGCGAAATGA AATGGGCGAA
CTGGCCTATA TGGTGCCGGT GAAAGAACTG ACGGGAACGG TCACTTTCCG CCATTTGCTG
CGCTTCTCGC AAAAAGGGCA ATTCGTTCTG CCTCCTGCTC GTTATGTGCG TTCCTATGCA
CCTGCACAGC AAAGTGTTGC GGCAGGGAGC GAATGGACCG GGATGCAGGT GAAATAA
 
Protein sequence
MLADSSFSSS EESKVRLEAP GRDYRRYQME EYGGVDVRLY RIPDPMAFLR QQKNLHRIVV 
QPQYLGDGLN NTLTWLWDNW YGKSRRVMQR TFSSQSRQNV TQALPELQLG NAIIKPSRYV
QNNQFSPLKK YPLVEQFRYP LWQAKPVEPQ QGVKLEGASS NFISPQPGNI YIPLGQQEPG
LYLVEAMVGG YRATTVVFVS DTVALSKVSG KELLVWTAGK KQGEAKPGSE ILWTDGLGVM
TRGVTDDSGT LQLQHISPER SYILGKDAEG GVFVSENFFY ESEIYNTRLY IFTDRPLYRA
GDRVDVKVMG REFHDPLHSS PIVSAPAKLS VLDANGSLLQ TVDVTLDARN GGQGSFRLPE
NAVAGGYELR LAYRNQVYSS SFRVANYIKP HFEIGLALDK KEFKTGEAVS GKLQLLYPDG
EPVKNARVQL SLRAQQLSMV GNDLRYAGRF PVSLEGSETV SDASGHVALN LPAADKPSRY
LLTVSASDGA AYRVTTTKEI LIERGLAHYS LSTAAQYSNS GESVVFRYAA LESSKQVPVT
YEWLRLEDRT SHSGDLPSGG KSFTVNFDKP GNYNLTLRDK DGLILAGLSH AVSGKGSMSH
TGTVDIVADK TLYQPGETAK MLITFPEPID EALLTLERDR VEQQSLLSHP ANWLTLQRLN
DTQYEARVPV SNSFAPNITF SVLYTRNGQY SFQNAGIKVA VPQLDIRVKT DKTHYQPGEL
VNVELTSSLK GKPVSAQLTV GVVDEMIYAL QPEIAPNIGK FFYPLGRNNV RTSSSLSFIS
YDQALSSEPV APGATNRSER RVKMLERPRR EEVDTAAWMP SLTTDKQGKA YFTFLMPDSL
TRWRITARGM NGDGLVGQGR AYLRSEKNLY MKWSMPTVYR VGDKPSAGLF IFSQQDNEPV
ALVTKFAGAE MRQTLTLHKG ANYISLAQNI QQSGLLSAEL QQNGQVQDSI STKLSFVDNS
WPVEQQKNVM LGGGDNALML PEQASNIRLQ SSETPQEIFR NNLDALVDEP WGGVINTGSR
LIPLSLAWRS LADHQSAAAN DIRQMIQDNR LRLMQLAGPG ARFTWWGEDG NGDAFLTAWA
WYADWQASQA LGVTQQPEYW QHMLDSYAEQ ADNMPLLHRA LVLAWAQEMN LPCKTLLKGL
DEAIARRGTK TEDFSEEDTR DINDSLILDT PESPLADAVA NVLTMTLLKK AQLKSTVMPQ
VQQYAWDKAV NSNQPLAHTV VLLNSGGDAT QAAAILSGLT AEQSTIERAL AMNWLAKYMA
TMPSVVLPAP AGAWAKHKLT GGGEYWRWVG QGVPDILSFG DELSPQNVQV RWREAAKTAQ
QSNIPVTVER QLYRLIPGEE EMSFTLQPVT SNEIDSDALY LDEITLTSEQ DAVLRYGQVE
VPLPPGADVE RTTWGISVNK PNAGKQQGQL LEKARNEMGE LAYMVPVKEL TGTVTFRHLL
RFSQKGQFVL PPARYVRSYA PAQQSVAAGS EWTGMQVK