Gene Information |
Locus tag | ECH74115_3362 |
Symbol | |
ID | 6970636 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3096931 |
End bp | 3101367 |
Gene Length | 4437 bp |
Protein Length | 1478 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643387171 |
Product | alpha-2-macroglobulin family protein |
Protein accession | YP_002271634 |
Protein GI | 209398434 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
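The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 3096931
END_BP = 3101367
GENE_LENGTH_BP = 4437
PROTEIN_LENGTH_AA = 1478


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the table: the span covers 4437 bp, which is 1479 codons, leaving 1478 residues once the stop codon is excluded.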
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTTGCTG ACAGCAGTTT TAGCAGCAGT GAAGAGTCGA AAGTGCGACT GGAAGCGCCG GGGCGTGATT ATCGGCGCTA TCAGATGGAA GAGTACGGCG GCGTGGACGT TCGCCTGTAT CGTATTCCTG ACCCGATGGC ATTTTTGCGC CAGCAGAAAA ACCTGCATCG CATTGTGGTG CAACCGCAAT ATCTGGGCGA CGGGCTGAAC AATACGCTGA CCTGGCTGTG GGATAACTGG TACGGCAAAT CTCGCCGCGT GATGCAGCGT ACTTTCTCTT CTCAGTCACG GCAGAATGTG ACTCAGGCAT TACCCGAATT ACAGCTCGGC AATGCCATTA TTAAACCTTC CCGTTATGTA CAGAACAACC AGTTTTCCCC GCTGAAAAAA TATCCCCTGG TGGAACAGTT CCGTTATCCA CTATGGCAGG CTAAACCGGT CGAGCCGCAG CAAGGGGTAA AACTGGAAGG CGCATCCAGC AATTTCATCT CGCCGCAGCC GGGTAACATT TATATTCCTC TCGGCCAACA AGAGCCGGGA CTGTACCTCG TCGAGGCGAT GGTTGGTGGG TATCGGGCGA CGACGGTGGT GTTTGTTTCC GATACCGTGG CGCTTAGCAA AGTGTCAGGC AAAGAGCTTC TGGTGTGGAC CGCGGGTAAA AAACAGGGTG AAGCGAAGCC CGGCTCAGAG ATCTTGTGGA CTGACGGTCT TGGCGTGATG ACCCGCGGTG TGACCGATGA CAGCGGTACC TTGCAGTTAC AACATATATC GCCAGAACGT TCATACATTC TGGGTAAGGA TGCTGAAGGC GGCGTTTTTG TCTCCGAGAA CTTCTTCTAC GAAAGCGAAA TCTACAACAC CCGCTTGTAT ATTTTTACCG ATCGCCCGCT ATATCGCGCA GGCGATCGTG TCGATGTTAA AGTGATGGGC CGCGAGTTCC ACGATCCGTT GCATTCATCC CCCATCGTCA GCGCCCCGGC GAAGCTTTCG GTGCTGGACG CTAACGGCAG TCTGTTGCAA ACCGTCGATG TCACGCTGGA TGCGCGCAAT GGCGGGCAGG GAAGTTTCCG CCTGCCAGAA AATGCCGTTG CCGGAGGTTA TGAGTTACGT CTTGCTTACC GCAATCAGGT CTATAGCAGC AGTTTTCGCG TGGCAAACTA CATCAAGCCA CATTTCGAGA TTGGTTTAGC TCTCGACAAA AAAGAGTTCA AAACTGGCGA AGCGGTCAGC GGCAAACTGC AACTTCTCTA CCCGGATGGC GAGCCGGTAA AAAATGCCCG CGTGCAGTTA AGTTTGCGCG CTCAGCAATT ATCAATGGTC GGTAACGATT TGCGTTATGC CGGACGTTTC CCCGTGTCGC TGGAAGGCAG CGAAACGGTG TCCGACGCCA GCGGTCATGT GGCGTTAAAT CTCCCCGCCG CCGATAAACC GAGCCGCTAT TTGTTAACCG TCTCCGCCAG TGACGGCGCG GCGTATCGCG TCACCACCAC CAAAGAGATC CTCATTGAAC GCGGCCTGGC GCATTACTCT TTAAGTACCG CCGCACAATA CAGTAATAGC GGCGAGTCGG TTGTGTTCCG TTATGCCGCG CTGGAATCTT CAAAACAGGT TCCTGTTACG TATGAATGGT TGCGTCTCGA AGACCGCACG AGCCATAGCG GAGATCTACC GTCAGGCGGC AAATCCTTTA CCGTCAATTT CGATAAACCT GGCAACTACA ATCTGACGTT ACGCGATAAA GACGGCTTAA TTCTCGCCGG GTTAAGCCAT GCCGTCAGCG GTAAGGGCAG TATGTCGCAT 
ACTGGTACGG TAGATATCGT GGCAGATAAA ACGCTGTACC AGCCTGGCGA AACCGCGAAG ATGCTGATTA CCTTCCCGGA GCCAATTGAT GAAGCATTAT TGACGCTGGA ACGCGATCGC GTTGAACAGC AGTCGCTGCT ATCGCATCCG GCAAACTGGC TGACGTTACA ACGTTTAAAC GATACTCAAT ATGAAGCCCG TGTTCCAGTG AGCAATTCCT TTGCGCCTAA CATCACTTTT TCGGTGCTGT ATACCCGTAA CGGTCAGTAC AGTTTTCAGA ACGCCGGGAT CAAAGTTGCC GTTCCCCAGC TGGATATCCG GGTGAAAACG GACAAAACCC ATTACCAGCC TGGTGAACTG GTCAATGTCG AATTAACCTC GTCGCTGAAA GGTAAACCTG TTTCTGCGCA GCTAACGGTA GGCGTGGTCG ATGAAATGAT CTACGCGCTG CAACCAGAAA TCGCGCCGAA TATCGGCAAA TTTTTCTATC CGCTGGGGCG TAACAATGTG CGTACCAGCT CCAGTCTGTC GTTTATCAGC TACGACCAGG CACTCTCCAG CGAGCCGGTT GCGCCTGGCG CAACTAACCG CAGCGAGCGG CGAGTAAAAA TGCTTGAACG TCCACGGCGT GAAGAGGTGG ATACCGCGGC ATGGATGCCG TCACTCACAA CCGATAAACA AGGCAAAGCG TATTTCACGT TCCTGATGCC TGATTCGTTA ACCCGCTGGC GTATCACCGC GCGTGGGATG AACGGCGACG GGCTGGTCGG GCAGGGGCGT GCTTATCTGC GTTCGGAAAA AAATCTCTAC ATGAAGTGGA GTATGCCAAC GGTGTATCGC GTGGGCGACA AACCGTCGGC AGGACTGTTT ATCTTCAGTC AGCAGGATAA CGAACCGGTG GCGCTGGTGA CTAAATTTGC AGGCGCTGAG ATGCGCCAGA CGCTGACGCT GCACAAAGGG GCGAATTATA TTTCGCTGGC GCAGAACATT CAGCAATCTG GCTTGTTAAG CGCAGAACTG CAACAAAATG GGCAAGTGCA GGACAGCATT AGCACAAAAC TGTCTTTTGT GGATAACAGC TGGCCCGTTG AACAGCAGAA AAATGTCATG CTCGGCGGTG GCGATAACGC GCTGATGTTG CCCGAGCAGG CGAGCAATAT CCGGCTACAA AGTAGTGAAA CGCCGCAGGA GATTTTCCGC AACAATCTTG ATGCGTTAGT CGATGAACCG TGGGGGGGGG TGATCAACAC CGGTAGCCGT CTGATCCCGC TCAGTCTCGC CTGGCGTTCG CTTGCCGATC ATCAAAGTGC CGCCGCTAAC GACATTCGTC AGATGATTCA GGATAACCGT CTGCGGCTGA TGCAACTGGC GGGGCCCGGA GCGCGCTTTA CCTGGTGGGG TGAAGATGGC AATGGTGACG CCTTCCTTAC GGCATGGGCA TGGTACGCCG ACTGGCAGGC CAGCCAGGCG CTCGGCGTAA CGCAACAACC GGAATACTGG CAGCATATGC TCGACAGTTA TGCCGAGCAG GCAGATAACA TGCCGTTATT GCATCGGGCG CTGGTGCTGG CGTGGGCACA GGAGATGAAT CTGCCGTGCA AAACGTTGTT GAAAGGGTTG GATGAAGCTA TCGCCCGGCG CGGAACTAAA ACTGAAGATT TCTCTGAGGA AGACACCCGC GATATCAATG ACAGCCTGAT CCTCGATACA CCGGAATCTC CACTGGCAGA TGCGGTGGCA AACGTCTTAA CCATGACGTT GCTGAAAAAA GCGCAGTTGA AGTCCACGGT GATGCCACAG GTTCAGCAAT 
ATGCGTGGGA TAAAGCGGTA AACAGCAATC AGCCGCTGGC ACACACGGTT GTGCTGCTCA ATAGCGGGGG CGACGCTACC CAGGCGGCTG CTATTTTAAG TGGTTTGACC GCTGAGCAAT CTACTATTGA GCGCGCGCTG GCCATGAACT GGCTGGCGAA ATATATGGCG ACAATGCCTT CGGTTGTGTT GCCTGCGCCT GCGGGCGCAT GGGCCAAACA TAAGTTAACT GGAGGGGGCG AATACTGGCG TTGGGTTGGT CAGGGCGTGC CGGACATTCT CTCTTTTGGT GACGAATTAT CGCCGCAAAA TGTGCAGGTC CGCTGGCGTG AAGCGGCAAA AACGGCTCAA CAAAGTAACA TTCCGGTGAC CGTTGAACGC CAATTGTATC GGCTTATCCC TGGTGAAGAA GAGATGAGCT TTACTCTGCA ACCGGTGACC AGCAATGAGA TTGACAGCGA TGCGCTGTAT CTCGATGAAA TTACGCTTAC CAGCGAGCAG GATGCAGTTC TGCGCTATGG TCAGGTGGAA GTACCGCTCC CGCCGGGAGC CGACGTTGAG CGCACAACAT GGGGCATTTC AGTCAATAAA CCCAACGCTG GAAAACAGCA GGGGCAATTG CTGGAAAAAG CGCGAAATGA AATGGGCGAA CTGGCCTATA TGGTGCCGGT GAAAGAACTG ACGGGAACGG TCACTTTCCG CCATTTGCTG CGCTTCTCGC AAAAAGGGCA ATTCGTTCTG CCTCCTGCTC GTTATGTGCG TTCCTATGCA CCTGCACAGC AAAGTGTTGC GGCAGGGAGC GAATGGACCG GGATGCAGGT GAAATAA
|
Protein sequence | MLADSSFSSS EESKVRLEAP GRDYRRYQME EYGGVDVRLY RIPDPMAFLR QQKNLHRIVV QPQYLGDGLN NTLTWLWDNW YGKSRRVMQR TFSSQSRQNV TQALPELQLG NAIIKPSRYV QNNQFSPLKK YPLVEQFRYP LWQAKPVEPQ QGVKLEGASS NFISPQPGNI YIPLGQQEPG LYLVEAMVGG YRATTVVFVS DTVALSKVSG KELLVWTAGK KQGEAKPGSE ILWTDGLGVM TRGVTDDSGT LQLQHISPER SYILGKDAEG GVFVSENFFY ESEIYNTRLY IFTDRPLYRA GDRVDVKVMG REFHDPLHSS PIVSAPAKLS VLDANGSLLQ TVDVTLDARN GGQGSFRLPE NAVAGGYELR LAYRNQVYSS SFRVANYIKP HFEIGLALDK KEFKTGEAVS GKLQLLYPDG EPVKNARVQL SLRAQQLSMV GNDLRYAGRF PVSLEGSETV SDASGHVALN LPAADKPSRY LLTVSASDGA AYRVTTTKEI LIERGLAHYS LSTAAQYSNS GESVVFRYAA LESSKQVPVT YEWLRLEDRT SHSGDLPSGG KSFTVNFDKP GNYNLTLRDK DGLILAGLSH AVSGKGSMSH TGTVDIVADK TLYQPGETAK MLITFPEPID EALLTLERDR VEQQSLLSHP ANWLTLQRLN DTQYEARVPV SNSFAPNITF SVLYTRNGQY SFQNAGIKVA VPQLDIRVKT DKTHYQPGEL VNVELTSSLK GKPVSAQLTV GVVDEMIYAL QPEIAPNIGK FFYPLGRNNV RTSSSLSFIS YDQALSSEPV APGATNRSER RVKMLERPRR EEVDTAAWMP SLTTDKQGKA YFTFLMPDSL TRWRITARGM NGDGLVGQGR AYLRSEKNLY MKWSMPTVYR VGDKPSAGLF IFSQQDNEPV ALVTKFAGAE MRQTLTLHKG ANYISLAQNI QQSGLLSAEL QQNGQVQDSI STKLSFVDNS WPVEQQKNVM LGGGDNALML PEQASNIRLQ SSETPQEIFR NNLDALVDEP WGGVINTGSR LIPLSLAWRS LADHQSAAAN DIRQMIQDNR LRLMQLAGPG ARFTWWGEDG NGDAFLTAWA WYADWQASQA LGVTQQPEYW QHMLDSYAEQ ADNMPLLHRA LVLAWAQEMN LPCKTLLKGL DEAIARRGTK TEDFSEEDTR DINDSLILDT PESPLADAVA NVLTMTLLKK AQLKSTVMPQ VQQYAWDKAV NSNQPLAHTV VLLNSGGDAT QAAAILSGLT AEQSTIERAL AMNWLAKYMA TMPSVVLPAP AGAWAKHKLT GGGEYWRWVG QGVPDILSFG DELSPQNVQV RWREAAKTAQ QSNIPVTVER QLYRLIPGEE EMSFTLQPVT SNEIDSDALY LDEITLTSEQ DAVLRYGQVE VPLPPGADVE RTTWGISVNK PNAGKQQGQL LEKARNEMGE LAYMVPVKEL TGTVTFRHLL RFSQKGQFVL PPARYVRSYA PAQQSVAAGS EWTGMQVK
|
| |
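The gene sequence above is displayed in space-separated blocks of 10 bases, begins with TTG, and ends with TAA, while the protein begins with M. In NCBI translation table 11, TTG is one of the permitted initiator codons and is read as methionine at the start of a CDS. The sketch below shows one way to normalise such a display-formatted sequence and run basic sanity checks. The helper names are hypothetical, and the start codon set is the one listed for table 11, written out here by hand rather than taken from a library.

```python
# Initiator codons listed for NCBI translation table 11 (written out by hand).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove the display spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq):
    """True if the length is a multiple of 3 and the ends are start/stop codons."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in TABLE11_STARTS
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First two display blocks of the gene sequence, used only as a short demo.
    head = clean_sequence("TTGCTTGCTG ACAGCAGTTT")
    print(head)
    print(head[:3] in TABLE11_STARTS)
```

To check the full record, paste the complete gene sequence into `clean_sequence` and pass the result to `looks_like_cds` and `gc_fraction`; the result of the latter can then be compared with the GC content field in the table.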