Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3749 |
Symbol | |
ID | 6967571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3472415 |
End bp | 3477376 |
Gene Length | 4962 bp |
Protein Length | 1653 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387541 |
Product | alpha-2-macroglobulin domain protein |
Protein accession | YP_002271994 |
Protein GI | 209400655 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGT TACGCGTAGC CGCCTGCATG CTAATGCTGG CGCTGGCAGG GTGCGACAAC AACGATAACG CGCCAACAGC GGTGAAAAAA GATGCGCCTT CTGAAGTTAC TAAAGCGGCC TCTTCAGAAA ACGCGAGTTC AGCAAAACTC TCCGCGCCAG AGCGACAAAA ACTGGCCCAA CAGAGTGCCG GTAAGGCGCT GACATTGCTG GATCTCTCTG AAGTCCAACT TGATGGTGCA GCCACGCTGG TGCTGACGTT CTCCATCCCT CTCGACCCGG ATCAGGATTT CTCACGCGTT ATTCATGTCG TCGATAAAAA AAGCGGCAAA GTGGATGGTG CCTGGGAGCT ATCAGATAAT CTTAAAGAGC TGCGTTTACG CCACCTCGAA CCGAAACGTG ATTTGATCGT TACAATTGGC AAGGAGGTCA AAGCACTCAA CAACGCAACC TTCAGTAAAG ATTACGAAAA AACTATAACT ACCCGCGACA TCCAACCCAG TGTCGGTTTT GCCAGCCGTG GTTCGCTGCT GCCTGGTAAA GTCGTTGAAG GGCTGCCGGT AATGGCGCTC AACGTTAATA ATGTCGATGT TAACTTCTTC CGCGTTAAGC CAGAATCTCT GCCAGCATTC ATTAGCCAAT GGGAATACCG CAATTCGCTG GCGAACTGGC AGTCAGACAA ACTGCTGCAG ATGGCGGATC TGGTCTACAC CGGACGGTTT GATCTCAATC CTGCGCGTAA CACCCGTGAA AAATTATTGC TGCCGCTGGG CGATATCAAA CCGCTTCAGC AGGCGGGCGT GTATCTGGCT GTGATGAATC AGGCTGGACG TTACGATTAC AGTAATCCCG CGACGCTGTT TACGTTAAGT GATATCGGCG TTTCAGCTCA CCGTTATCAC AATCGTCTGG ATATCTTTAC CCAAAGTCTG GAAAACGGCG CGGCCCAGCA AGGAATTGAA GTCTCTTTAT TAAATGAGAA AGGGCAGACT CTGACTCAGG CAACCAGTGA CGCTCAGGGG CATGTGCAGC TGGAAAATGA TAAAAACGCC GCATTATTGC TGGCACGTAA AGACGGTCAG ACAACGCTAC TCGATTTAAA ACTTCCGGCG CTGGACTTAG CAGAATTTAA CATTGCTGGC GCGCCAGGCT ATAGCAAACA GTTTTTCATG TTTGGCCCGC GCGATCTTTA TCGCCCGGGT GAAACGGTAA TCCTCAATGG TTTGCTGCGT GATGCAGACG GTAAAGCGTT GCCCAATCAA CCCATCAAGT TAGACGTGAT TAAACCCGAT GGGCAGGTAC TCAGGAGCGT CGTTAGTCAG CCGGAAAATG GCCTCTACCA CTTTACCTGG CCACTCGATA GCAATGCGGC AACCGGTATG TGGCATATTC GCGCTAACAC GGGCGATAAC CAGTATCGGA TGTGGGATTT CCACGTCGAA GATTTTATGC CAGAGCGTAT GGCGCTGAAT CTGACCGGTG AGAAAACCCC GCTTACGCCG AATGATGAAG TGAAATTCTC CGTGGTGGGA TACTACCTGT ACGGTGCGCC TGCTAATGGT AATACTTTGC AAGGGCAACT TTTCCTGCGC CCACTGCGTG AGGCTGTGTC AGCCTTACCT GGTTTTGAAT TCGGCGATAT AGCTGCCGAA AACCTTTCCC GCACGCTGGA TGAAGTTCAG TTGACGCTGG ATGATAAAGG GCGCGGCGAA GTTTCTACAG AAAGCCAGTG GAAGGAAACG CATTCCCCAT TGCAGGTTAT TTTCCAGGGC AGTTTGCTGG AATCGGGCGG TCGCCCGGTG ACGCGCCGCG CTGAGCAGGC TATCTGGCCT GCCGATGCAT TGCCGGGGAT CCGTCCGCAG TTCGCCTCGA AATCGGTTTA CGATTATCGC ACTGACAGCA CGGTGAAACA GCCCATTGTT GATGAAGGCA GTAACGCCGC TTTTGACATC GTTTATAGCG ATGCGCAAGG CGTGAAAAAA GCCGTGTCGG GCTTGCAGGT GCGCCTGATT CGCGAACGCC GCGATTACTA CTGGAACTGG TCAGAAGATG AAGGCTGGCA ATCACAGTTT GATCAAAAAG ATCTGATTGA AAATGAACAA ACTCTGGATC TGAAAGCGGA CGAAACCGGC AAGGTCAGTT TCCCGGTAGA GTGGGGCGCT TATCGTCTGG AAGTCAAAGC GCCGAATGAA GCGGTCAGTA GTGTTCGTTT TTGGGCTGGC TATAGCTGGC AGGATAATAG CGACGGTAGC GGCGCAGTGC GACCCGACCG TGTCACGCTG AAACTGGATA AAGCCGGTTA TCGCCCTGGC GATACCATTA AGTTGCATAT CGCCGCGCCA ACGGCGGGTA AAGGTTATGC GATGGTCGAG TCCAGTGAAG GGCCGCTGTG GTGGCAAGAG ATTGATGTTC CGGCTCAAGG GCTGGATCTG ACGATTCCGG TCGATAAAAC CTGGAATCGT CATGATCTTT ATTTGAGTAC GCTGGTGGTG CGTCCTGGCG ATAAATCTCG CTCCGCGACG CCAAAACGCG CGGTTGGTGT GTTGCATCTG CCGCTTGGTG ATGAAAACCG TCGCCTCGAT CTGGCGCTGG AAACACCAGC AAAAATGCGT CCCAATCAAC CATTAACCGT GAAAATTAAA GCCAGCACTA AAAATGGCGA GAAGCCTAAA CAGGTGAATG TGCTGGTGTC TGCCGTTGAT AGTGGTGTGC TGAATATTAC TGACTACGTC ACGCCAGATC CGTGGCAGGC GTTCTTTGGT CAGAAACGCT ATGGCGCAGA CATTTACGAT ATTTACGGTC AGGTTATTGA AGGTCAGGGG CGTCTGGCAG CTCTGCGTTT CGGTGGCGAT GGTGATGAGC TGAAACGTGG TGGTAAACCG CCGGTCAATC ACGTCAATAT TGTCGCGCAG CAGGCGCTGC CGGTAACGCT CAACGAACAG GGCGAAGGCT CGGTTACACT GCCGATTGGC GATTTTAACG GTGAATTGCG CGTCATGGCG CAAGCCTGGA CGGCAGATGA ATTCGGTAGC AACGAAAGCA AAGTGATAGT TGCCGCACCG GTGATTGCTG AACTGAACAT GCCGCGCTTT ATGGCGAGTG GCGATAACTC GCGTCTGACG CTGGATATCA CTAATCTTAC CGATAAACCG CAAAAACTGA ACGTTGCCCT GACCGCCAGT GGTTTGCTTG AACTGGTCAG CGATTCACCC GCAGCCGTTG AATTAGCGCC AGGTGTGCGT ACTACGCTGT TTATCCCGGT GCGAGCATTG CCGGGTTATG GCGATGGAGA AATTCAGGCC ACCATTAGCG GGTTAGCGTT ACCGGGTGAA ACCGTTGCCG ATCAGCATAA GCAGTGGAAA ATCGGCGTCC GTCCGGCGTT CCCGGCACAA ACGGTTAATT ACGGTACGGC GTTACAGCCT GGTGAGACAT GGGCGATTCC GGCGGATGGA TTGCAAAACT TCTCGCCTGT TACGCTGGAA GGGCAATTGT TGTTGAGCGG CAAACCACCG CTGAACATCG CACGTTATAT CAAAGAGTTA AAAGCGTATC CGTACGGCTG TCTTGAGCAA ACCGCCAGCG GCCTGTTCCC GTCACTTTAT ACCAACGCAG CCCAACTGCA GGCGTTGGGC ATCAAAGGCG ACAGTGATGA GAAACGCCGT GCATCGGTCG ATATCGGCAT TTCCCGTTTG CTGCAAATGC AACGTGATAA CGGCGGCTTT GCGCTGTGGG ATAAAAACGG TGACGAAGAG TACTGGCTGA CGGCTTACGT GATGGATTTC CTGGTCCGCG CAGGCGAACA GGGTTACAGC GTGCCGACAG ACGCCATTAA CCGGGGTAAT GAGCGTCTGC TGCGCTATTT ACAAGATCCG GGCATGATGT CGATCCCGTA CGCGGATAAT CTCAAAGCCA GTAAATTCGC CGTACAGTCT TACGCTGCGC TGGTGTTGGC CCGTCAGCAA AAGGCTCCGC TGGGTGCGCT GCGTGAAATC TGGGAGCATC GTGCAGATGC CGCTTCTGGT TTACCGCTGC TGCAACTTGG CGTTGCGCTG AAAACCATGG GTGATGCGAC GCGTGATGAA GAAGCGATTG CGCTGGCGCT AAAAACGCCG CGTAATAGTG ATGAGCGGAT ATGGCTGGGT GATTACGGTA GTCCACTGCG CGACAACGCG TTGATGCTCT CCTTGCTGGA AGAGAACAAA CTGCTACCCG ATGAGCAGTA CTCCCTGCTG AACACACTTT CGCAGCAGGC GTTTGGTGAA CGCTGGCTAT CGACGCAGGA AAGTAACGCG TTGTTCCTGG CTGCCCGTAC GATTCAGGAT TTACCAGGTA AATGGCAGGC GCAAACCTCT TTCTCAGCTG AGCCGCTGAC AGGCGAGAAA GCGCAAAACA GCAATCTGAA TAGCGATCAA CTTGCCACCT TGCAGGTGAC CAACAGTGGC GAGCAGCCGT TATGGCTGCG TGTGGATGCC AGCGGTTATC CGCAATCCGC ACCTTTACCG GCGAACAATG TGCTGCAAAT TGAGCGTCAT ATTCTTGGTA CTGATGGTAA GAGCAAATCG CTGGACTCGT TACGTAGCGG CGATCTGGTG CTGGTCTGGT TGCAGGTAAA AGCTAGTAAC AGCGTGCCGG ATGCGTTAGT CGTGGATCTG CTGCCTGCGG GTCTGGAACT GGAAAACCAG AATCTGGCGA ACGGTAGCGC CAGCCTGGAG CAAAGTGGTG GCGAAGTGCA GAACTTACTG AACCAGATGC AGCAGGCGAG TATTAAGCAC ATTGAGTTCC GTGACGATCG CTTTGTGGCG GCGGTTGCCG TTGATGAATA CCAACCGGTA ACGCTGGTGT ATCTGGCGCG GGCGGTGACG CCGGGAACGT ATCAGGTACC GCAACCGATG GTGGAATCAA TGTATGTTCC CCAATGGCGG GCGACCGGCG CGGCTGAAGA TTTGCTGATT GTCAGACCGT AA
|
Protein sequence | MKKLRVAACM LMLALAGCDN NDNAPTAVKK DAPSEVTKAA SSENASSAKL SAPERQKLAQ QSAGKALTLL DLSEVQLDGA ATLVLTFSIP LDPDQDFSRV IHVVDKKSGK VDGAWELSDN LKELRLRHLE PKRDLIVTIG KEVKALNNAT FSKDYEKTIT TRDIQPSVGF ASRGSLLPGK VVEGLPVMAL NVNNVDVNFF RVKPESLPAF ISQWEYRNSL ANWQSDKLLQ MADLVYTGRF DLNPARNTRE KLLLPLGDIK PLQQAGVYLA VMNQAGRYDY SNPATLFTLS DIGVSAHRYH NRLDIFTQSL ENGAAQQGIE VSLLNEKGQT LTQATSDAQG HVQLENDKNA ALLLARKDGQ TTLLDLKLPA LDLAEFNIAG APGYSKQFFM FGPRDLYRPG ETVILNGLLR DADGKALPNQ PIKLDVIKPD GQVLRSVVSQ PENGLYHFTW PLDSNAATGM WHIRANTGDN QYRMWDFHVE DFMPERMALN LTGEKTPLTP NDEVKFSVVG YYLYGAPANG NTLQGQLFLR PLREAVSALP GFEFGDIAAE NLSRTLDEVQ LTLDDKGRGE VSTESQWKET HSPLQVIFQG SLLESGGRPV TRRAEQAIWP ADALPGIRPQ FASKSVYDYR TDSTVKQPIV DEGSNAAFDI VYSDAQGVKK AVSGLQVRLI RERRDYYWNW SEDEGWQSQF DQKDLIENEQ TLDLKADETG KVSFPVEWGA YRLEVKAPNE AVSSVRFWAG YSWQDNSDGS GAVRPDRVTL KLDKAGYRPG DTIKLHIAAP TAGKGYAMVE SSEGPLWWQE IDVPAQGLDL TIPVDKTWNR HDLYLSTLVV RPGDKSRSAT PKRAVGVLHL PLGDENRRLD LALETPAKMR PNQPLTVKIK ASTKNGEKPK QVNVLVSAVD SGVLNITDYV TPDPWQAFFG QKRYGADIYD IYGQVIEGQG RLAALRFGGD GDELKRGGKP PVNHVNIVAQ QALPVTLNEQ GEGSVTLPIG DFNGELRVMA QAWTADEFGS NESKVIVAAP VIAELNMPRF MASGDNSRLT LDITNLTDKP QKLNVALTAS GLLELVSDSP AAVELAPGVR TTLFIPVRAL PGYGDGEIQA TISGLALPGE TVADQHKQWK IGVRPAFPAQ TVNYGTALQP GETWAIPADG LQNFSPVTLE GQLLLSGKPP LNIARYIKEL KAYPYGCLEQ TASGLFPSLY TNAAQLQALG IKGDSDEKRR ASVDIGISRL LQMQRDNGGF ALWDKNGDEE YWLTAYVMDF LVRAGEQGYS VPTDAINRGN ERLLRYLQDP GMMSIPYADN LKASKFAVQS YAALVLARQQ KAPLGALREI WEHRADAASG LPLLQLGVAL KTMGDATRDE EAIALALKTP RNSDERIWLG DYGSPLRDNA LMLSLLEENK LLPDEQYSLL NTLSQQAFGE RWLSTQESNA LFLAARTIQD LPGKWQAQTS FSAEPLTGEK AQNSNLNSDQ LATLQVTNSG EQPLWLRVDA SGYPQSAPLP ANNVLQIERH ILGTDGKSKS LDSLRSGDLV LVWLQVKASN SVPDALVVDL LPAGLELENQ NLANGSASLE QSGGEVQNLL NQMQQASIKH IEFRDDRFVA AVAVDEYQPV TLVYLARAVT PGTYQVPQPM VESMYVPQWR ATGAAEDLLI VRP
|
| |