Gene ECH74115_3749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3749 
Symbol 
ID6967571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3472415 
End bp3477376 
Gene Length4962 bp 
Protein Length1653 aa 
Translation table11 
GC content53% 
IMG OID643387541 
Productalpha-2-macroglobulin domain protein 
Protein accessionYP_002271994 
Protein GI209400655 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGT TACGCGTAGC CGCCTGCATG CTAATGCTGG CGCTGGCAGG GTGCGACAAC 
AACGATAACG CGCCAACAGC GGTGAAAAAA GATGCGCCTT CTGAAGTTAC TAAAGCGGCC
TCTTCAGAAA ACGCGAGTTC AGCAAAACTC TCCGCGCCAG AGCGACAAAA ACTGGCCCAA
CAGAGTGCCG GTAAGGCGCT GACATTGCTG GATCTCTCTG AAGTCCAACT TGATGGTGCA
GCCACGCTGG TGCTGACGTT CTCCATCCCT CTCGACCCGG ATCAGGATTT CTCACGCGTT
ATTCATGTCG TCGATAAAAA AAGCGGCAAA GTGGATGGTG CCTGGGAGCT ATCAGATAAT
CTTAAAGAGC TGCGTTTACG CCACCTCGAA CCGAAACGTG ATTTGATCGT TACAATTGGC
AAGGAGGTCA AAGCACTCAA CAACGCAACC TTCAGTAAAG ATTACGAAAA AACTATAACT
ACCCGCGACA TCCAACCCAG TGTCGGTTTT GCCAGCCGTG GTTCGCTGCT GCCTGGTAAA
GTCGTTGAAG GGCTGCCGGT AATGGCGCTC AACGTTAATA ATGTCGATGT TAACTTCTTC
CGCGTTAAGC CAGAATCTCT GCCAGCATTC ATTAGCCAAT GGGAATACCG CAATTCGCTG
GCGAACTGGC AGTCAGACAA ACTGCTGCAG ATGGCGGATC TGGTCTACAC CGGACGGTTT
GATCTCAATC CTGCGCGTAA CACCCGTGAA AAATTATTGC TGCCGCTGGG CGATATCAAA
CCGCTTCAGC AGGCGGGCGT GTATCTGGCT GTGATGAATC AGGCTGGACG TTACGATTAC
AGTAATCCCG CGACGCTGTT TACGTTAAGT GATATCGGCG TTTCAGCTCA CCGTTATCAC
AATCGTCTGG ATATCTTTAC CCAAAGTCTG GAAAACGGCG CGGCCCAGCA AGGAATTGAA
GTCTCTTTAT TAAATGAGAA AGGGCAGACT CTGACTCAGG CAACCAGTGA CGCTCAGGGG
CATGTGCAGC TGGAAAATGA TAAAAACGCC GCATTATTGC TGGCACGTAA AGACGGTCAG
ACAACGCTAC TCGATTTAAA ACTTCCGGCG CTGGACTTAG CAGAATTTAA CATTGCTGGC
GCGCCAGGCT ATAGCAAACA GTTTTTCATG TTTGGCCCGC GCGATCTTTA TCGCCCGGGT
GAAACGGTAA TCCTCAATGG TTTGCTGCGT GATGCAGACG GTAAAGCGTT GCCCAATCAA
CCCATCAAGT TAGACGTGAT TAAACCCGAT GGGCAGGTAC TCAGGAGCGT CGTTAGTCAG
CCGGAAAATG GCCTCTACCA CTTTACCTGG CCACTCGATA GCAATGCGGC AACCGGTATG
TGGCATATTC GCGCTAACAC GGGCGATAAC CAGTATCGGA TGTGGGATTT CCACGTCGAA
GATTTTATGC CAGAGCGTAT GGCGCTGAAT CTGACCGGTG AGAAAACCCC GCTTACGCCG
AATGATGAAG TGAAATTCTC CGTGGTGGGA TACTACCTGT ACGGTGCGCC TGCTAATGGT
AATACTTTGC AAGGGCAACT TTTCCTGCGC CCACTGCGTG AGGCTGTGTC AGCCTTACCT
GGTTTTGAAT TCGGCGATAT AGCTGCCGAA AACCTTTCCC GCACGCTGGA TGAAGTTCAG
TTGACGCTGG ATGATAAAGG GCGCGGCGAA GTTTCTACAG AAAGCCAGTG GAAGGAAACG
CATTCCCCAT TGCAGGTTAT TTTCCAGGGC AGTTTGCTGG AATCGGGCGG TCGCCCGGTG
ACGCGCCGCG CTGAGCAGGC TATCTGGCCT GCCGATGCAT TGCCGGGGAT CCGTCCGCAG
TTCGCCTCGA AATCGGTTTA CGATTATCGC ACTGACAGCA CGGTGAAACA GCCCATTGTT
GATGAAGGCA GTAACGCCGC TTTTGACATC GTTTATAGCG ATGCGCAAGG CGTGAAAAAA
GCCGTGTCGG GCTTGCAGGT GCGCCTGATT CGCGAACGCC GCGATTACTA CTGGAACTGG
TCAGAAGATG AAGGCTGGCA ATCACAGTTT GATCAAAAAG ATCTGATTGA AAATGAACAA
ACTCTGGATC TGAAAGCGGA CGAAACCGGC AAGGTCAGTT TCCCGGTAGA GTGGGGCGCT
TATCGTCTGG AAGTCAAAGC GCCGAATGAA GCGGTCAGTA GTGTTCGTTT TTGGGCTGGC
TATAGCTGGC AGGATAATAG CGACGGTAGC GGCGCAGTGC GACCCGACCG TGTCACGCTG
AAACTGGATA AAGCCGGTTA TCGCCCTGGC GATACCATTA AGTTGCATAT CGCCGCGCCA
ACGGCGGGTA AAGGTTATGC GATGGTCGAG TCCAGTGAAG GGCCGCTGTG GTGGCAAGAG
ATTGATGTTC CGGCTCAAGG GCTGGATCTG ACGATTCCGG TCGATAAAAC CTGGAATCGT
CATGATCTTT ATTTGAGTAC GCTGGTGGTG CGTCCTGGCG ATAAATCTCG CTCCGCGACG
CCAAAACGCG CGGTTGGTGT GTTGCATCTG CCGCTTGGTG ATGAAAACCG TCGCCTCGAT
CTGGCGCTGG AAACACCAGC AAAAATGCGT CCCAATCAAC CATTAACCGT GAAAATTAAA
GCCAGCACTA AAAATGGCGA GAAGCCTAAA CAGGTGAATG TGCTGGTGTC TGCCGTTGAT
AGTGGTGTGC TGAATATTAC TGACTACGTC ACGCCAGATC CGTGGCAGGC GTTCTTTGGT
CAGAAACGCT ATGGCGCAGA CATTTACGAT ATTTACGGTC AGGTTATTGA AGGTCAGGGG
CGTCTGGCAG CTCTGCGTTT CGGTGGCGAT GGTGATGAGC TGAAACGTGG TGGTAAACCG
CCGGTCAATC ACGTCAATAT TGTCGCGCAG CAGGCGCTGC CGGTAACGCT CAACGAACAG
GGCGAAGGCT CGGTTACACT GCCGATTGGC GATTTTAACG GTGAATTGCG CGTCATGGCG
CAAGCCTGGA CGGCAGATGA ATTCGGTAGC AACGAAAGCA AAGTGATAGT TGCCGCACCG
GTGATTGCTG AACTGAACAT GCCGCGCTTT ATGGCGAGTG GCGATAACTC GCGTCTGACG
CTGGATATCA CTAATCTTAC CGATAAACCG CAAAAACTGA ACGTTGCCCT GACCGCCAGT
GGTTTGCTTG AACTGGTCAG CGATTCACCC GCAGCCGTTG AATTAGCGCC AGGTGTGCGT
ACTACGCTGT TTATCCCGGT GCGAGCATTG CCGGGTTATG GCGATGGAGA AATTCAGGCC
ACCATTAGCG GGTTAGCGTT ACCGGGTGAA ACCGTTGCCG ATCAGCATAA GCAGTGGAAA
ATCGGCGTCC GTCCGGCGTT CCCGGCACAA ACGGTTAATT ACGGTACGGC GTTACAGCCT
GGTGAGACAT GGGCGATTCC GGCGGATGGA TTGCAAAACT TCTCGCCTGT TACGCTGGAA
GGGCAATTGT TGTTGAGCGG CAAACCACCG CTGAACATCG CACGTTATAT CAAAGAGTTA
AAAGCGTATC CGTACGGCTG TCTTGAGCAA ACCGCCAGCG GCCTGTTCCC GTCACTTTAT
ACCAACGCAG CCCAACTGCA GGCGTTGGGC ATCAAAGGCG ACAGTGATGA GAAACGCCGT
GCATCGGTCG ATATCGGCAT TTCCCGTTTG CTGCAAATGC AACGTGATAA CGGCGGCTTT
GCGCTGTGGG ATAAAAACGG TGACGAAGAG TACTGGCTGA CGGCTTACGT GATGGATTTC
CTGGTCCGCG CAGGCGAACA GGGTTACAGC GTGCCGACAG ACGCCATTAA CCGGGGTAAT
GAGCGTCTGC TGCGCTATTT ACAAGATCCG GGCATGATGT CGATCCCGTA CGCGGATAAT
CTCAAAGCCA GTAAATTCGC CGTACAGTCT TACGCTGCGC TGGTGTTGGC CCGTCAGCAA
AAGGCTCCGC TGGGTGCGCT GCGTGAAATC TGGGAGCATC GTGCAGATGC CGCTTCTGGT
TTACCGCTGC TGCAACTTGG CGTTGCGCTG AAAACCATGG GTGATGCGAC GCGTGATGAA
GAAGCGATTG CGCTGGCGCT AAAAACGCCG CGTAATAGTG ATGAGCGGAT ATGGCTGGGT
GATTACGGTA GTCCACTGCG CGACAACGCG TTGATGCTCT CCTTGCTGGA AGAGAACAAA
CTGCTACCCG ATGAGCAGTA CTCCCTGCTG AACACACTTT CGCAGCAGGC GTTTGGTGAA
CGCTGGCTAT CGACGCAGGA AAGTAACGCG TTGTTCCTGG CTGCCCGTAC GATTCAGGAT
TTACCAGGTA AATGGCAGGC GCAAACCTCT TTCTCAGCTG AGCCGCTGAC AGGCGAGAAA
GCGCAAAACA GCAATCTGAA TAGCGATCAA CTTGCCACCT TGCAGGTGAC CAACAGTGGC
GAGCAGCCGT TATGGCTGCG TGTGGATGCC AGCGGTTATC CGCAATCCGC ACCTTTACCG
GCGAACAATG TGCTGCAAAT TGAGCGTCAT ATTCTTGGTA CTGATGGTAA GAGCAAATCG
CTGGACTCGT TACGTAGCGG CGATCTGGTG CTGGTCTGGT TGCAGGTAAA AGCTAGTAAC
AGCGTGCCGG ATGCGTTAGT CGTGGATCTG CTGCCTGCGG GTCTGGAACT GGAAAACCAG
AATCTGGCGA ACGGTAGCGC CAGCCTGGAG CAAAGTGGTG GCGAAGTGCA GAACTTACTG
AACCAGATGC AGCAGGCGAG TATTAAGCAC ATTGAGTTCC GTGACGATCG CTTTGTGGCG
GCGGTTGCCG TTGATGAATA CCAACCGGTA ACGCTGGTGT ATCTGGCGCG GGCGGTGACG
CCGGGAACGT ATCAGGTACC GCAACCGATG GTGGAATCAA TGTATGTTCC CCAATGGCGG
GCGACCGGCG CGGCTGAAGA TTTGCTGATT GTCAGACCGT AA
 
Protein sequence
MKKLRVAACM LMLALAGCDN NDNAPTAVKK DAPSEVTKAA SSENASSAKL SAPERQKLAQ 
QSAGKALTLL DLSEVQLDGA ATLVLTFSIP LDPDQDFSRV IHVVDKKSGK VDGAWELSDN
LKELRLRHLE PKRDLIVTIG KEVKALNNAT FSKDYEKTIT TRDIQPSVGF ASRGSLLPGK
VVEGLPVMAL NVNNVDVNFF RVKPESLPAF ISQWEYRNSL ANWQSDKLLQ MADLVYTGRF
DLNPARNTRE KLLLPLGDIK PLQQAGVYLA VMNQAGRYDY SNPATLFTLS DIGVSAHRYH
NRLDIFTQSL ENGAAQQGIE VSLLNEKGQT LTQATSDAQG HVQLENDKNA ALLLARKDGQ
TTLLDLKLPA LDLAEFNIAG APGYSKQFFM FGPRDLYRPG ETVILNGLLR DADGKALPNQ
PIKLDVIKPD GQVLRSVVSQ PENGLYHFTW PLDSNAATGM WHIRANTGDN QYRMWDFHVE
DFMPERMALN LTGEKTPLTP NDEVKFSVVG YYLYGAPANG NTLQGQLFLR PLREAVSALP
GFEFGDIAAE NLSRTLDEVQ LTLDDKGRGE VSTESQWKET HSPLQVIFQG SLLESGGRPV
TRRAEQAIWP ADALPGIRPQ FASKSVYDYR TDSTVKQPIV DEGSNAAFDI VYSDAQGVKK
AVSGLQVRLI RERRDYYWNW SEDEGWQSQF DQKDLIENEQ TLDLKADETG KVSFPVEWGA
YRLEVKAPNE AVSSVRFWAG YSWQDNSDGS GAVRPDRVTL KLDKAGYRPG DTIKLHIAAP
TAGKGYAMVE SSEGPLWWQE IDVPAQGLDL TIPVDKTWNR HDLYLSTLVV RPGDKSRSAT
PKRAVGVLHL PLGDENRRLD LALETPAKMR PNQPLTVKIK ASTKNGEKPK QVNVLVSAVD
SGVLNITDYV TPDPWQAFFG QKRYGADIYD IYGQVIEGQG RLAALRFGGD GDELKRGGKP
PVNHVNIVAQ QALPVTLNEQ GEGSVTLPIG DFNGELRVMA QAWTADEFGS NESKVIVAAP
VIAELNMPRF MASGDNSRLT LDITNLTDKP QKLNVALTAS GLLELVSDSP AAVELAPGVR
TTLFIPVRAL PGYGDGEIQA TISGLALPGE TVADQHKQWK IGVRPAFPAQ TVNYGTALQP
GETWAIPADG LQNFSPVTLE GQLLLSGKPP LNIARYIKEL KAYPYGCLEQ TASGLFPSLY
TNAAQLQALG IKGDSDEKRR ASVDIGISRL LQMQRDNGGF ALWDKNGDEE YWLTAYVMDF
LVRAGEQGYS VPTDAINRGN ERLLRYLQDP GMMSIPYADN LKASKFAVQS YAALVLARQQ
KAPLGALREI WEHRADAASG LPLLQLGVAL KTMGDATRDE EAIALALKTP RNSDERIWLG
DYGSPLRDNA LMLSLLEENK LLPDEQYSLL NTLSQQAFGE RWLSTQESNA LFLAARTIQD
LPGKWQAQTS FSAEPLTGEK AQNSNLNSDQ LATLQVTNSG EQPLWLRVDA SGYPQSAPLP
ANNVLQIERH ILGTDGKSKS LDSLRSGDLV LVWLQVKASN SVPDALVVDL LPAGLELENQ
NLANGSASLE QSGGEVQNLL NQMQQASIKH IEFRDDRFVA AVAVDEYQPV TLVYLARAVT
PGTYQVPQPM VESMYVPQWR ATGAAEDLLI VRP