Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2523 |
Symbol | |
ID | 5590239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2508919 |
End bp | 2513433 |
Gene Length | 4515 bp |
Protein Length | 1504 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640926181 |
Product | alpha-2-macroglobulin family protein |
Protein accession | YP_001463575 |
Protein GI | 157157431 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.659146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGAACAG GGCTTGCTAA TGCTGATGAT TCGCTTCCTT CCAGTAACTA TGCGCCGCCC GCCGGGGGAA CATTCTTTTT GCTTGCTGAC AGCAGTTTTA GCAGCAGTGA AGAGGCGAAA GTGCGACTGG AAGCGCCGGG GCGTGATTAT CGGCGCTATC AGATGGAAGA GTACGGCGGC GTGGACGTTC GCCTGTATCG TATTCCTGAC CCGATGGCAT TTTTGCGCCA GCAGAAAAAC CTGCATCGCA TTGTGGTGCA ACCGCAATAT CTGGGCGACG GGCTGAACAA TACGCTGACC TGGCTGTGGG ATAACTGGTA CGGCAAATCT CGCCGCGTAA TGCAGCGTAC TTTCTCTTCT CAGTCACGGC AGAATGTGAC TCAGGCATTA CCCGAATTAC ATCTCGGCAA TGCCATTATT AAACCTTCCC GTTATGTACA GAACAACCAG TTTTCCCCGC TGAAAAAATA TCCCTTGGTG GAACAGTTCC GTTATCCACT ATGGCAGGCT AAACCGGTCG AGCCGCAGCA AGGGGTAAAA CTGGAAGGCG CATCCAGCAA TTTCATCTCG CCGCAGCCGG GTAACATTTA TATTCCTCTC GGCCAACAAG AGCCGGGACT GTACCTCGTC GAGGCGATGG TTGGTGGGTA TCGGGCGACG ACGGTGGTGT TTGTTTCCGA TACCGTGGCG CTTAGCAAAG TGTCAGGCAA CGAGCTTCTG GTGTGGACCG CGGGTAAAAA ACAGGGTGAA GCGAAGCCCG GCTCAGAGAT CTTGTGGACT GACGGTCTTG GCGTGATGAC CCGCGGTGTG ACCGATGACA GCGGTACCTT GCAGTTACAA CATATATCGC CAGAACGTTC ATACATTCTG GGTAAGGATG CTGAAGGCGG TGTTTTTGTC TCCGAGAACT TCTTCTACGA AAGCGAAATC TACAACACCC GCTTGTATAT TTTTACCGAT CGCCCGCTAT ATCGCGCAGG AGATCGTGTC GATGTTAAAG TGATGGGCCG CGAGTTCCAC GATCCGTTGC ATTCATCCCC CATCGTCAGC GCCCCGGCGA AGCTTTCGGT GCTGGACGCT AACGGCAGTC TGTTGCAAAC CGTCAATGTC ACGCTGGATG CGCGCAATGG CGGGCAGGGA AGTTTCCGCC TGCCAGAAAA TGCCGTAGCC GGAGGTTATG AGTTACGTCT TGCTTACCGC AATCAGGTCT ATAGCAGCAG TTTTCGCGTG GCAAACTACA TCAAGCCACA TTTCGAGATT GGTTTAGCTC TCGACAAAAA AGAGTTCAAA ACTGGCGAAG CGGTCAGCGG CAAACTGCAA CTGCTTTACC CGGATGGTGA GCCGGTAAAA GATGCCCGCG TGCAGTTAAG TTTGCGCGCT CAGCAATTAT CAATGGTCGG TAACGATTTG CGTTATGCCG GACGTTTTCC CGTGTCGCTG GAAGGCAGCG AAACGGTGTC CGACGCCAGC GGTCATGTGG CGTTAAATCT CCCCGCCGCC GATAAACCGA GCCGCTATTT GTTAACCGTC TTCGCCAGTG ACGGCGCGGC GTATCGCGTC ACCACCACCA AAGAGATCCT CATTGAACGC GGCCTGGCGC ATTACTCTTT AAGTACCGCC GCACAATACA GTAATAGCGG CGAGTCGGTT GTGTTCCGTT ATGCCGCGCT GGAATCTTCA AAACAGGTTC CTGTTACGTA TGAATGGTTG CGTCTCGAAG ACCGCACGAG CCATAGCGGA GAGCTACCGT CAGGCGGCAA ATCCTTTACC GTCAATTTCG CTAAACCTGG CAACTACAAT CTGACATTAC GCGATAAAGA CGGCTTAATT CTCGCTGGGT TAAGTCATGC CGTCAGCGGT AAGGGCAGCA CGGCGCATAC TGGTACGGTA GATATCGTGG CGGATAAAAC GCTGTACCAG CCAGGCGAAA CTGCGAAGAT GCTGATTACC TTTCCGGAGC CAATTGATGA AGCATTATTG ACGCTGGAAC GCGATCGCGT GGAACAGCAG TCGCTGCTTT CGCATCCGGC AAACTGGCTG ACGTTACAAC GTTTAAACGA TACCCAGTAT GAAGCCCGGG TTCCAGTGAG CAATTCCTTT GCGCCTAACA TCACTTTTTC GGTGCTGTAT ACCCGTAATG GTCAGTACAG TTTTCAGAAC GCCGGGATCA AAGTTGCCGT TCCTCAGCTT GATATCCGGG TGAAAACGGA CAAAACCCAT TATCAGCCTG GTGAACTGGT CAATGTCGAA TTAACCTCGT CGCTGAAAGG TAAACCTGTT TCTGCGCAGC TAACGGTAGG CGTGGTCGAT GAAATGATCT ACGCGCTGCA ACCAGAAATC GCGCCGAATA TCGGCAAATT TTTCTATCCG CTGGGGCGTA ACAATGTGCG TACCAGCTCC AGTCTGTCGT TTATCAGCTA CGACCAGGCG CTCTCCAGCG AGCCGGTTGC GCCTGGCGCA ACTAACCGCA GCGAGCGGCG AGTAAAAATG CTTGAACGTC CACGGCGTGA AGAGGTGGAT ACCGCGGCAT GGATGCCGTC ACTCACAACC GATAAACAAG GCAAAGCGTA TTTCACGTTC CTGATGCCTG ATTCGTTAAC CCGCTGGCGT ATCACCGCGC GTGGGATGAA CGGCGACGGG CTGGTCGGGC AGGGGCGTGC TTATCTGCGT TCGGAAAAAA ATCTCTACAT GAAGTGGAGT ATGCCAACGG TGTATCGCGT GGGCGACAAA CCGGCGGCAG GACTGTTTAT CTTCAGTCAG CAGGATAACG AACCGGTGGC GCTGGTGACT AAATTTGCAG GCGCTGAGAT GCGCCAGACG CTGACGCTGC ACAAAGGGGC GAATTATATT TCGCTGACGC AGAACATTCA GCAATCTGGC TTGTTAAGTG CAGAACTGCA ACAAAATGGG CAAGTGCAGG ACAGCATTAG CACAAAACTG TCTTTTGTGG ACAACAGCTG GCCCGTTGAA CAGCAGAAAA ATGTCATGCT CGGCGGTGGC GAGAACGCGC TGATGTTGCC CGAGCAGGCG AGTAATATCC GGCTACAAAG TAGTGAAACG CCGCAGGAGA TTTTCCGCAA CAATCTTGAT GCGTTAGTCG ATGGACCATG GGGTGGCGTG ATCAACACCG GTAGCCGTCT GATCCCGCTC AGTCTCGCCT GGCGTTCGCT TGCCGATCAT CAAAGTGCCG CCGCTAACGA CATTCGTCAG ATGATTCAGG ATAACCGTCT GCGGCTGATG CAACTGGCGG GGCCCGGAGC GCGCTTTACC TGGTGGGGTG AAGATGGCAA TGGTGACGCC TTCCTTACGG CATGGGCATG GTACGCCGAC TGGCAGGCCA GCCAGGCGCT CGGCGTAACG CAACAACCGG AATACTGGCA GCATATGCTC GACAGTTATG CCGAGCAGGC GGATAACATG CCGTTATTGC ATCGGGCGCT GGTGCTGGCG TGGGCACAGG AGATGAATCT GCCGTGCAAA ACGTTGTTGA AAGGGTTGGA TGAAGCTATC GCCCGGCGCG GAACTAAAGA TGAAGATTTC TCTGAGGAAG ACATCCGCGA TATCAATGAC AGCCTGATCC TCGATACACC GGAATCTCCA CTGGCAGATG CGGTGGCAAA CGTCTTAACC ATGACGTTGC TGAAAAAAGC GCAGTTGAAG TCCACGGTGA TGCCACAGGT TCAGCAATAT GCGTGGGATA AAGCGGCAAA CAGCAATCAG CCGCTGGCGC ACACGGTTGT GCTGCTCAAT AGCGGTGGTG ACGCTACCCA GACGGCCGCT ATTTTAAGTG GTTTGACCGC TGAGCAATCC ACTATTGAGC GCGCGCTGGC CATGAACTGG CTGGCGAAAT ATATGGCGAC AATGCCTCCA GTTGTTTTGC CTGCGCCTGC GGGCGCATGG GCTAAACATA AGTTAACTGG AGGGGGCGAA GACTGGCGTT GGGTTGGTCA GGGCGTGCCG GACATTCTCT CTTTTGGTGA CGAATTATCG CCGCAAAATG TGCAGGTCCG CTGGCGTGAG CCGGCAAAAA TGGCTCAACA AAGTAACATT CCGGTGACCG TTGAACGCCA GTTGTATCGG CTTATCCCTG GTGAAGAAGA GATGAGCTTT ATTCTGCAAC CGGTGACCAG CAATGAGATT GACAGCGATG CGCTGTATCT CGATGAAATT ACGCTTACCA GCGAGCAGGA TGCAGTTCTG CGCTATGGTC AGGTGGAAGT ACCGCTCCCG CCGGGAGCCG ACGTTGAGCG CACAACATGG GGCATTTCAG TCAATAAACC CAACGCCGCG AAACAGCAGG GGCAATTGCT GGAAAAAGCG CGTAATGAAA TGGGCGAACT GGCCTATATG GTGCCGGTGA AAGAACTGAC GGGAACGGTC ACTTTCCGCC ATTTGCTGCG CTTCTCGCAA AAAGGGCAAT TCGTTCTGCC TCCTGCTCGT TATGTGCGTT CCTATGCACC TGCGCAGCAA AGTGTTGCGG CAGGGAGTGA ATGGACCGGG ATGCAGGTGA AATAA
|
Protein sequence | MGTGLANADD SLPSSNYAPP AGGTFFLLAD SSFSSSEEAK VRLEAPGRDY RRYQMEEYGG VDVRLYRIPD PMAFLRQQKN LHRIVVQPQY LGDGLNNTLT WLWDNWYGKS RRVMQRTFSS QSRQNVTQAL PELHLGNAII KPSRYVQNNQ FSPLKKYPLV EQFRYPLWQA KPVEPQQGVK LEGASSNFIS PQPGNIYIPL GQQEPGLYLV EAMVGGYRAT TVVFVSDTVA LSKVSGNELL VWTAGKKQGE AKPGSEILWT DGLGVMTRGV TDDSGTLQLQ HISPERSYIL GKDAEGGVFV SENFFYESEI YNTRLYIFTD RPLYRAGDRV DVKVMGREFH DPLHSSPIVS APAKLSVLDA NGSLLQTVNV TLDARNGGQG SFRLPENAVA GGYELRLAYR NQVYSSSFRV ANYIKPHFEI GLALDKKEFK TGEAVSGKLQ LLYPDGEPVK DARVQLSLRA QQLSMVGNDL RYAGRFPVSL EGSETVSDAS GHVALNLPAA DKPSRYLLTV FASDGAAYRV TTTKEILIER GLAHYSLSTA AQYSNSGESV VFRYAALESS KQVPVTYEWL RLEDRTSHSG ELPSGGKSFT VNFAKPGNYN LTLRDKDGLI LAGLSHAVSG KGSTAHTGTV DIVADKTLYQ PGETAKMLIT FPEPIDEALL TLERDRVEQQ SLLSHPANWL TLQRLNDTQY EARVPVSNSF APNITFSVLY TRNGQYSFQN AGIKVAVPQL DIRVKTDKTH YQPGELVNVE LTSSLKGKPV SAQLTVGVVD EMIYALQPEI APNIGKFFYP LGRNNVRTSS SLSFISYDQA LSSEPVAPGA TNRSERRVKM LERPRREEVD TAAWMPSLTT DKQGKAYFTF LMPDSLTRWR ITARGMNGDG LVGQGRAYLR SEKNLYMKWS MPTVYRVGDK PAAGLFIFSQ QDNEPVALVT KFAGAEMRQT LTLHKGANYI SLTQNIQQSG LLSAELQQNG QVQDSISTKL SFVDNSWPVE QQKNVMLGGG ENALMLPEQA SNIRLQSSET PQEIFRNNLD ALVDGPWGGV INTGSRLIPL SLAWRSLADH QSAAANDIRQ MIQDNRLRLM QLAGPGARFT WWGEDGNGDA FLTAWAWYAD WQASQALGVT QQPEYWQHML DSYAEQADNM PLLHRALVLA WAQEMNLPCK TLLKGLDEAI ARRGTKDEDF SEEDIRDIND SLILDTPESP LADAVANVLT MTLLKKAQLK STVMPQVQQY AWDKAANSNQ PLAHTVVLLN SGGDATQTAA ILSGLTAEQS TIERALAMNW LAKYMATMPP VVLPAPAGAW AKHKLTGGGE DWRWVGQGVP DILSFGDELS PQNVQVRWRE PAKMAQQSNI PVTVERQLYR LIPGEEEMSF ILQPVTSNEI DSDALYLDEI TLTSEQDAVL RYGQVEVPLP PGADVERTTW GISVNKPNAA KQQGQLLEKA RNEMGELAYM VPVKELTGTV TFRHLLRFSQ KGQFVLPPAR YVRSYAPAQQ SVAAGSEWTG MQVK
|
| |