Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1423 |
Symbol | |
ID | 6067710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1562208 |
End bp | 1566722 |
Gene Length | 4515 bp |
Protein Length | 1504 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641600842 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001724413 |
Protein GI | 170019459 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.427509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000103653 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGGGAACAG GGCTTGCTAA TGCTGATGAT TCGCTTCCTT CCAGCAACTA TGCGCCGCCT GCCGGGGGAA CATTCTTTTT GCTTGCTGAC AGCAGTTTTA GCAGCAGTGA AGAGGCGAAA GTGCGACTGG AAGCGCCGGG GCGTGATTAT CGGCGCTATC AGATGGAAGA GTACGGCGGC GTGGACGTTC GCCTGTATCG TATTCCTGAC CCGATGGCAT TTTTGCGCCA GCAGAAAAAC CTGCATCGCA TTGTGGTGCA ACCGCAATAT CTGGGCGACG GGCTGAACAA TACGCTAACC TGGCTGTGGG ATAACTGGTA CGGCAAATCT CGCCGCGTGA TGCAGCGTAC TTTCTCTTCT CAGTCACGGC AGAATGTGAC TCAGGCGTTA CCCGAATTAC AGCTCGGCAA TGCCATTATT AAACCTTCCC GTTATGTACA GAACAACCAG TTTTCCCCGC TGAAAAAATA TCCCCTGGTG GAACAGTTCC GTTATCCACT ATGGCAGGCT AAACCGTTCG AGCCGCAGCA AGGGGTAAAA CTGGAAGGCG CATCCAGCAA TTTCATCTCG CCGCAGCCGG GTAACATTTA TATTCCTCTC GGCCAACAAG AGCCGGGACT GTACCTCGTC GAGGCGATGG TTGGTGGGTA TCGGGCGACG ACGGTGGTGT TTGTTTCCGA TACCGTGGCG CTTAGCAAAG TGTCAGGCAA AGAGCTTCTG GTGTGGACCG CGGGTAAAAA ACAGGGTGAA GCGAAGCCCG GCTCAGAGAT CTTGTGGACT GACGGTCTTG GCGTGATGAC CCGCGGTGTG ACCGATGACA GCGGTACCTT GCAGTTACAA CATATATCGC CAGAACGTTC ATACATTCTG GGTAAGGATG CTGAAGGCGG CGTTTTTGTC TCCGAGAACT TCTTCTACGA AAGCGAAATC TACAACACCC GCTTGTATAT TTTTACCGAT CGCCCGCTAT ATCGCGCAGG CGATCGTGTC GATGTTAAAG TGATCGGCCG CGAGTTCCAC GATCCGTTGC ATTCATCCCC CATCGTCAGC GCCCCGGCGA AGCTTTCGGT GCTGGACGCC AACGGCAGTC TGTTGCAAAC CGTCAATGTC ACGCTGGATG CGCGCAATGG CGGGCAGGGA AGTTTCCGCC TGCCAGAAAA TGCCGTAGCC GGAGGTTATG AGTTACGTCT TGCTTACCGC AATCAGGTCT ATAGCAGCAG TTTTCGCGTG GCAAACTACA TCAAGCCACA TTTCGAGATT GGTTTAGCTC TCGACAAAAA AGAGTTCAAA ACTGGCGAAG CGGTCAGCGG CAAACTGCAA CTGCTCTACC CGGATGGCGA GCCGGTAAAA AATGCCCGCG TGCAGTTAAG TTTGCGCGCT CAGCAATTAT CAATGGTCGG TAACGATTTG CGTTATGCCG GACGTTTCCC CGTGTCGCTG GAAGGCAGCG AAACGGTGTC CGACGCCAGC GGTCATGTGA CGTTAAATCT CCCCGCCGCC GATAAACCGA GCCGCTATTT GTTAACCGTC TCCGCCAGTG ACGGCGCGGC GTATCGCGTC ACCACCACCA AAGAGATCCT CATTGAACGC GGCCTGGCGC ATTACTCTTT AAGTACCGCC GCACAATACA GTAATAGCGG CGAGTCGGTT GTGTTCCGTT ATGCCGCGCT GGAATCTTCA AAACAGGTTC CTGTTACGTA TGAATGGTTG CGTCTCGAAG ACCGCACGAG CCATAGCGGA GAGCTACCGT CAGGCGGCAA ATCCTTTACC GTCAATTTCG CTAAACCTGG CAACTACAAT CTGACATTAC GCGATAAAGA CGGCTTAATT CTCGCTGGGT TAAGTCATGC CGTCAGCGGT AAGGGCAGCA CGGCGCATAC TGGTACGGTA GATATCGTGG CGGATAAAAC GCTGTACCAG CCAGGCGAAA CCGCGAAGAT GCTGATTACC TTTCCGGAGC CAATTGATGA AGCATTATTG ACGCTGGAAC GCGATCGCGT GGAACAGCAG TCGCTGCTTT CGCATCCGGC AAACTGGCTA ACGCTACAAC GTTTAAACGA TACCCAGTAT GAAGCCCGGG TTCCAGTGAG CAATTCCTTT GCGCCTAACA TCACTTTTTC GGTGCTGTAT ACCCGTAATG GTCAGTACAG TTTTCAGAAC GCCGGGATCA AAGTTGCCGT TCCTCAGCTT GATATCCGGG TGAAAACGGA CAAAACCCAT TACCAGCCTG GTGAACTGGT CAATGTCGAA TTAACCTCGT CGCTGAAAGG TAAACCTGTT TCTGCGCAGC TAACGGTAGG CGTGGTCGAT GAAATGATCT ACGCGCTGCA ACCAGAAATC GCGCCGAATA TCGGCAAATT TTTCTATCCG CTGGGGCGTA ACAATGTGCG TACCAGCTCC AGTCTGTCGT TTATCAGCTA CGACCAGGCG CTCTCCAGCG AGCCGGTTGC GCCTGGCGCA ACTAACCGCA GCGAGCGGCG AGTAAAAATG CTTGAACGTC CACGGCGTGA AGAGGTGGAT ACCGCGGCAT GGATGCCGTC ACTCACAACC GATAAACAAG GCAAAGCGTA TTTCACGTTC CTGATGCCTG ATTCGTTAAC CCGCTGGCGT ATCACCGCGC GTGGGATGAA CGGCGACGGG CTGGTCGGGC AGGGGCGTGC TTATCTGCGT TCGGAAAAAA ATCTCTACAT GAAGTGGAGT ATGCCAACGG TGTATCGCGT GGGCGACAAA CCGGCGGCAG GACTGTTTAT CTTCAGTCAG CAGGATAACG AACCGGTGGC GCTGGTGACT AAATTTGCAG GCGCTGAGAT GCGCCAGACG CTGACGCTGC ACAAAGGGGC GAATTATATT TCGCTGACGC AGAATATTCA GCAATCTGGC TTGTTAAGTG CAGAACTGCA ACAAAATGGG CAAGTGCAGG ACAGCATTAG CACAAAACTG TCTTTTGTGG ATAACAGCTG GCCCGTTGAA CAGCAGAAAA ATGTCATGCT CGGTGGTGGC GATAACGCGC TGATGTTGCC CGAGCAGGCG AGCAATATCC GGCTACAAAG TAGTGAAACG CCGCAGGAGA TTTTCCGCAA CAATCTTGAT GCGTTAGTCG ATGAACCGTG GGGTGGCGTA ATCAACACCG GTAGCCGTCT GATCCCGCTC AGTCTCGCCT GGCGTTCGCT TGCCGATCAT CAAAGTGCCG CCGCTAACGA CATTCGTCAG ATGATTCAGG ATAACCGTCT GCGGCTGATG CAACTGGCGG GGCCCGGAGC GCGCTTTACC TGGTGGGGTG AAGATGGCAA TGGTGACGCC TTCCTTACGG CATGGGCATG GTACGCCGAC TGGCAGGCCA GCCAGGCGAT CGGCGTAACG CAACAACCGG AATACTGGCA GCATATGCTC GACAGCTACG CGGAGCAGGC AGATAACATG CCGTTATTGC ATCGGGCGCT GGTGCTGGCA TGGGCGCAGG AGATGAATTT GCCGTGCAAA ACGTTGTTGA AAGGGTTGGA TGAAGCTATC GCCCGGCGCG GAACTAAAAC TGAAGATTTC TCTGAGGAAG ACACCCGCGA TATCAATGAT AGCCTGATCC TCGATACACC GGAGTCTCCA CTGGCAGATG CGGTGGCAAA CGTCTTAACC ATGACGTTGC TGAAAAAAGC GCAGTTGAAG TCCACGGTGA TGCCACAGGT TCAGCAATAT GCGTGGGATA AAGCGGCAAA CAGCAATCAG CCGCTGGCGC ACACGGTTGT GCTGCTTAAT AGCGGTGGCG ACGCTACCCA GACGGCCGCT ATTTTAAGTG GTTTGACCGC TGAGCAATCC ACTATTGAGC GCGCGCTGGC CATGAACTGG CTGGCGAAAT ATATGGCGAC AATGCCTCCA GTTGTTTTGC CTGCGCCTGC GGGCGCATGG GCTAAACATA AGTTAACTGG AGGGGGCGAA GACTGGCGTT GGGTTGGTCA GGGTGTGCCG GACATTCTCT CTTTTGGTGA CGAATTATCG CCGCAAAATG TGCAGGTCCG CTGGCGTGAA CCGGCAAAAA CGGCTCAACA AAGTAACATT CCGGTGACCG TTGAACGCCA GTTGTATCGG CTTATCCCCG GTGAAGAAGA GATGAGCTTT ACTCTGCAGC CGGTGACCAG CAATGAGATT GACAGCGATG CGCTGTATCT CGATGAAATT ACGCTTACCA GCGAGCAGGA TGCAGTTCTG CGCTATGGTC AGGTGGAAGT ACCGCTCCCG CCGGGAGCCG ACGTTGAGCG CACAACATGG GGCATTTCAG TCAATAAACC CAACGCCGCG AAACAGCAGG GGCAATTGCT GGAAAAAGCG CGTAATGAAA TGGGCGAACT GGCCTATATG GTGCCGGTGA AAGAACTGAC GGGAACGGTC ACTTTCCGCC ATTTGCTGCG CTTCTCGCAA AAAGGGCAAT TCGTTCTGCC TCCTGCTCGT TATGTGCGTT CCTATGCACC TGCGCAGCAA AGTGTTGCGG CAGGGAGTGA ATGGACCGGG ATGCAGGTGA AATAA
|
Protein sequence | MGTGLANADD SLPSSNYAPP AGGTFFLLAD SSFSSSEEAK VRLEAPGRDY RRYQMEEYGG VDVRLYRIPD PMAFLRQQKN LHRIVVQPQY LGDGLNNTLT WLWDNWYGKS RRVMQRTFSS QSRQNVTQAL PELQLGNAII KPSRYVQNNQ FSPLKKYPLV EQFRYPLWQA KPFEPQQGVK LEGASSNFIS PQPGNIYIPL GQQEPGLYLV EAMVGGYRAT TVVFVSDTVA LSKVSGKELL VWTAGKKQGE AKPGSEILWT DGLGVMTRGV TDDSGTLQLQ HISPERSYIL GKDAEGGVFV SENFFYESEI YNTRLYIFTD RPLYRAGDRV DVKVIGREFH DPLHSSPIVS APAKLSVLDA NGSLLQTVNV TLDARNGGQG SFRLPENAVA GGYELRLAYR NQVYSSSFRV ANYIKPHFEI GLALDKKEFK TGEAVSGKLQ LLYPDGEPVK NARVQLSLRA QQLSMVGNDL RYAGRFPVSL EGSETVSDAS GHVTLNLPAA DKPSRYLLTV SASDGAAYRV TTTKEILIER GLAHYSLSTA AQYSNSGESV VFRYAALESS KQVPVTYEWL RLEDRTSHSG ELPSGGKSFT VNFAKPGNYN LTLRDKDGLI LAGLSHAVSG KGSTAHTGTV DIVADKTLYQ PGETAKMLIT FPEPIDEALL TLERDRVEQQ SLLSHPANWL TLQRLNDTQY EARVPVSNSF APNITFSVLY TRNGQYSFQN AGIKVAVPQL DIRVKTDKTH YQPGELVNVE LTSSLKGKPV SAQLTVGVVD EMIYALQPEI APNIGKFFYP LGRNNVRTSS SLSFISYDQA LSSEPVAPGA TNRSERRVKM LERPRREEVD TAAWMPSLTT DKQGKAYFTF LMPDSLTRWR ITARGMNGDG LVGQGRAYLR SEKNLYMKWS MPTVYRVGDK PAAGLFIFSQ QDNEPVALVT KFAGAEMRQT LTLHKGANYI SLTQNIQQSG LLSAELQQNG QVQDSISTKL SFVDNSWPVE QQKNVMLGGG DNALMLPEQA SNIRLQSSET PQEIFRNNLD ALVDEPWGGV INTGSRLIPL SLAWRSLADH QSAAANDIRQ MIQDNRLRLM QLAGPGARFT WWGEDGNGDA FLTAWAWYAD WQASQAIGVT QQPEYWQHML DSYAEQADNM PLLHRALVLA WAQEMNLPCK TLLKGLDEAI ARRGTKTEDF SEEDTRDIND SLILDTPESP LADAVANVLT MTLLKKAQLK STVMPQVQQY AWDKAANSNQ PLAHTVVLLN SGGDATQTAA ILSGLTAEQS TIERALAMNW LAKYMATMPP VVLPAPAGAW AKHKLTGGGE DWRWVGQGVP DILSFGDELS PQNVQVRWRE PAKTAQQSNI PVTVERQLYR LIPGEEEMSF TLQPVTSNEI DSDALYLDEI TLTSEQDAVL RYGQVEVPLP PGADVERTTW GISVNKPNAA KQQGQLLEKA RNEMGELAYM VPVKELTGTV TFRHLLRFSQ KGQFVLPPAR YVRSYAPAQQ SVAAGSEWTG MQVK
|
| |