Gene Information |
Locus tag | EcHS_A2368 |
Symbol | |
ID | 5594535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2370780 |
End bp | 2375294 |
Gene Length | 4515 bp |
Protein Length | 1504 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921495 |
Product | alpha-2-macroglobulin family protein |
Protein accession | YP_001459029 |
Protein GI | 157161711 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
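The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon. Neither convention is stated in the record, although both agree with the listed values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for EcHS_A2368.
length_bp = gene_length(2370780, 2375294)
length_aa = protein_length(length_bp)
print(length_bp, "bp,", length_aa, "aa")
```

Because the gene lies on the minus strand, Start bp is the lower coordinate on the replicon rather than the position of the start codon. The length calculation is the same for either strand.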
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGAACAG GGCTTGCTAA TGCTGATGAT TCGCTTCCTT CCAGTAACTA TGCGCCGCCC GCCGGGGGAA CATTCTTTTT GCTTGCTGAC AGCAGTTTTA GCAGCAGTGA AGAGGCGAAA GTGCGACTGG AAGCGCCGGG GCGTGATTAT CGGCGCTATC AGATGGAAGA GTACGGCGGC GTGGACGTTC GCCTGTATCG TATTCCTGAC CCGATGGCAT TTTTGCGCCA GCAGAAAAAC CTGCATCGCA TTGTGGTGCA ACCGCAATAT CTGGGCGACG GGCTGAACAA TACGCTGACC TGGCTGTGGG ATAACTGGTA CGGCAAATCT CGCCGCGTGA TGCAGCGTAC TTTCTCTTCT CAGTCACGGC AGAATGTGAC TCAGGCATTA CCCGAATTAC ATCTCGGCAA TGCCATTATT AAACCTTCCC GTTATGTACA GAACAACCAG TTTTCCCCGC TGAAAAAATA TCCCTTGGTG GAACAGTTCC GTTATCCACT ATGGCAGGCT AAACCGGTCG AGCCGCAGCA AGGGGTAAAA CTGGAAGGCG CATCCAGCAA TTTCATCTCG CCGCAGCCGG GTAACATTTA TATTCCTCTC GGCCAACAAG AGCCGGGACT GTACCTCGTC GAGGCGATGG TTGGTGGGTA TCGGGCGACG ACGGTGGTGT TTGTTTCCGA TACCGTGGCG CTTAGCAAAG TGTCAGGTAA CGAGCTTCTG GTATGGACCG CGGGTAAAAA ACAGGGTGAA GCGAAGCCCG GCTCAGAGAT CTTGTGGACT GACGGTCTTG GCGTGATGAC CCGCGGTGTG ACCGATGACA GCGGTACCTT GCAGTTACAA CATATATCGC CAGAACGTTC ATACATTCTG GGTAAGGATG CTGAAGGCGG CGTTTTTGTC TCCGAGAACT TCTTCTACGA AAGCGAAATC TACAACACCC GCTTGTATAT TTTTACCGAT CGCCCGCTAT ATCGCGCAGG CGATCGTGTC GATGTTAAAG TGATGGGCCG CGAGTTCCAC GATCCGTTGC ATTCATCCCC CATCGTCAGC GCCCCGGCGA AGCTTTCGGT GCTGGACGCT AACGGCAGTC TGTTGCAAAC CGTCGATGTC ACGCTGGATG CGCGCAATGG CGGGCAGGGA AGTTTCCGCC TGCCAGAAAA TGCCGTAGCC GGAGGTTATG AGTTACGTCT TGCTTACCGC AATCAGGTCT ATAGCAGCAG TTTTCGCGTG GCAAACTACA TCAAGCCACA TTTCGAGATT GGTTTAGCTC TCGACAAAAA AGAGTTCAAA ACTGGCGAAG CGGTCAGCGG CAAACTGCAA CTTCTCTACC CGGATGGCGA GCCGGTAAAA AATGCCCGCG TGCAGTTAAG TTTGCGCGCT CAGCAATTAT CAATGGTCGG TAACGATTTG CGTTATGCCG GACGTTTCCC CGTGTCGCTG GAAGGCAGCG AAACGGTGTC CGACGCCAGC GGTCATGTGG CGTTAAATCT CCCCGCCGCC GATAAACCGA GCCGCTATTT GTTAACCGTC TCCGCCAGTG ACGGCGCGGC GTATCGCGTC ACCACCACCA AAGAGATCCT CATTGAACGC GGTCTGGCGC ATTACTCATT AAGTACTGCC GCACAATACA GTAATAGCGG CGAGTCGGTT GTGTTCCGTT ATGCCGCGCT GGAATCTTCA AAACAGGTTC CTGTTACGTA TGAATGGTTG CGTCTCGAAG ACCGCACGAG CCATAGCGGA GAGCTACCGT CAGGCGGCAA ATCCTTTACC GTCAATTTCG CTAAACCTGG CAACTACAAT 
CTGACATTAC GCGATAAAGA CGGCTTAATT CTCGCTGGGT TAAGTCATGC CGTCAGCGGT AAGGGCAGCA CGGCGCATAC TGGTACGGTA GATATCGTGG CGGATAAAAC GCTGTACCAG CCAGGCGAAA CCGCGAAGAT GCTGATTACC TTTCCGGAGC CAATTGATGA AGCATTATTG ACGCTGGAAC GCGATCGCGT GGAACAGCAG TCGCTGCTTT CGCATCCGGC AAACTGGCTA ACGCTACAAC GTTTAAACGA TACCCAGTAT GAAGCCCGGG TTCCAGTGAG CAATTCCTTT GCGCCTAACA TCACTTTTTC GGTGCTGTAT ACCCGTAATG GTCAGTACAG TTTTCAGAAC GCCGGGATCA AAGTTGCCGT TCCTCAGCTT GATATCCGGG TGAAAACGGA CAAAACCCAT TACCAGCCTG GTGAACTGGT CAATGTCGAA TTAACCTCGT CGCTGAAAGG TAAACCTGTT TCTGCGCAGC TAACGGTAGG CGTGGTCGAT GAAATGATCT ACGCGCTGCA ACCAGAAATC GCGCCGAATA TCGGCAAATT TTTCTATCCG CTGGGGCGTA ACAATGTGCG TACCAGCTCC AGTCTGTCGT TTATCAGCTA CGACCAGGCG CTCTCCAGCG AGCCGGTTGC GCCTGGCGCG ACTAACCGCA GCGAGCGGCG AGTAAAAATG CTTGAACGTC CACGGCGTGA AGAGGTGGAT ACCGCGGCAT GGATGCCGTC ACTCACAACC GATAAACAAG GCAAAGCGTA TTTCACGTTC CTGATGCCTG ATTCGTTAAC CCGCTGGCGT ATCACCGCGC GTGGGATGAA CGGCGACGGG CTGGTCGGGC AGGGGCGTGC TTATCTGCGT TCGGAAAAAA ATCTCTACAT GAAGTGGAGT ATGCCAACGG TGTATCGCGT GGGCGACAAA CCGGCGGCAG GACTGTTTAT CTTCAGTCAG CAGGATAACG AACCGGTGGC GCTGGTGACT AAATTTGCAG GCGCTGAGAT GCGCCAGACG CTGACGCTGC ACAAAGGGGC GAATTATATT TCGCTGACGC AGAATATTCA GCAATCTGGC TTGTTAAGTG CAGAACTGCA ACAAAATGGG CAAGTGCAGG ACAGCATTAG CACAAAACTG TCTTTTGTGG ACAACAGCTG GCCCGTTGAA CAGCAGAAAA ATGTCATGCT CGGCGGTGGC GAGAACGCGC TGATGTTGCC CGAGCAGGCG AGTAATATCC GGCTACAAAG TAGTGAAACG CCGCAGGAGA TTTTCCGCAA CAATCTTGAT GCGTTAGTCG ATGAACCGTG GGGTGGGGTG ATCAACACCG GTAGCCGTCT GATCCCGCTC AGTCTCGCCT GGCGTTCGCT TGCCGATCAT CAAAGTGCCG CCGCTAACGA CATTCGTCAG ATGATTCAGG ATAACCGTCT GCGGCTGATG CAACTGGCGG GGCCCGGAGC GCGCTTTACC TGGTGGGGTG AAGATGGCAA TGGTGACGCC TTCCTTACGG CATGGGCATG GTACGCCGAC TGGCAGGCCA GCCAGGCGCT CGGCGTAACG CAACAACCGG AATACTGGCA GCATATGCTC GACAGTTATG CCGAGCAGGC GGATAACATG CCGTTATTGC ATCGGGCGCT GGTGCTGGCG TGGGCACAGG AGATGAATCT GCCGTGCAAA ACGTTGTTGA AAGGGTTGGA TGAAGCTATC GCCCGGCGCG GAACTAAAGA TGAAGATTTC TCTGAGGAAG ACATCCGCGA TATCAATGAC AGCCTGATCC TCGATACACC GGAATCTCCA CTGGCAGATG 
CGGTGGCAAA CGTCTTAACC ATGACGTTGC TGAAAAAAGC GCAGTTGAAG TCCACGGTGA TGCCACAGGT TCAGCAATAT GCGTGGGATA AAGCGGCAAA CAGCAATCAG CCGCTGGCGC ACACGGTTGT GCTGCTCAAT AGCGGGGGCG ACGCTACCCA GGCGGCCGCT ATTTTAAGTG GTTTGACCGC TGAGCAATCC ACTATTGAGC GCGCACTGGC CATGAACTGG CTGGCGAAAT ATATGGCGAC AATGCCTCCG GTTGTGTTGC CTGCGCCTGC GGGCGCATGG GCTAAACATA AGTTAACTGG AGGGGGCGAA GACTGGCGTT GGGTTGGTCA GGGTGTGCCG GACATTCTCT CTTTTGGTGA CGAATTATCC CCGCAAAATG TGCAGGTCCG CTGGCGTGAA CCGGCAAAAA CGGCTCAACA AAGTAACATT CCGGTGACCG TTGAACGCCA GTTGTATCGG CTTATCCCTG GTGAAGAAGA GATGAGCTTT ACTCTGCAAC CGGTGACCAG CAATGAGATT GACAGCGATG CGCTGTATCT CGATGAAATC ACGCTTACCA GCGAGCAGGA TGCAGTTCTG CGCTATGGTC AGGTGGAAGT ACCGCTCCCA CCGGGAGCCG ACGTTGAGCG CACAACATGG GGCATTTCGG TCAATAAACC CAACGCCGCG AAACAGCAGG GGCAATTGCT GGAAAAAGCG CGAAATGAAA TGGGCGAACT GGCCTATATG GTGCCGGTGA AAGAACTGAC GGGAACGGTC ACTTTCCGCC ATTTGCTGCG CTTCTCGCAA AAAGGGCAAT TCGTTCTGCC TCCTGCTCGT TATGTGCGTT CCTATGCACC TGCACAGCAA AGTGTTGCGG CAGGGAGTGA ATGGACCGGG ATGCAGGTGA AATAA
|
Protein sequence | MGTGLANADD SLPSSNYAPP AGGTFFLLAD SSFSSSEEAK VRLEAPGRDY RRYQMEEYGG VDVRLYRIPD PMAFLRQQKN LHRIVVQPQY LGDGLNNTLT WLWDNWYGKS RRVMQRTFSS QSRQNVTQAL PELHLGNAII KPSRYVQNNQ FSPLKKYPLV EQFRYPLWQA KPVEPQQGVK LEGASSNFIS PQPGNIYIPL GQQEPGLYLV EAMVGGYRAT TVVFVSDTVA LSKVSGNELL VWTAGKKQGE AKPGSEILWT DGLGVMTRGV TDDSGTLQLQ HISPERSYIL GKDAEGGVFV SENFFYESEI YNTRLYIFTD RPLYRAGDRV DVKVMGREFH DPLHSSPIVS APAKLSVLDA NGSLLQTVDV TLDARNGGQG SFRLPENAVA GGYELRLAYR NQVYSSSFRV ANYIKPHFEI GLALDKKEFK TGEAVSGKLQ LLYPDGEPVK NARVQLSLRA QQLSMVGNDL RYAGRFPVSL EGSETVSDAS GHVALNLPAA DKPSRYLLTV SASDGAAYRV TTTKEILIER GLAHYSLSTA AQYSNSGESV VFRYAALESS KQVPVTYEWL RLEDRTSHSG ELPSGGKSFT VNFAKPGNYN LTLRDKDGLI LAGLSHAVSG KGSTAHTGTV DIVADKTLYQ PGETAKMLIT FPEPIDEALL TLERDRVEQQ SLLSHPANWL TLQRLNDTQY EARVPVSNSF APNITFSVLY TRNGQYSFQN AGIKVAVPQL DIRVKTDKTH YQPGELVNVE LTSSLKGKPV SAQLTVGVVD EMIYALQPEI APNIGKFFYP LGRNNVRTSS SLSFISYDQA LSSEPVAPGA TNRSERRVKM LERPRREEVD TAAWMPSLTT DKQGKAYFTF LMPDSLTRWR ITARGMNGDG LVGQGRAYLR SEKNLYMKWS MPTVYRVGDK PAAGLFIFSQ QDNEPVALVT KFAGAEMRQT LTLHKGANYI SLTQNIQQSG LLSAELQQNG QVQDSISTKL SFVDNSWPVE QQKNVMLGGG ENALMLPEQA SNIRLQSSET PQEIFRNNLD ALVDEPWGGV INTGSRLIPL SLAWRSLADH QSAAANDIRQ MIQDNRLRLM QLAGPGARFT WWGEDGNGDA FLTAWAWYAD WQASQALGVT QQPEYWQHML DSYAEQADNM PLLHRALVLA WAQEMNLPCK TLLKGLDEAI ARRGTKDEDF SEEDIRDIND SLILDTPESP LADAVANVLT MTLLKKAQLK STVMPQVQQY AWDKAANSNQ PLAHTVVLLN SGGDATQAAA ILSGLTAEQS TIERALAMNW LAKYMATMPP VVLPAPAGAW AKHKLTGGGE DWRWVGQGVP DILSFGDELS PQNVQVRWRE PAKTAQQSNI PVTVERQLYR LIPGEEEMSF TLQPVTSNEI DSDALYLDEI TLTSEQDAVL RYGQVEVPLP PGADVERTTW GISVNKPNAA KQQGQLLEKA RNEMGELAYM VPVKELTGTV TFRHLLRFSQ KGQFVLPPAR YVRSYAPAQQ SVAAGSEWTG MQVK
|
| |
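The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases and the last 15 bases of the record. It is a minimal illustration, not the pipeline used to produce the record: `translate_cds` is a hypothetical helper, the elongation assignments are those of the standard code (which table 11 shares), and `START_CODONS` lists only a subset of the table 11 initiators.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses
# the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the alternative initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups and the final 15 bases of the gene sequence above.
first_residues = translate_cds("GTGGGAACAG GGCTTGCTAA TGCTGATGAT")
last_residues = translate_cds("ATGCAGGTGA AATAA")
print(first_residues, last_residues)
```

The first call reproduces the opening ten residues of the listed protein sequence, and the second reproduces its final four residues followed by the TAA stop codon. The space-separated 10-character groups in both sequence fields are display formatting only, so whitespace is stripped before translation.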