Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2376 |
Symbol | |
ID | 6144217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2413395 |
End bp | 2417909 |
Gene Length | 4515 bp |
Protein Length | 1504 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617249 |
Product | alpha-2-macroglobulin family protein |
Protein accession | YP_001744421 |
Protein GI | 170680850 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGAACAG GGCTTGCTAA TGCTGATGAT TCGCTTCCTT CCAGTAACTA TGCGCCGCCC GGCGGGGGAA CATTCTTTTT GCTCGCTGAC AGCAGTTTTA GCAGCAGTGA AGAGGCGAAA GTGCGACTGG AAGCGCCGGG GCGTGATTAT CGACGCTATC AGATGGAAGA GTACGGTGGC GTGGACGTTC GCCTGTATCG TATTCCTGAC CCGATGGCAT TTTTGCGCCA GCAGAAAAAC CTGCATCGCA TTGTGGTGCA ACCGAAATAT CTGGGCGACG GGCTAAACAA TACGCTGACC TGGCTGTGGG ATAACTGGTA TGGCAAATCT CGCCGTGTAA TGCAGCGTAC TTTCTCTTCT CAGTCACGGC AGAATGTGAC TCAGGCGTTA CCCGAATTAC AGCTCGGTAA TGCCATTGTT AAACCTTCCC GTTATGTACA GAACAGCCAG TTTTCACCGC TGAAAAAATA TCCACTGGTG GAACAGTTCC GTTACCCATT ATGGCAGGCT AAACCAGTCG AGCCGCAGCA AGGGGTAAAA CTGGAAGGCG CATCCAGCAA TTTCATCTCG CCGCAGCCGG GTAACATCTA TATTCCTCTC GGCAAACAAG AGCCGGGACT GTATCTCGTC GAAGCGATGG TCGGTGGGTA TCGGGCAACG ACGGTGGTAT TTGTCTCCGA TACAGTGGCG CTTAGCAAAG TGTCAGGCAA CGAACTTCTG GTGTGGACCG CGGGCAAAAA ACAGGGTGAA GCGAAGCCAG GCTCAGAGAT CCTGTGGACT GACGGTCTTG GCGTGATGAC TCGCGGTGTG ACCGATGACA GCGGTACCTT GCAGTTACAA CATATATCGC CAGAACGTTC ATACATTCTG GGTAAGGATG CTGAAGGCGG CGTTTTTGTC TCCGAGAACT TTTTCTACGA AAGTGAAATC TACAACACCC GCTTGTATAT CTTTACCGAT CGCCCGCTAT ATCGCGCAGG CGATCGTGTC GATGTTAAAG TGATGGGGCG CGAGTTCCAC GATCCGTTGC ATTCATCCCC CATCGTCAGC GCCCCGGCGA AGCTTTCGGT GCTGGACGCT AACGGCAGTC TGTTGCAAAC CATCAATGTC ACGCTGGATG CGCGCAATGG CGGGCAGGGG AGTTTCCGCC TGCCAGAAAA TGCCGTTGCC GGAGGTTATG AGTTGCGTCT TGCTTACCGC AATCAGGTCT ATAGCAGTAG TTTTCGCGTG GCAAACTACA TCAAGCCACA TTTCGAGATT GGCTTAGCTC TCGACAAAAA AGAGTTCAAA ACTGGCGAAG CGGTCAGCGG CAAACTGCAA CTGCTCTACC CGGATGGTGA GCCGGTAAAA GATGCCCGTG TGCAGTTAAG TTTGCGCGCT CAGCAATTAT CAATGGTCGG TAACGATTTG CGTTATGCCG GACGTTTTCC CGTGTCGCTG GAAGGCAGTG AAACGGTATC CGACGATAGC GGTCATGTGG CGTTAAATCT CCCCGCCGCT GATAAACCAA GCCGCTATTT GTTAACCGTC TCCGCCAGTG ACGGTGCGGC GTATCGTGTC ACCACCACCA AAGAGATCCT CATTGAACGT GGACTGGCAC ATTACTCATT AAGTACTGCC GCACAATACA GTAATAGCGG TGAATCGGTT GTGTTCCGTT ATGCCGCGCT GGAATCTTCA AAACAGGTTC CTGTTACGTA TGAATGGTTG CGTCTCGAAG ACCGCACGAG CCATAGCGGA GAGCTACCGT CAGGCGGCCA ATCCTTTACC GTCAATTTCG CTAAACCTGG CAACTACAAT CTGACATTAC GCGATAAAGA CGGCTTAATT CTCGCTGGGT TAAGTCATGC CGTCAGCGGT AAGGGCAGCA CGGCGCATAC TGGTACGGTA GATATCGTGG CGGATAAAAC GCTGTATCAG CCAGGCGAAA CTGCGAAGAT GCTGATTACC TTTCCGGAGC CAATTGATGA AGCATTATTG ACGCTGGAAC GCGATCGCGT GGAACAGCAG TCGCTGCTTT CGCATCCGGC AAACTGGCTG ACGTTACAAC GTTTAAACGA TACCCAGTAT GAAGCCCGGG TTCCAGTGAG CAATTCCTTT GCGCCTAACA TCACTTTTTC GGTGCTGTAT ACCCGTAACG GTCAGTACAG TTTTCAGAAC GCCGGGATCA AAGTTGCCGT TCCCCAGCTG GATATCCGGG TGAAAACGGA CAAAACCCAT TACCAGCCTG GTGAACTGGT CAATGTCGAA TTAACCTCGT CGCTGAAAGG TAAACCTGTT TCTGCGCAGT TAACGGTAGG CGTGGTCGAT GAAATGATCT ACGCGCTGCA ACCGGAAATC GCGCCGAATA TCGGCAAATT TTTCTATCCG CTGGGGCGTA ACAATGTGCG TACCAGCTCC AGTTTGTCGT TTATCAGCTA CGACCAGGCG CTCTCCAGCG AGCCGGTTGC GCCTGGCGCA ACTAACCGCA GCGAGCGGCG AGTAAAAATG CTTGAACGTC CACGGCGTGA AGAGGTGGAT ACCGCGGCAT GGATGCCGTC ACTCACAACC GATAAACAAG GCAAAGCGTA TTTCACGTTC CTGATGCCTG ATTCGTTAAC CCGCTGGCGT ATCACCGCGC GTGGGATGAA CGGCGACGGG CTGGTCGGGC AGGGGCGTGC TTATCTGCGT TCGGAAAAAA ATCTCTACAT GAAGTGGAGT ATGCCAACGG TGTATCGCGT GGGCGACAAA CCGGCGGCAG GACTGTTTAT CTTCAGTCAG CAGGATAACG AACCGGTGGC GCTGGTGACT AAATTTGCAG GCGCTGAGAT GCGCCAGACG CTGACGCTGC ACAAAGGGGC GAATTATATT TCGCTGACGC AGAATATTCA GCAATCTGGC TTGTTAAGCG CAGAACTGCA ACAAAATGGG CAAGTGCAGG ACAGCATTAG CACAAAACTG TCTTTTGTGG ATAACAGCTG GCCCGTTGAA CAGCAGAAAA ATGTCATGCT CGGCGGTGGC GATAATGCGC TGATGTTGCC CGAGCAGGCG AGCAATATCC GGCTACAAAG TAGTGAAACG CCGCAGGAGA TTTTCCGTAA CAATCTTGAT GCGTTAGTCG ATGAACCGTG GGGTGGCGTG ATCAACACCG GTAGCCGTCT GATCCCGATC AGTCTCGCCT GGCGTTCGCT TGCCGATCAT CAAAGTGCCG CCGCTAACGA CATTCGTCAG ATGATTCAGG ATAACCGTCT GCGGCTGATG CAACTGGCAG GGCCCGGTGC GCGCTTTACC TGGTGGGGTG AAGATGGCAA TGGTGACGCC TTCCTCACGG CATGGGCATG GTACGCCGAC TGGCAGGCCA GCCAGGCGCT CGGCGTAACG CAACAACCGG AATACTGGCA GCATATGCTC GACAGTTATG CCGAGCAGGC AGATAACATG CCGTTATTGC ATCGGGCGCT GGTGCTGGCG TGGGCGCAGG AGATGAATCT GCCGTGCAAA ACGTTGTTGA AAGGGTTGGA TGAAGCTATC GCCCGGCGCG GAACTAAAAC TAAAGATTTC TCTGAGGCAG ACACCAGCGA TATCAATGAC AGTCTGATCC TCGATACACC TGAATCTCCA CTGGCAGATG CGGTGGCAAA CGTCTTAACC ATGACGTTGC TGAAAAAAGC GCAGTTGAAG TCCACTGTGA TGCCACAGGT TCAGCAATAT GCGTGGGATA AAGCGGCAAA CAGCAATCAG CCGCTGGCGC ACACGGTTGT GCTGCTCAAT AGCGGGGGCG ACGCTACCCA GGCTGCTGCT ATTTTAAGTG GTTTGACCGC TGAGCAATCC ACTATTGAGC GCGCGCTGGC CATGAACTGG CTGGCGAAAT ATATGGCGAC AATGCCTCCG GTTGTGTTGC CTGCGCCTGC GGGCGCATGG GCCAAACATA AGTTAACTGG AGGGGGCGAA TACTGGCGTT GGGTTGGTCA GGGCGTGCCG GACATTCTCT CTTTTGGTGA TGAATTGTCG CCGCAAAATG TGCAGGTCCG CTGGCGTGAA CCGGCAAAAA CGGCTCAACA AAGTAACATT CCGGTGACCG TTGAACGCCA GTTGTATCGG CTTATCCCCG GTGAAGAAGA GATGAGCTTT ACTCTGCAGC CGGTGACCAG CAATGAGATT GACAGCGATG CGCTGTATCT CGATGAAATT ACGCTTACCA GCGAGCAGGA TGCCGTTCTG CGCTACGGTC AGGTGGAAGT ACCGCTCCCG CCAGGGGCTG ACGTTGAGAG GACTACGTGG GGCATTTCGG TCAATAAACC GAACGCTGGA AAACAGCAGG GGCAGTTGCT GGAAAAAGCG CGAAATGAAA TGGGCGAACT GGCCTATATG GTGCCGGTGA AAGAACTGAC GGGAACGGTC ACTTTCCGCC ATTTGCTGCG CTTCTCGCAA AAAGGGCAAT TCGTTCTGCC TCCTGCTCGT TATGTGCGTT CCTATGCACC TGCACAGCAA AGTGTTGCGG CAGGGAGCGA ATGGACCGGG ATGCAGGTGA AATAA
|
Protein sequence | MGTGLANADD SLPSSNYAPP GGGTFFLLAD SSFSSSEEAK VRLEAPGRDY RRYQMEEYGG VDVRLYRIPD PMAFLRQQKN LHRIVVQPKY LGDGLNNTLT WLWDNWYGKS RRVMQRTFSS QSRQNVTQAL PELQLGNAIV KPSRYVQNSQ FSPLKKYPLV EQFRYPLWQA KPVEPQQGVK LEGASSNFIS PQPGNIYIPL GKQEPGLYLV EAMVGGYRAT TVVFVSDTVA LSKVSGNELL VWTAGKKQGE AKPGSEILWT DGLGVMTRGV TDDSGTLQLQ HISPERSYIL GKDAEGGVFV SENFFYESEI YNTRLYIFTD RPLYRAGDRV DVKVMGREFH DPLHSSPIVS APAKLSVLDA NGSLLQTINV TLDARNGGQG SFRLPENAVA GGYELRLAYR NQVYSSSFRV ANYIKPHFEI GLALDKKEFK TGEAVSGKLQ LLYPDGEPVK DARVQLSLRA QQLSMVGNDL RYAGRFPVSL EGSETVSDDS GHVALNLPAA DKPSRYLLTV SASDGAAYRV TTTKEILIER GLAHYSLSTA AQYSNSGESV VFRYAALESS KQVPVTYEWL RLEDRTSHSG ELPSGGQSFT VNFAKPGNYN LTLRDKDGLI LAGLSHAVSG KGSTAHTGTV DIVADKTLYQ PGETAKMLIT FPEPIDEALL TLERDRVEQQ SLLSHPANWL TLQRLNDTQY EARVPVSNSF APNITFSVLY TRNGQYSFQN AGIKVAVPQL DIRVKTDKTH YQPGELVNVE LTSSLKGKPV SAQLTVGVVD EMIYALQPEI APNIGKFFYP LGRNNVRTSS SLSFISYDQA LSSEPVAPGA TNRSERRVKM LERPRREEVD TAAWMPSLTT DKQGKAYFTF LMPDSLTRWR ITARGMNGDG LVGQGRAYLR SEKNLYMKWS MPTVYRVGDK PAAGLFIFSQ QDNEPVALVT KFAGAEMRQT LTLHKGANYI SLTQNIQQSG LLSAELQQNG QVQDSISTKL SFVDNSWPVE QQKNVMLGGG DNALMLPEQA SNIRLQSSET PQEIFRNNLD ALVDEPWGGV INTGSRLIPI SLAWRSLADH QSAAANDIRQ MIQDNRLRLM QLAGPGARFT WWGEDGNGDA FLTAWAWYAD WQASQALGVT QQPEYWQHML DSYAEQADNM PLLHRALVLA WAQEMNLPCK TLLKGLDEAI ARRGTKTKDF SEADTSDIND SLILDTPESP LADAVANVLT MTLLKKAQLK STVMPQVQQY AWDKAANSNQ PLAHTVVLLN SGGDATQAAA ILSGLTAEQS TIERALAMNW LAKYMATMPP VVLPAPAGAW AKHKLTGGGE YWRWVGQGVP DILSFGDELS PQNVQVRWRE PAKTAQQSNI PVTVERQLYR LIPGEEEMSF TLQPVTSNEI DSDALYLDEI TLTSEQDAVL RYGQVEVPLP PGADVERTTW GISVNKPNAG KQQGQLLEKA RNEMGELAYM VPVKELTGTV TFRHLLRFSQ KGQFVLPPAR YVRSYAPAQQ SVAAGSEWTG MQVK
|
| |