Gene EcSMS35_2376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2376 
Symbol 
ID6144217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2413395 
End bp2417909 
Gene Length4515 bp 
Protein Length1504 aa 
Translation table11 
GC content53% 
IMG OID641617249 
Productalpha-2-macroglobulin family protein 
Protein accessionYP_001744421 
Protein GI170680850 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAACAG GGCTTGCTAA TGCTGATGAT TCGCTTCCTT CCAGTAACTA TGCGCCGCCC 
GGCGGGGGAA CATTCTTTTT GCTCGCTGAC AGCAGTTTTA GCAGCAGTGA AGAGGCGAAA
GTGCGACTGG AAGCGCCGGG GCGTGATTAT CGACGCTATC AGATGGAAGA GTACGGTGGC
GTGGACGTTC GCCTGTATCG TATTCCTGAC CCGATGGCAT TTTTGCGCCA GCAGAAAAAC
CTGCATCGCA TTGTGGTGCA ACCGAAATAT CTGGGCGACG GGCTAAACAA TACGCTGACC
TGGCTGTGGG ATAACTGGTA TGGCAAATCT CGCCGTGTAA TGCAGCGTAC TTTCTCTTCT
CAGTCACGGC AGAATGTGAC TCAGGCGTTA CCCGAATTAC AGCTCGGTAA TGCCATTGTT
AAACCTTCCC GTTATGTACA GAACAGCCAG TTTTCACCGC TGAAAAAATA TCCACTGGTG
GAACAGTTCC GTTACCCATT ATGGCAGGCT AAACCAGTCG AGCCGCAGCA AGGGGTAAAA
CTGGAAGGCG CATCCAGCAA TTTCATCTCG CCGCAGCCGG GTAACATCTA TATTCCTCTC
GGCAAACAAG AGCCGGGACT GTATCTCGTC GAAGCGATGG TCGGTGGGTA TCGGGCAACG
ACGGTGGTAT TTGTCTCCGA TACAGTGGCG CTTAGCAAAG TGTCAGGCAA CGAACTTCTG
GTGTGGACCG CGGGCAAAAA ACAGGGTGAA GCGAAGCCAG GCTCAGAGAT CCTGTGGACT
GACGGTCTTG GCGTGATGAC TCGCGGTGTG ACCGATGACA GCGGTACCTT GCAGTTACAA
CATATATCGC CAGAACGTTC ATACATTCTG GGTAAGGATG CTGAAGGCGG CGTTTTTGTC
TCCGAGAACT TTTTCTACGA AAGTGAAATC TACAACACCC GCTTGTATAT CTTTACCGAT
CGCCCGCTAT ATCGCGCAGG CGATCGTGTC GATGTTAAAG TGATGGGGCG CGAGTTCCAC
GATCCGTTGC ATTCATCCCC CATCGTCAGC GCCCCGGCGA AGCTTTCGGT GCTGGACGCT
AACGGCAGTC TGTTGCAAAC CATCAATGTC ACGCTGGATG CGCGCAATGG CGGGCAGGGG
AGTTTCCGCC TGCCAGAAAA TGCCGTTGCC GGAGGTTATG AGTTGCGTCT TGCTTACCGC
AATCAGGTCT ATAGCAGTAG TTTTCGCGTG GCAAACTACA TCAAGCCACA TTTCGAGATT
GGCTTAGCTC TCGACAAAAA AGAGTTCAAA ACTGGCGAAG CGGTCAGCGG CAAACTGCAA
CTGCTCTACC CGGATGGTGA GCCGGTAAAA GATGCCCGTG TGCAGTTAAG TTTGCGCGCT
CAGCAATTAT CAATGGTCGG TAACGATTTG CGTTATGCCG GACGTTTTCC CGTGTCGCTG
GAAGGCAGTG AAACGGTATC CGACGATAGC GGTCATGTGG CGTTAAATCT CCCCGCCGCT
GATAAACCAA GCCGCTATTT GTTAACCGTC TCCGCCAGTG ACGGTGCGGC GTATCGTGTC
ACCACCACCA AAGAGATCCT CATTGAACGT GGACTGGCAC ATTACTCATT AAGTACTGCC
GCACAATACA GTAATAGCGG TGAATCGGTT GTGTTCCGTT ATGCCGCGCT GGAATCTTCA
AAACAGGTTC CTGTTACGTA TGAATGGTTG CGTCTCGAAG ACCGCACGAG CCATAGCGGA
GAGCTACCGT CAGGCGGCCA ATCCTTTACC GTCAATTTCG CTAAACCTGG CAACTACAAT
CTGACATTAC GCGATAAAGA CGGCTTAATT CTCGCTGGGT TAAGTCATGC CGTCAGCGGT
AAGGGCAGCA CGGCGCATAC TGGTACGGTA GATATCGTGG CGGATAAAAC GCTGTATCAG
CCAGGCGAAA CTGCGAAGAT GCTGATTACC TTTCCGGAGC CAATTGATGA AGCATTATTG
ACGCTGGAAC GCGATCGCGT GGAACAGCAG TCGCTGCTTT CGCATCCGGC AAACTGGCTG
ACGTTACAAC GTTTAAACGA TACCCAGTAT GAAGCCCGGG TTCCAGTGAG CAATTCCTTT
GCGCCTAACA TCACTTTTTC GGTGCTGTAT ACCCGTAACG GTCAGTACAG TTTTCAGAAC
GCCGGGATCA AAGTTGCCGT TCCCCAGCTG GATATCCGGG TGAAAACGGA CAAAACCCAT
TACCAGCCTG GTGAACTGGT CAATGTCGAA TTAACCTCGT CGCTGAAAGG TAAACCTGTT
TCTGCGCAGT TAACGGTAGG CGTGGTCGAT GAAATGATCT ACGCGCTGCA ACCGGAAATC
GCGCCGAATA TCGGCAAATT TTTCTATCCG CTGGGGCGTA ACAATGTGCG TACCAGCTCC
AGTTTGTCGT TTATCAGCTA CGACCAGGCG CTCTCCAGCG AGCCGGTTGC GCCTGGCGCA
ACTAACCGCA GCGAGCGGCG AGTAAAAATG CTTGAACGTC CACGGCGTGA AGAGGTGGAT
ACCGCGGCAT GGATGCCGTC ACTCACAACC GATAAACAAG GCAAAGCGTA TTTCACGTTC
CTGATGCCTG ATTCGTTAAC CCGCTGGCGT ATCACCGCGC GTGGGATGAA CGGCGACGGG
CTGGTCGGGC AGGGGCGTGC TTATCTGCGT TCGGAAAAAA ATCTCTACAT GAAGTGGAGT
ATGCCAACGG TGTATCGCGT GGGCGACAAA CCGGCGGCAG GACTGTTTAT CTTCAGTCAG
CAGGATAACG AACCGGTGGC GCTGGTGACT AAATTTGCAG GCGCTGAGAT GCGCCAGACG
CTGACGCTGC ACAAAGGGGC GAATTATATT TCGCTGACGC AGAATATTCA GCAATCTGGC
TTGTTAAGCG CAGAACTGCA ACAAAATGGG CAAGTGCAGG ACAGCATTAG CACAAAACTG
TCTTTTGTGG ATAACAGCTG GCCCGTTGAA CAGCAGAAAA ATGTCATGCT CGGCGGTGGC
GATAATGCGC TGATGTTGCC CGAGCAGGCG AGCAATATCC GGCTACAAAG TAGTGAAACG
CCGCAGGAGA TTTTCCGTAA CAATCTTGAT GCGTTAGTCG ATGAACCGTG GGGTGGCGTG
ATCAACACCG GTAGCCGTCT GATCCCGATC AGTCTCGCCT GGCGTTCGCT TGCCGATCAT
CAAAGTGCCG CCGCTAACGA CATTCGTCAG ATGATTCAGG ATAACCGTCT GCGGCTGATG
CAACTGGCAG GGCCCGGTGC GCGCTTTACC TGGTGGGGTG AAGATGGCAA TGGTGACGCC
TTCCTCACGG CATGGGCATG GTACGCCGAC TGGCAGGCCA GCCAGGCGCT CGGCGTAACG
CAACAACCGG AATACTGGCA GCATATGCTC GACAGTTATG CCGAGCAGGC AGATAACATG
CCGTTATTGC ATCGGGCGCT GGTGCTGGCG TGGGCGCAGG AGATGAATCT GCCGTGCAAA
ACGTTGTTGA AAGGGTTGGA TGAAGCTATC GCCCGGCGCG GAACTAAAAC TAAAGATTTC
TCTGAGGCAG ACACCAGCGA TATCAATGAC AGTCTGATCC TCGATACACC TGAATCTCCA
CTGGCAGATG CGGTGGCAAA CGTCTTAACC ATGACGTTGC TGAAAAAAGC GCAGTTGAAG
TCCACTGTGA TGCCACAGGT TCAGCAATAT GCGTGGGATA AAGCGGCAAA CAGCAATCAG
CCGCTGGCGC ACACGGTTGT GCTGCTCAAT AGCGGGGGCG ACGCTACCCA GGCTGCTGCT
ATTTTAAGTG GTTTGACCGC TGAGCAATCC ACTATTGAGC GCGCGCTGGC CATGAACTGG
CTGGCGAAAT ATATGGCGAC AATGCCTCCG GTTGTGTTGC CTGCGCCTGC GGGCGCATGG
GCCAAACATA AGTTAACTGG AGGGGGCGAA TACTGGCGTT GGGTTGGTCA GGGCGTGCCG
GACATTCTCT CTTTTGGTGA TGAATTGTCG CCGCAAAATG TGCAGGTCCG CTGGCGTGAA
CCGGCAAAAA CGGCTCAACA AAGTAACATT CCGGTGACCG TTGAACGCCA GTTGTATCGG
CTTATCCCCG GTGAAGAAGA GATGAGCTTT ACTCTGCAGC CGGTGACCAG CAATGAGATT
GACAGCGATG CGCTGTATCT CGATGAAATT ACGCTTACCA GCGAGCAGGA TGCCGTTCTG
CGCTACGGTC AGGTGGAAGT ACCGCTCCCG CCAGGGGCTG ACGTTGAGAG GACTACGTGG
GGCATTTCGG TCAATAAACC GAACGCTGGA AAACAGCAGG GGCAGTTGCT GGAAAAAGCG
CGAAATGAAA TGGGCGAACT GGCCTATATG GTGCCGGTGA AAGAACTGAC GGGAACGGTC
ACTTTCCGCC ATTTGCTGCG CTTCTCGCAA AAAGGGCAAT TCGTTCTGCC TCCTGCTCGT
TATGTGCGTT CCTATGCACC TGCACAGCAA AGTGTTGCGG CAGGGAGCGA ATGGACCGGG
ATGCAGGTGA AATAA
 
Protein sequence
MGTGLANADD SLPSSNYAPP GGGTFFLLAD SSFSSSEEAK VRLEAPGRDY RRYQMEEYGG 
VDVRLYRIPD PMAFLRQQKN LHRIVVQPKY LGDGLNNTLT WLWDNWYGKS RRVMQRTFSS
QSRQNVTQAL PELQLGNAIV KPSRYVQNSQ FSPLKKYPLV EQFRYPLWQA KPVEPQQGVK
LEGASSNFIS PQPGNIYIPL GKQEPGLYLV EAMVGGYRAT TVVFVSDTVA LSKVSGNELL
VWTAGKKQGE AKPGSEILWT DGLGVMTRGV TDDSGTLQLQ HISPERSYIL GKDAEGGVFV
SENFFYESEI YNTRLYIFTD RPLYRAGDRV DVKVMGREFH DPLHSSPIVS APAKLSVLDA
NGSLLQTINV TLDARNGGQG SFRLPENAVA GGYELRLAYR NQVYSSSFRV ANYIKPHFEI
GLALDKKEFK TGEAVSGKLQ LLYPDGEPVK DARVQLSLRA QQLSMVGNDL RYAGRFPVSL
EGSETVSDDS GHVALNLPAA DKPSRYLLTV SASDGAAYRV TTTKEILIER GLAHYSLSTA
AQYSNSGESV VFRYAALESS KQVPVTYEWL RLEDRTSHSG ELPSGGQSFT VNFAKPGNYN
LTLRDKDGLI LAGLSHAVSG KGSTAHTGTV DIVADKTLYQ PGETAKMLIT FPEPIDEALL
TLERDRVEQQ SLLSHPANWL TLQRLNDTQY EARVPVSNSF APNITFSVLY TRNGQYSFQN
AGIKVAVPQL DIRVKTDKTH YQPGELVNVE LTSSLKGKPV SAQLTVGVVD EMIYALQPEI
APNIGKFFYP LGRNNVRTSS SLSFISYDQA LSSEPVAPGA TNRSERRVKM LERPRREEVD
TAAWMPSLTT DKQGKAYFTF LMPDSLTRWR ITARGMNGDG LVGQGRAYLR SEKNLYMKWS
MPTVYRVGDK PAAGLFIFSQ QDNEPVALVT KFAGAEMRQT LTLHKGANYI SLTQNIQQSG
LLSAELQQNG QVQDSISTKL SFVDNSWPVE QQKNVMLGGG DNALMLPEQA SNIRLQSSET
PQEIFRNNLD ALVDEPWGGV INTGSRLIPI SLAWRSLADH QSAAANDIRQ MIQDNRLRLM
QLAGPGARFT WWGEDGNGDA FLTAWAWYAD WQASQALGVT QQPEYWQHML DSYAEQADNM
PLLHRALVLA WAQEMNLPCK TLLKGLDEAI ARRGTKTKDF SEADTSDIND SLILDTPESP
LADAVANVLT MTLLKKAQLK STVMPQVQQY AWDKAANSNQ PLAHTVVLLN SGGDATQAAA
ILSGLTAEQS TIERALAMNW LAKYMATMPP VVLPAPAGAW AKHKLTGGGE YWRWVGQGVP
DILSFGDELS PQNVQVRWRE PAKTAQQSNI PVTVERQLYR LIPGEEEMSF TLQPVTSNEI
DSDALYLDEI TLTSEQDAVL RYGQVEVPLP PGADVERTTW GISVNKPNAG KQQGQLLEKA
RNEMGELAYM VPVKELTGTV TFRHLLRFSQ KGQFVLPPAR YVRSYAPAQQ SVAAGSEWTG
MQVK