Gene EcSMS35_2672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2672 
Symbol 
ID6147448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2743284 
End bp2748242 
Gene Length4959 bp 
Protein Length1652 aa 
Translation table11 
GC content53% 
IMG OID641617543 
Productalpha-2-macroglobulin domain-containing protein 
Protein accessionYP_001744708 
Protein GI170682175 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.757649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.278199 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGT TACGCTTAGC CGCCTGCATG CTAATGCTGG CGCTGGCAGG GTGCGACAAC 
AACGATAACG CGCCAACAGC GGTGAAAAAA GATGCACCTT CTGAAGTTAC TAAAGCGGCA
TCTTCAGAAA ACGCGAGTTC AGCAAAACTC TCTGCACCGG AGCGACAGAA ACTGGCCCAA
CAGAGTGCCG GTAAGGCGCT GACATTGCTG GATCTCTCTG AAGTCCAACT TGATGGCGCA
GCCACGCTGG TGCTGACCTT CTCTATCCCT CTCGACCCGG ATCAGGATTT CTCGCGCGTT
ATTCACGTCG TTGATAAAAA AAGCGGCAAA GTGGATGGGG CCTGGGAGCT GTCAGATAAT
CTTAAAGAGC TGCGTTTACG CCACCTCGAG CCGAAACGTG ATTTGATCGT TACTATTGGC
AAGGAGGTCA AGGCACTCAA CAACGCAACC TTCAGTAAAG ATTACGAAAA AACTATAACT
ACCCGCGATA TCCAGCCCAG CGTCGGTTTT GCCAGCCGTG GTTCGCTGCT GCCTGGAAAA
GTCATTGAAG GGCTGCCGGT AATGGCGCTC AATGTTAACA ATGTCGATGT TAACTTCTTC
CGTGTTAAGC CAGAATCTCT GCCAGCATTT ATTAGCCAAT GGGAATACCG CAATTCGCTG
GCGAACTGGC AGTCAGATAA ACTGCTGCAG ATGGCGGATC TGGTCTACAC CGGACGGTTT
GATCTCAATC CTGCGCGTAA CACCCGTGAA AAATTATTGC TGCCGCTGGG CGATATCAAA
CCGCTTCAGC AGGCGGGCGT GTATCTGGCT GTGATGAATC AGGCTGGACG TTACGATTAC
AGTAATCCCG CGACGCTGTT TACGTTAAGT GATATTGGCG TTTCAGCTCA CCGTTATCAC
AATCGTCTGG ATATCTTTAC CCAAAGTCTG GAAAACGGCG CGGCCCAGCA AGGAATTGAA
GTCTCTTTAT TAAATGAGAA AGGGCAGACT CTGACTCAGG CAACCAGTGA CGCTCAGGGG
CATGTGCAGC TGGAAAATGA TAAAAACGCG GCATTACTGC TGGCGCGTAA AAACGGTCAG
ACAACGCTGC TCGATTTAAA ACTCCCGGCG CTGGACTTAG CGGAATTTAA CATTGCTGGC
GCGCCGGGCT ATAGCAAACA GTTTTTCATG TTTGGCCCGC GCGATCTTTA TCGCCCGGGT
GAAACGGTCA TCCTCAATGG TTTGCTGCGT GATGCAGACG GTAAAGCATT GCCCGATCAA
CCCATCAAGT TAGACGTGAT TAAACCCGAC GGGCAGGTAC TCAGGAGCGT CGTTAGTCAG
CCGGAGAATG GCCTCTACCA CTTTACCTGG CCACTCGATA GCAATGCGGC AACCGGTATG
TGGCATATTC GCGCTAACAC GGGCGATAAC CAGTATCGGA TGTGGGATTT CCACGTCGAA
GATTTTATGC CAGAGCGTAT GGCGCTGAAT CTGACCGGTG AGAAAACCCC GCTAACGCCG
AAAGATGAAG TGAAATTCTC CGTGGTGGGG TACTACCTGT ATGGTGCACC TGCTAATGGT
AATACTTTGC AAGGGCAACT TTTCCTGCGC CCACTGCGTG AAGCTGTGTC AGCCTTACCT
GGTTTTGAAT TCGGCGATAT AGCTGCCGAA AATCTTTCCC GCACGCTGGA TGAAGTTCAG
TTGACGCTGG ATGATAAAGG GCGTGGCGAA GTTTCCACTG AAAGCCAGTG GAAGGAAACG
CATTCCCCAT TACAGGTTAT TTTCCAGGGG AGTTTGTTGG AATCGGGCGG TCGCCCGGTG
ACGCGCCGCG CTGAGCAGGC TATCTGGCCT GCCGATGCAT TGCCGGGGAT CCGTCCGCAG
TTCGCCTCGA AATCGGTTTA CGATTATCGC ACTGACAGCA CGGTGAAACA GCCCATTGTT
GATGAAGGCA GTAACGCCGC TTTTGACATC GTTTATAGCG ATGCGCAAGG CGTAAAAAAA
GCCGTGTCGG GCTTGCAGGT GCGCCTGATT CGCGAACGCC GCGATTACTA CTGGAACTGG
TCAGAAGATG AAGGCTGGCA GTCACAGTTT GATCAAAAAG ATCTGATTGA AAATGAACAA
ACTCTGGATC TGAAAGCGGA CGAAACCGGC AAGGTCAGTT TCCCGGTAGA GTGGGGCGCT
TATCGTCTGG AAGTCAAAGC GCCGAATGAA GCGGTCAGTA GTGTGCGTTT CTGGGCTGGC
TATAGCTGGC AGGACAACAG CGACGGGAGC GGTGCCGTGC GACCCGACCG TGTCACGCTG
AAACTGGATA AAGCCAGTTA TCGCCCTGGC GATACCATTA AGTTGCATAT TGCCGCGCCA
ACGGCGGGTA AAGGTTATGC GATGGTCGAG TCCAGTGAAG GGCCGCTGTG GTGGCAAGAG
ATTGATGTTC CGGCTCAAGG GCTGGATCTG ACGATTCCGG TAAATAAAAC CTGGAATCGT
CATGATCTTT ATTTGAGTAC GCTGGTGGTG CGTCCTGGCG ATAAATCTCG CTCCGCGACG
CCAAAACGCG CGGTTGGGGT GTTGCATCTG CCGCTTGGTG ATGAAAACCG TCGCCTCGAT
CTGGCGCTGG AAACCCCCAC CAAAATGCGC CCCAATCAGC CATTAACCGT GAAAATTAAA
GCCAGCACTA AAAATGGTGA GATGCCAAAA CAGGTGAATG TGCTGGTGTC TGCCGTTGAT
AGCGGTGTGC TGAATATTAC TGATTACGTC ACGCCAGATC CGTGGCAGGC GTTCTTTGGT
CAGAAACGCT ATGGCGCAGA CATTTACGAT ATTTACGGTC AGGTTATTGA AGGTCAGGGG
CGTCTGGCAG CTCTGCGTTT CGGTGGCGAT GGTGATGAGC TGAAACGTGG TGGTAAACCG
CCGGTCAATC ACGTCAATAT TGTCGCGCAG CAGGCGCTGC CGGTAACGCT CAACGAACAG
GGCGAAGGCT CGGTTACACT GCCAATTGGC GATTTTAATG GTGAGTTACG AGTCATGGCG
CAGGCCTGGA CAGCAGATGA TTTCGGTAGC AACGAAAGCA AAGTGGTTGT TGCCGCACCG
GTGATAGCAG AACTGAACAT GCCGCGCTTT ATGGCCAGCG GTGACACCTC ACGTCTGACG
CTGGATATCA CTAATCTTAC CGATAAACCG CAAAAACTGA ACGTTGCCCT GACCGCCAGT
GGTTTGCTTG AACTGGTCAG CGATTCACCC GCAGCCGTTG AATTAGCGCC AGGTGTGCGT
ACTACGCTGT TTATCCCGGT GCGAGCATTG CCGGGTTATG GTGATGGTGA AATTCAGGCC
ACCATTAGCG GGTTAGCGTT ACCCGGGGAA ACCGTTGCCG ATCAGCATAA ACAGTGGAAA
ATCGGCGTCC GTCCGGCGTT CCCGGCACAA ACGGTCAATT ACGGTACGGC GTTGCAGCCT
GGTGAGACAT GGGCGATTCC GGCGGATGGA TTGCAAAACT TCTCGCCTGT TACGCTGGAA
GGGCAATTGT TGTTGAGCGG CAAACCACCG CTGAACATCG CAAGTTATAT CAAAGAGTTA
AAAGCGTATC CGTACGGCTG TCTTGAGCAA ACCGCCAGCG GCCTGTTCCC GTCACTTTAT
ACCAACGCAG CCCAACTGCA GGCGTTGGGC ATCAAAGGCG ACAGTGATGA GAAACGCCGT
GCATCGGTCG ATATCGGCAT TTCCCGTTTG CTGCAAATGC AACGTGATAA CGGCGGCTTT
GCGCTGTGGG ATAAAAACGG TGACGAAGAG TACTGGCTGA CGGCTTACGT GATGGATTTC
CTGGTCCGCG CAGGTGAGCA GGGTTACAGC GTGCCGACAG ACGCCATTAA CCGGGGAAAT
GAGCGTCTGC TGCGCTATTT ACAAGATCCG GGCATGATGT CGATCCCGTA CGCGGATAAT
CTCAAAGCCA GTAAATTCGC CGTACAGTCT TACGCTGCGC TGGTGCTGGC CCGTCAGCAA
AAAGCTCCAC TGGGTGCGCT GCGTGAAATC TGGGAGCATC GTGCAGATGC CGCTTCTGGT
TTACCGCTGC TGCAACTTGG CGTTGCGCTG AAAACCATGG GTGATGCGAC GCGTGGTGAA
GAAGCAATTG CGCTGGCGCT GAAAACGCCG CGTAATGATG AGCGGAAATG GCTGGGTGAT
TACGGTAGCC CACTGCGCGA CAACGCGTTA ATGCTCTCCT TGCTGGAAGA AAATAAACTG
CTACCCGATG AGCAGAACAC TTTGCTGAAT ACGCTTTCGC AGCAGGCGTT TGGTGAACGC
TGGCTATCGA CGCAGGAAAG TAACGCGTTG TTCCTGGCTG CCCGTACGGT TCAGGATTTA
CCGGGTAAAT GGCAGGCGCA AACCTCTTTC TCAGCTGAGC CGCTGACAGG CGAGAAAGCG
CAAACCAGCA ATCTGAATAG CGATCAACTT GCCACCTTGC AGGTGACCAA CAGTGGCGAT
CAGCCGTTAT GGCTGCGTGT GGATGCCAGC GGTTATCCGC AATCCGCACC TTTACCGGCG
AACAATGTGC TGCAAATCGA GCGTCATATT CTTGGTACTG ATGGTAAGAG CAAATCGCTG
GACTCGTTAC GTAGCGGCGA TCTGGTGCTG GTGTGGTTGC AGGTTAAAGC CAGTAACAGC
GTGCCGGATG CGTTGGTCGT GGATCTGCTG CCAGCGGGTC TGGAACTGGA AAACCAGAAT
CTGGCGAACG GTAGCGCCAG CCTGGAGCAA AGTGGTGGCG AAGTGCAGAA CTTACTGAAC
CAGATGCAGC AGGCGAGTAT TAAGCACATT GAGTTCCGTG ACGATCGCTT TGTGGCGGCG
GTTGCCGTTG ATGAATACCA ACCGGTAACG CTGGTGTATC TGGCGCGGGC GGTGACGCCG
GGAACGTATC AGGTACCGCA ACCGATGGTG GAATCAATGT ATGTTCCCCA ATGGCGGGCG
ACCGGCGCGG CTGAAGATTT GCTAATTGTC AGACCGTAA
 
Protein sequence
MKKLRLAACM LMLALAGCDN NDNAPTAVKK DAPSEVTKAA SSENASSAKL SAPERQKLAQ 
QSAGKALTLL DLSEVQLDGA ATLVLTFSIP LDPDQDFSRV IHVVDKKSGK VDGAWELSDN
LKELRLRHLE PKRDLIVTIG KEVKALNNAT FSKDYEKTIT TRDIQPSVGF ASRGSLLPGK
VIEGLPVMAL NVNNVDVNFF RVKPESLPAF ISQWEYRNSL ANWQSDKLLQ MADLVYTGRF
DLNPARNTRE KLLLPLGDIK PLQQAGVYLA VMNQAGRYDY SNPATLFTLS DIGVSAHRYH
NRLDIFTQSL ENGAAQQGIE VSLLNEKGQT LTQATSDAQG HVQLENDKNA ALLLARKNGQ
TTLLDLKLPA LDLAEFNIAG APGYSKQFFM FGPRDLYRPG ETVILNGLLR DADGKALPDQ
PIKLDVIKPD GQVLRSVVSQ PENGLYHFTW PLDSNAATGM WHIRANTGDN QYRMWDFHVE
DFMPERMALN LTGEKTPLTP KDEVKFSVVG YYLYGAPANG NTLQGQLFLR PLREAVSALP
GFEFGDIAAE NLSRTLDEVQ LTLDDKGRGE VSTESQWKET HSPLQVIFQG SLLESGGRPV
TRRAEQAIWP ADALPGIRPQ FASKSVYDYR TDSTVKQPIV DEGSNAAFDI VYSDAQGVKK
AVSGLQVRLI RERRDYYWNW SEDEGWQSQF DQKDLIENEQ TLDLKADETG KVSFPVEWGA
YRLEVKAPNE AVSSVRFWAG YSWQDNSDGS GAVRPDRVTL KLDKASYRPG DTIKLHIAAP
TAGKGYAMVE SSEGPLWWQE IDVPAQGLDL TIPVNKTWNR HDLYLSTLVV RPGDKSRSAT
PKRAVGVLHL PLGDENRRLD LALETPTKMR PNQPLTVKIK ASTKNGEMPK QVNVLVSAVD
SGVLNITDYV TPDPWQAFFG QKRYGADIYD IYGQVIEGQG RLAALRFGGD GDELKRGGKP
PVNHVNIVAQ QALPVTLNEQ GEGSVTLPIG DFNGELRVMA QAWTADDFGS NESKVVVAAP
VIAELNMPRF MASGDTSRLT LDITNLTDKP QKLNVALTAS GLLELVSDSP AAVELAPGVR
TTLFIPVRAL PGYGDGEIQA TISGLALPGE TVADQHKQWK IGVRPAFPAQ TVNYGTALQP
GETWAIPADG LQNFSPVTLE GQLLLSGKPP LNIASYIKEL KAYPYGCLEQ TASGLFPSLY
TNAAQLQALG IKGDSDEKRR ASVDIGISRL LQMQRDNGGF ALWDKNGDEE YWLTAYVMDF
LVRAGEQGYS VPTDAINRGN ERLLRYLQDP GMMSIPYADN LKASKFAVQS YAALVLARQQ
KAPLGALREI WEHRADAASG LPLLQLGVAL KTMGDATRGE EAIALALKTP RNDERKWLGD
YGSPLRDNAL MLSLLEENKL LPDEQNTLLN TLSQQAFGER WLSTQESNAL FLAARTVQDL
PGKWQAQTSF SAEPLTGEKA QTSNLNSDQL ATLQVTNSGD QPLWLRVDAS GYPQSAPLPA
NNVLQIERHI LGTDGKSKSL DSLRSGDLVL VWLQVKASNS VPDALVVDLL PAGLELENQN
LANGSASLEQ SGGEVQNLLN QMQQASIKHI EFRDDRFVAA VAVDEYQPVT LVYLARAVTP
GTYQVPQPMV ESMYVPQWRA TGAAEDLLIV RP