Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2672 |
Symbol | |
ID | 6147448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2743284 |
End bp | 2748242 |
Gene Length | 4959 bp |
Protein Length | 1652 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617543 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001744708 |
Protein GI | 170682175 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.757649 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.278199 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGT TACGCTTAGC CGCCTGCATG CTAATGCTGG CGCTGGCAGG GTGCGACAAC AACGATAACG CGCCAACAGC GGTGAAAAAA GATGCACCTT CTGAAGTTAC TAAAGCGGCA TCTTCAGAAA ACGCGAGTTC AGCAAAACTC TCTGCACCGG AGCGACAGAA ACTGGCCCAA CAGAGTGCCG GTAAGGCGCT GACATTGCTG GATCTCTCTG AAGTCCAACT TGATGGCGCA GCCACGCTGG TGCTGACCTT CTCTATCCCT CTCGACCCGG ATCAGGATTT CTCGCGCGTT ATTCACGTCG TTGATAAAAA AAGCGGCAAA GTGGATGGGG CCTGGGAGCT GTCAGATAAT CTTAAAGAGC TGCGTTTACG CCACCTCGAG CCGAAACGTG ATTTGATCGT TACTATTGGC AAGGAGGTCA AGGCACTCAA CAACGCAACC TTCAGTAAAG ATTACGAAAA AACTATAACT ACCCGCGATA TCCAGCCCAG CGTCGGTTTT GCCAGCCGTG GTTCGCTGCT GCCTGGAAAA GTCATTGAAG GGCTGCCGGT AATGGCGCTC AATGTTAACA ATGTCGATGT TAACTTCTTC CGTGTTAAGC CAGAATCTCT GCCAGCATTT ATTAGCCAAT GGGAATACCG CAATTCGCTG GCGAACTGGC AGTCAGATAA ACTGCTGCAG ATGGCGGATC TGGTCTACAC CGGACGGTTT GATCTCAATC CTGCGCGTAA CACCCGTGAA AAATTATTGC TGCCGCTGGG CGATATCAAA CCGCTTCAGC AGGCGGGCGT GTATCTGGCT GTGATGAATC AGGCTGGACG TTACGATTAC AGTAATCCCG CGACGCTGTT TACGTTAAGT GATATTGGCG TTTCAGCTCA CCGTTATCAC AATCGTCTGG ATATCTTTAC CCAAAGTCTG GAAAACGGCG CGGCCCAGCA AGGAATTGAA GTCTCTTTAT TAAATGAGAA AGGGCAGACT CTGACTCAGG CAACCAGTGA CGCTCAGGGG CATGTGCAGC TGGAAAATGA TAAAAACGCG GCATTACTGC TGGCGCGTAA AAACGGTCAG ACAACGCTGC TCGATTTAAA ACTCCCGGCG CTGGACTTAG CGGAATTTAA CATTGCTGGC GCGCCGGGCT ATAGCAAACA GTTTTTCATG TTTGGCCCGC GCGATCTTTA TCGCCCGGGT GAAACGGTCA TCCTCAATGG TTTGCTGCGT GATGCAGACG GTAAAGCATT GCCCGATCAA CCCATCAAGT TAGACGTGAT TAAACCCGAC GGGCAGGTAC TCAGGAGCGT CGTTAGTCAG CCGGAGAATG GCCTCTACCA CTTTACCTGG CCACTCGATA GCAATGCGGC AACCGGTATG TGGCATATTC GCGCTAACAC GGGCGATAAC CAGTATCGGA TGTGGGATTT CCACGTCGAA GATTTTATGC CAGAGCGTAT GGCGCTGAAT CTGACCGGTG AGAAAACCCC GCTAACGCCG AAAGATGAAG TGAAATTCTC CGTGGTGGGG TACTACCTGT ATGGTGCACC TGCTAATGGT AATACTTTGC AAGGGCAACT TTTCCTGCGC CCACTGCGTG AAGCTGTGTC AGCCTTACCT GGTTTTGAAT TCGGCGATAT AGCTGCCGAA AATCTTTCCC GCACGCTGGA TGAAGTTCAG TTGACGCTGG ATGATAAAGG GCGTGGCGAA GTTTCCACTG AAAGCCAGTG GAAGGAAACG CATTCCCCAT TACAGGTTAT TTTCCAGGGG AGTTTGTTGG AATCGGGCGG TCGCCCGGTG ACGCGCCGCG CTGAGCAGGC TATCTGGCCT GCCGATGCAT TGCCGGGGAT CCGTCCGCAG TTCGCCTCGA AATCGGTTTA CGATTATCGC ACTGACAGCA CGGTGAAACA GCCCATTGTT GATGAAGGCA GTAACGCCGC TTTTGACATC GTTTATAGCG ATGCGCAAGG CGTAAAAAAA GCCGTGTCGG GCTTGCAGGT GCGCCTGATT CGCGAACGCC GCGATTACTA CTGGAACTGG TCAGAAGATG AAGGCTGGCA GTCACAGTTT GATCAAAAAG ATCTGATTGA AAATGAACAA ACTCTGGATC TGAAAGCGGA CGAAACCGGC AAGGTCAGTT TCCCGGTAGA GTGGGGCGCT TATCGTCTGG AAGTCAAAGC GCCGAATGAA GCGGTCAGTA GTGTGCGTTT CTGGGCTGGC TATAGCTGGC AGGACAACAG CGACGGGAGC GGTGCCGTGC GACCCGACCG TGTCACGCTG AAACTGGATA AAGCCAGTTA TCGCCCTGGC GATACCATTA AGTTGCATAT TGCCGCGCCA ACGGCGGGTA AAGGTTATGC GATGGTCGAG TCCAGTGAAG GGCCGCTGTG GTGGCAAGAG ATTGATGTTC CGGCTCAAGG GCTGGATCTG ACGATTCCGG TAAATAAAAC CTGGAATCGT CATGATCTTT ATTTGAGTAC GCTGGTGGTG CGTCCTGGCG ATAAATCTCG CTCCGCGACG CCAAAACGCG CGGTTGGGGT GTTGCATCTG CCGCTTGGTG ATGAAAACCG TCGCCTCGAT CTGGCGCTGG AAACCCCCAC CAAAATGCGC CCCAATCAGC CATTAACCGT GAAAATTAAA GCCAGCACTA AAAATGGTGA GATGCCAAAA CAGGTGAATG TGCTGGTGTC TGCCGTTGAT AGCGGTGTGC TGAATATTAC TGATTACGTC ACGCCAGATC CGTGGCAGGC GTTCTTTGGT CAGAAACGCT ATGGCGCAGA CATTTACGAT ATTTACGGTC AGGTTATTGA AGGTCAGGGG CGTCTGGCAG CTCTGCGTTT CGGTGGCGAT GGTGATGAGC TGAAACGTGG TGGTAAACCG CCGGTCAATC ACGTCAATAT TGTCGCGCAG CAGGCGCTGC CGGTAACGCT CAACGAACAG GGCGAAGGCT CGGTTACACT GCCAATTGGC GATTTTAATG GTGAGTTACG AGTCATGGCG CAGGCCTGGA CAGCAGATGA TTTCGGTAGC AACGAAAGCA AAGTGGTTGT TGCCGCACCG GTGATAGCAG AACTGAACAT GCCGCGCTTT ATGGCCAGCG GTGACACCTC ACGTCTGACG CTGGATATCA CTAATCTTAC CGATAAACCG CAAAAACTGA ACGTTGCCCT GACCGCCAGT GGTTTGCTTG AACTGGTCAG CGATTCACCC GCAGCCGTTG AATTAGCGCC AGGTGTGCGT ACTACGCTGT TTATCCCGGT GCGAGCATTG CCGGGTTATG GTGATGGTGA AATTCAGGCC ACCATTAGCG GGTTAGCGTT ACCCGGGGAA ACCGTTGCCG ATCAGCATAA ACAGTGGAAA ATCGGCGTCC GTCCGGCGTT CCCGGCACAA ACGGTCAATT ACGGTACGGC GTTGCAGCCT GGTGAGACAT GGGCGATTCC GGCGGATGGA TTGCAAAACT TCTCGCCTGT TACGCTGGAA GGGCAATTGT TGTTGAGCGG CAAACCACCG CTGAACATCG CAAGTTATAT CAAAGAGTTA AAAGCGTATC CGTACGGCTG TCTTGAGCAA ACCGCCAGCG GCCTGTTCCC GTCACTTTAT ACCAACGCAG CCCAACTGCA GGCGTTGGGC ATCAAAGGCG ACAGTGATGA GAAACGCCGT GCATCGGTCG ATATCGGCAT TTCCCGTTTG CTGCAAATGC AACGTGATAA CGGCGGCTTT GCGCTGTGGG ATAAAAACGG TGACGAAGAG TACTGGCTGA CGGCTTACGT GATGGATTTC CTGGTCCGCG CAGGTGAGCA GGGTTACAGC GTGCCGACAG ACGCCATTAA CCGGGGAAAT GAGCGTCTGC TGCGCTATTT ACAAGATCCG GGCATGATGT CGATCCCGTA CGCGGATAAT CTCAAAGCCA GTAAATTCGC CGTACAGTCT TACGCTGCGC TGGTGCTGGC CCGTCAGCAA AAAGCTCCAC TGGGTGCGCT GCGTGAAATC TGGGAGCATC GTGCAGATGC CGCTTCTGGT TTACCGCTGC TGCAACTTGG CGTTGCGCTG AAAACCATGG GTGATGCGAC GCGTGGTGAA GAAGCAATTG CGCTGGCGCT GAAAACGCCG CGTAATGATG AGCGGAAATG GCTGGGTGAT TACGGTAGCC CACTGCGCGA CAACGCGTTA ATGCTCTCCT TGCTGGAAGA AAATAAACTG CTACCCGATG AGCAGAACAC TTTGCTGAAT ACGCTTTCGC AGCAGGCGTT TGGTGAACGC TGGCTATCGA CGCAGGAAAG TAACGCGTTG TTCCTGGCTG CCCGTACGGT TCAGGATTTA CCGGGTAAAT GGCAGGCGCA AACCTCTTTC TCAGCTGAGC CGCTGACAGG CGAGAAAGCG CAAACCAGCA ATCTGAATAG CGATCAACTT GCCACCTTGC AGGTGACCAA CAGTGGCGAT CAGCCGTTAT GGCTGCGTGT GGATGCCAGC GGTTATCCGC AATCCGCACC TTTACCGGCG AACAATGTGC TGCAAATCGA GCGTCATATT CTTGGTACTG ATGGTAAGAG CAAATCGCTG GACTCGTTAC GTAGCGGCGA TCTGGTGCTG GTGTGGTTGC AGGTTAAAGC CAGTAACAGC GTGCCGGATG CGTTGGTCGT GGATCTGCTG CCAGCGGGTC TGGAACTGGA AAACCAGAAT CTGGCGAACG GTAGCGCCAG CCTGGAGCAA AGTGGTGGCG AAGTGCAGAA CTTACTGAAC CAGATGCAGC AGGCGAGTAT TAAGCACATT GAGTTCCGTG ACGATCGCTT TGTGGCGGCG GTTGCCGTTG ATGAATACCA ACCGGTAACG CTGGTGTATC TGGCGCGGGC GGTGACGCCG GGAACGTATC AGGTACCGCA ACCGATGGTG GAATCAATGT ATGTTCCCCA ATGGCGGGCG ACCGGCGCGG CTGAAGATTT GCTAATTGTC AGACCGTAA
|
Protein sequence | MKKLRLAACM LMLALAGCDN NDNAPTAVKK DAPSEVTKAA SSENASSAKL SAPERQKLAQ QSAGKALTLL DLSEVQLDGA ATLVLTFSIP LDPDQDFSRV IHVVDKKSGK VDGAWELSDN LKELRLRHLE PKRDLIVTIG KEVKALNNAT FSKDYEKTIT TRDIQPSVGF ASRGSLLPGK VIEGLPVMAL NVNNVDVNFF RVKPESLPAF ISQWEYRNSL ANWQSDKLLQ MADLVYTGRF DLNPARNTRE KLLLPLGDIK PLQQAGVYLA VMNQAGRYDY SNPATLFTLS DIGVSAHRYH NRLDIFTQSL ENGAAQQGIE VSLLNEKGQT LTQATSDAQG HVQLENDKNA ALLLARKNGQ TTLLDLKLPA LDLAEFNIAG APGYSKQFFM FGPRDLYRPG ETVILNGLLR DADGKALPDQ PIKLDVIKPD GQVLRSVVSQ PENGLYHFTW PLDSNAATGM WHIRANTGDN QYRMWDFHVE DFMPERMALN LTGEKTPLTP KDEVKFSVVG YYLYGAPANG NTLQGQLFLR PLREAVSALP GFEFGDIAAE NLSRTLDEVQ LTLDDKGRGE VSTESQWKET HSPLQVIFQG SLLESGGRPV TRRAEQAIWP ADALPGIRPQ FASKSVYDYR TDSTVKQPIV DEGSNAAFDI VYSDAQGVKK AVSGLQVRLI RERRDYYWNW SEDEGWQSQF DQKDLIENEQ TLDLKADETG KVSFPVEWGA YRLEVKAPNE AVSSVRFWAG YSWQDNSDGS GAVRPDRVTL KLDKASYRPG DTIKLHIAAP TAGKGYAMVE SSEGPLWWQE IDVPAQGLDL TIPVNKTWNR HDLYLSTLVV RPGDKSRSAT PKRAVGVLHL PLGDENRRLD LALETPTKMR PNQPLTVKIK ASTKNGEMPK QVNVLVSAVD SGVLNITDYV TPDPWQAFFG QKRYGADIYD IYGQVIEGQG RLAALRFGGD GDELKRGGKP PVNHVNIVAQ QALPVTLNEQ GEGSVTLPIG DFNGELRVMA QAWTADDFGS NESKVVVAAP VIAELNMPRF MASGDTSRLT LDITNLTDKP QKLNVALTAS GLLELVSDSP AAVELAPGVR TTLFIPVRAL PGYGDGEIQA TISGLALPGE TVADQHKQWK IGVRPAFPAQ TVNYGTALQP GETWAIPADG LQNFSPVTLE GQLLLSGKPP LNIASYIKEL KAYPYGCLEQ TASGLFPSLY TNAAQLQALG IKGDSDEKRR ASVDIGISRL LQMQRDNGGF ALWDKNGDEE YWLTAYVMDF LVRAGEQGYS VPTDAINRGN ERLLRYLQDP GMMSIPYADN LKASKFAVQS YAALVLARQQ KAPLGALREI WEHRADAASG LPLLQLGVAL KTMGDATRGE EAIALALKTP RNDERKWLGD YGSPLRDNAL MLSLLEENKL LPDEQNTLLN TLSQQAFGER WLSTQESNAL FLAARTVQDL PGKWQAQTSF SAEPLTGEKA QTSNLNSDQL ATLQVTNSGD QPLWLRVDAS GYPQSAPLPA NNVLQIERHI LGTDGKSKSL DSLRSGDLVL VWLQVKASNS VPDALVVDLL PAGLELENQN LANGSASLEQ SGGEVQNLLN QMQQASIKHI EFRDDRFVAA VAVDEYQPVT LVYLARAVTP GTYQVPQPMV ESMYVPQWRA TGAAEDLLIV RP
|
| |