Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3001 |
Symbol | asmA |
ID | 6967064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2780441 |
End bp | 2782294 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643386841 |
Product | putative assembly protein |
Protein accession | YP_002271309 |
Protein GI | 209395947 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2982] Uncharacterized protein involved in outer membrane biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.352254 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGAT TTCTGACGAC GCTGATGATA CTCCTGGTCG TGCTGGTGGC CGGGTTATCT GCGTTAGTGT TGCTGGTGAA TCCGAATGAT TTCCGCGACT ATATGGTCAA GCAAGTTGCT GCACGTAGCG GTTATCAATT GCAGCTCGAC GGGCCACTGC GTTGGCACGT CTGGCCGCAG CTTAGTATCC TCTCCGGGCG AATGTCTCTC ACCGCCCAGG GCGCGAGCCA GCCACTGGTT CGCGCCGACA ACATGCGCCT GGACGTGGCG CTTTTACCAC TACTGAGTCA TCAACTGAGC GTTAAGCAGG TGATGCTAAA AGGGGCAGTG ATCCAACTGA CGCCGCAGAC GGAAGCGGTG CGCAGTGAAG ACGCTCCGGT TGCACCGCGC GACAATACCT TGCCGGATCT GTCAGACGAT CGCGGATGGT CGTTTGATAT ATCCAGTCTT AAGGTGGCGG ACAGCGTGCT GGTGTTCCAG CATGAAGATG ACGAGCAGGT GACAATCCGC AATATCCGCC TGCAAATGGA ACAAGATCCC CAACATCGTG GCTCATTTGA GTTCTCCGGG CGGGTTAATC GCGATCAGCG CGATCTCACG ATATCCCTTA ACGGTACGGT AGATGCTTCT GATTATCCGC ATGATTTAAC GGCGGCTATT GAACAAATTA ACTGGCAGTT GCAGGGTGCC GATTTACCAA AACAAGGTAT TCAGGGGCAG GGGAGTTTCC AGGCCCAGTG GCAGGAGTCA CGTAAACGCC TTTCATTTAA CCAAATTAGT TTGACCGCCA ATGATAGTAC GCTGAGCGGG CAAGCACAGG TCACGCTGAC AGAGAAACCG GAATGGCAGC TGAGGCTGCA ATTCCCGCAA CTGAATCTTG ACAACCTCAT CCCGCTTAAT GAAACCGCGA ATGGTGAAAA CGGTGCCGCG CAGCAGGGGC AGAGCCAATC AACGTTGCCG CGCCCGGTTA TTTCTTCGCG TATTGATGAA CCGGCCTATC AGGGACTGCA AGGCTTTACG GCTGATATTT TGTTGCAGGC CAGTAACGTG CGCTGGCGCG GAATGAATTT TACAGATGTT GCCACGCAAA TGACCAACAA GTCGGGTTTG CTGGAAATTA CTCAACTACA GGGCAAGCTT AACGGTGGAC AGGTTTCACT GCCGGGCACG CTGGACGCGA CATCAATAAA TCCGCGGATA AACTTCCAGC CACGGCTGGA AAACGTTGAG ATTGGCACCA TTCTGAAGGC GTTTAACTAT CCGATTTCGT TGACCGGAAA AATGTCATTG GCTGGTGATT TCTCCGGTGC TGACATAGAT GCCGACGCAT TCCGCCACAA CTGGCAAGGA CAGGCACATG TCGAAATGGC CGATACGCGC ATGGAAGGGA TGAACTTCCA GCAGATGATT CAGCAAGCGG TAGAACGTAA TGGTGGTGAT GTGAAGGCCG CTGAAAACTT CGATAACGTA ACGCGTCTTG ACCGCTTTAC CACCGATTTG ACGTTGAAGG ATGGCGTCGT GACGTTAAAC GACATGCAAG GTCAATCGCC TGTGCTGGCG CTTACAGGGG AAGGCATGTT GAATCTGGCA GCTCAAACCT GCGACACCCA GTTTGATATT CGGGTCGTGG GTGGCTGGAA CGGGGAAAGC AAACTGATTG ATTTCCTCAA AGAAACGCCA GTACCGCTGC GGGTTTATGG CAACTGGCAG CAACTCAATT ACAGTCTGCA AGTGGATCAG TTACTGCGCA AACATCTACA GGACGAAGCG AAACGTCGCC TGAATGACTG GGCCGAGCGG AATAAAGATT CCCGCAATGG CAAAGATGTG AAGAAGTTGC TGGAGAAGAT GTAA
|
Protein sequence | MRRFLTTLMI LLVVLVAGLS ALVLLVNPND FRDYMVKQVA ARSGYQLQLD GPLRWHVWPQ LSILSGRMSL TAQGASQPLV RADNMRLDVA LLPLLSHQLS VKQVMLKGAV IQLTPQTEAV RSEDAPVAPR DNTLPDLSDD RGWSFDISSL KVADSVLVFQ HEDDEQVTIR NIRLQMEQDP QHRGSFEFSG RVNRDQRDLT ISLNGTVDAS DYPHDLTAAI EQINWQLQGA DLPKQGIQGQ GSFQAQWQES RKRLSFNQIS LTANDSTLSG QAQVTLTEKP EWQLRLQFPQ LNLDNLIPLN ETANGENGAA QQGQSQSTLP RPVISSRIDE PAYQGLQGFT ADILLQASNV RWRGMNFTDV ATQMTNKSGL LEITQLQGKL NGGQVSLPGT LDATSINPRI NFQPRLENVE IGTILKAFNY PISLTGKMSL AGDFSGADID ADAFRHNWQG QAHVEMADTR MEGMNFQQMI QQAVERNGGD VKAAENFDNV TRLDRFTTDL TLKDGVVTLN DMQGQSPVLA LTGEGMLNLA AQTCDTQFDI RVVGGWNGES KLIDFLKETP VPLRVYGNWQ QLNYSLQVDQ LLRKHLQDEA KRRLNDWAER NKDSRNGKDV KKLLEKM
|
| |