Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0996 |
Symbol | asmA |
ID | 6143404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1013338 |
End bp | 1015191 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615883 |
Product | putative assembly protein |
Protein accession | YP_001743075 |
Protein GI | 170681043 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2982] Uncharacterized protein involved in outer membrane biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGAT TTCTGACGAC GCTGATGATA CTCCTGGTCG TGCTGGTGGC CGGGTTATCT GCGTTAGTGT TGCTGGTGAA TCCGAATGAT TTCCGCGACT ATATGGTCAA GCAAGTTGCT GCACGTAGCG GTTATCAATT GCAGCTCGAC GGGCCACTGC GTTGGCACGT CTGGCCGCAG CTTAGTATCC TCTCCGGGCG AATGTCTCTC ACCGCCCAGG GCGCGAGCCA GCCACTGGTT CGCGCCGACA ACATGCGTCT GGACGTGGCG CTTTTACCAC TACTGAGTCA TCAACTGAGC GTTAAGCAGG TGATGCTAAA AGGGGCAGTG ATTCAACTGA CGCCGCAGAC GGAAGCGGTG CGCAGTGAAG ACGCTCCGGT TGCACCGCGC GACAATACCT TGCCGGATCT GTCAGACGAT CGCGGATGGT CGTTTGATAT ATCCAGTCTT AAGGTGGCGG ACAGCGTGCT GGTGTTCCAG CATGAAGATG ACGAGCAGGT GACAATCCGC AATATTCGCC TGCAAATGGA ACAAGATCCC CAACATCGTG GCTCATTTGA GTTCTCCGGG CGGGTTAATC GCGATCAGCG CGATCTCACG ATATCCCTTA ACGGTACGGT AGATGCTTCT GATTATCCGC ATGATTTAAC GGCGGCTATT GAACAAATTA ACTGGCAGTT GCAGGGTGCC GATTTACCAA AACAAGGTAT TCAGGGGCAG GGGAGTTTCC AGGCCCAGTG GCAGGAGTCA CATAAACGCC TTTCATTTAA CCAAATTAGT TTGACCGCCA ATGACAGTAC GCTGAGCGGG CAAGCACAGG TCACGTTGAC AGAGAAACCG GAATGGCAGC TGAGACTGCA ATTCCCGCAA CTGAATCTTG ACAACCTCAT CCCCCTTAAT GAAACAGCGA ACGGTGAGAA CGGTGCCGCG CAGCAAGGGC AGAGCCAATC AACGTTGCCG CGCCCGGTCA TTTCTTCGCG TATTGATGAA CCGGCCTATC AGGGACTGCA AGGCTTTACG GCTGATATTT TGTTGCAGGC CAGTAACGTG CGCTGGCGCG GAATGAATTT TACAGATGTT GCCACGCAAA TGACCAACAA GTCGGGTTTG CTGGAAATTA CTCAACTGCA GGGCAAACTT AACGGTGGAC AGGTTTCACT GCCGGGCACG CTGGACGCGA CATCAATAAA TCCGCGGATA AACTTCCAGC CACGGCTGGA AAACGTTGAG ATTGGCACCA TTCTGAAGGC GTTTAACTAT CCGATTTCGT TGACCGGAAA GATGTCACTG GCTGGTGATT TCTCCGGTGC TGACATAGAT GCCGACGCAT TCCGCCATAA CTGGCAAGGA CAGGCACATG TCGAAATGAC CGACACGCGC ATGGAAGGGA TGAACTTCCA GCAGATGATT CAGCAAGCGG TAGAACGTAA TGGTGGTGAT GTGAAGGCCG CTGAAAACTT CGATAACGTA ACGCGTCTTG ACCGCTTTAC CACCGATTTG ACGTTGAAGG ATGGCGTCGT GACGTTAAAC GACATGCAAG GTCAATCGCC AGTGCTGGCG CTGACAGGGG AAGGCATGTT GAATCTGGCA GATCAAACCT GCGACACCCA GTTTGACATT CGTGTGATTG GTGGTTGGAA CGGGGAAAGC AAACTGATTG ATTTCCTCAA AGAAACGCCA GTACCGCTGC GGGTTTATGG CAACTGGCAG CAACTCAATT ACAGCCTGCA AGTGGATCAG TTACTGCGCA AACATCTACA GGACGAAGCG AAACGTCGCC TGAATGACTG GGCCGAGCGG AATAAAGATT CCCGCAATGG CAAAGATGTG AAGAAGTTGC TGGAGAAGAT GTAA
|
Protein sequence | MRRFLTTLMI LLVVLVAGLS ALVLLVNPND FRDYMVKQVA ARSGYQLQLD GPLRWHVWPQ LSILSGRMSL TAQGASQPLV RADNMRLDVA LLPLLSHQLS VKQVMLKGAV IQLTPQTEAV RSEDAPVAPR DNTLPDLSDD RGWSFDISSL KVADSVLVFQ HEDDEQVTIR NIRLQMEQDP QHRGSFEFSG RVNRDQRDLT ISLNGTVDAS DYPHDLTAAI EQINWQLQGA DLPKQGIQGQ GSFQAQWQES HKRLSFNQIS LTANDSTLSG QAQVTLTEKP EWQLRLQFPQ LNLDNLIPLN ETANGENGAA QQGQSQSTLP RPVISSRIDE PAYQGLQGFT ADILLQASNV RWRGMNFTDV ATQMTNKSGL LEITQLQGKL NGGQVSLPGT LDATSINPRI NFQPRLENVE IGTILKAFNY PISLTGKMSL AGDFSGADID ADAFRHNWQG QAHVEMTDTR MEGMNFQQMI QQAVERNGGD VKAAENFDNV TRLDRFTTDL TLKDGVVTLN DMQGQSPVLA LTGEGMLNLA DQTCDTQFDI RVIGGWNGES KLIDFLKETP VPLRVYGNWQ QLNYSLQVDQ LLRKHLQDEA KRRLNDWAER NKDSRNGKDV KKLLEKM
|
| |