Gene Information |
Locus tag | EcHS_A2204 |
Symbol | asmA |
ID | 5593647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2184001 |
End bp | 2185854 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640921334 |
Product | putative assembly protein |
Protein accession | YP_001458873 |
Protein GI | 157161555 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2982] Uncharacterized protein involved in outer membrane biogenesis |
TIGRFAM ID | |
| |
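The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a stop codon; the function names `span_length` and `expected_protein_length` are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 2184001
END_BP = 2185854
GENE_LENGTH_BP = 1854
PROTEIN_LENGTH_AA = 617


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The same check works for any unspliced, non-pseudo CDS record. It would not apply to spliced genes, where the genomic span exceeds the coding length.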
Plasmid Coverage information |
Num covering plasmid clones | 60 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACGAT TTCTGACGAC GCTGATGATA CTCCTGGTCG TGCTGGTGGC CGGGTTATCT GCGTTAGTGT TGCTGGTGAA TCCGAATGAT TTCCGCGACT ATATGGTCAA GCAAGTTGCT GCACGTAGCG GTTATCAATT GCAGCTCGAC GGGCCACTGC GTTGGCACGT CTGGCCGCAG CTTAGTATCC TCTCCGGGCG AATGTCTCTC ACCGCCCAGG GCGCAAGCCA GCCACTGGTT CGCGCCGACA ACATGCGTCT GGACGTGGCG CTTTTACCAC TACTGAGTCA TCAACTGAGC GTTAAGCAGG TGATGCTAAA GGGGGCAGTG ATCCAACTGA CGCCGCAGAC GGAAGCTGTG CGCAGTGAAG ACGCTCCGGT TGCACCGCGC GACAATACCT TGCCGGATCT GTCAGACGAT CGCGGATGGT CGTTTGATAT ATCCAGTCTT AAGGTGGCGG ACAGCGTTCT GGTGTTCCAG CATGAAGATG ACGAGCAGGT GACAATCCGC AATATCCGCC TGCAAATGGA ACAAGATCCC CAACATCGTG GTTCATTTGA GTTCTCCGGG CGGGTTAATC GCGATCAGCG CGATCTCACG ATATCCCTTA ACGGTACGGT GGATGCTTCT GATTATCCGC ATGATTTAAT GGCGGCTATT GAACAAATTA ACTGGCAGTT GCAGGGTGCC GATTTACCAA AACAAGGTAT TCAGGGGCAG GGGAGTTTCC AGGTCCAGTG GCAGGAGTCA CATAAACGCC TTTCATTTAA CCAAATTAGT TTGACCGCCA ATGACAGTAC GCTGAGCGGG CAAGCACAGG TCACGTTGAC AGAGAAACCT GAATGGCAGC TGAGACTGCA ATTCCCGCAA CTGAATCTTG ACAACCTCAT CTCCCTTAAT GAAACAGCGA ATGGTGAGAA CGGTGCCGCG CAGCAAGGGC TGAGCCAATC AACGTTGCCG CGCCCGGTCA TTTCTTCGCG TATTGATGAA CCGGCCTATC AGGGACTGCA AGGCTTTACG GCTGATATTT TGTTGCAGGC CAGTAACGTG CGCTGGCGCG GAATGAATTT TACAGATGTT GCCACGCAAA TGACCAGCAA GTCGGGTTTG CTGGAAATTA CTCAACTGCA GGGCAAACTT AACGGTGGAC AGGTTTCACT GCCGGGCACG TTGGACGCGA CATCAATAAA TCCGCGGATA AACTTCCAGC CACGGCTGGA AAACGTTGAG ATTGGCACCA TTCTGAAGGC GTTTAACTAT CCGATTTCGT TGACCGGAAA GATGTCACTG GCTGGTGATT TCTCCGGTGC TGACATAGAT GCCGACGCAT TCCGCCACAA CTGGCAAGGA CAGGCACATG TCGAAATGAC CGACACACGC ATGGAAGGGA TGAACTTCCA GCAGATGATT CAGCAAGCGG TAGAACGTAA TGGTGGTGAT GTGAAGGCCG CTGAAAACTT CGATAACGTG ACGCGTCTTG ACCGCTTTAT CACCGATTTG ACGTTGAAGG ATGGCGTCGT GACGTTAAAC GACATGCAAG GTCAATCGCC CATGCTGGCG CTTTCCGGGG CTGGTACGTT GAATCTGGCG GAGCAAACCT GCGACACCCA GTTTGATATT CGGGTCGTGG GTGGCTGGAA CGGAGAAAGC AAACTGATTG ATTTCCTCAA AGAAACGCCA GTACCGCTGC GGGTTTATGG CAACTGGCAG CAACTTAATT ACAGCCTGCA AGTGGATCAG TTACTGCGTA AACATCTACA GGATGAAGCA AAACGCCGTC TTAACGACTG GGCCGAGCGG 
AATAAAGATT CCCGCAATGG CAAAGATGTG AAGAAGTTGC TGGAGAAGAT GTAG
|
Protein sequence | MRRFLTTLMI LLVVLVAGLS ALVLLVNPND FRDYMVKQVA ARSGYQLQLD GPLRWHVWPQ LSILSGRMSL TAQGASQPLV RADNMRLDVA LLPLLSHQLS VKQVMLKGAV IQLTPQTEAV RSEDAPVAPR DNTLPDLSDD RGWSFDISSL KVADSVLVFQ HEDDEQVTIR NIRLQMEQDP QHRGSFEFSG RVNRDQRDLT ISLNGTVDAS DYPHDLMAAI EQINWQLQGA DLPKQGIQGQ GSFQVQWQES HKRLSFNQIS LTANDSTLSG QAQVTLTEKP EWQLRLQFPQ LNLDNLISLN ETANGENGAA QQGLSQSTLP RPVISSRIDE PAYQGLQGFT ADILLQASNV RWRGMNFTDV ATQMTSKSGL LEITQLQGKL NGGQVSLPGT LDATSINPRI NFQPRLENVE IGTILKAFNY PISLTGKMSL AGDFSGADID ADAFRHNWQG QAHVEMTDTR MEGMNFQQMI QQAVERNGGD VKAAENFDNV TRLDRFITDL TLKDGVVTLN DMQGQSPMLA LSGAGTLNLA EQTCDTQFDI RVVGGWNGES KLIDFLKETP VPLRVYGNWQ QLNYSLQVDQ LLRKHLQDEA KRRLNDWAER NKDSRNGKDV KKLLEKM
|
| |
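The gene and protein sequences above can be related with a short sketch that translates the nucleotide sequence and computes its GC fraction. It rests on a few assumptions: the amino acid assignments of translation table 11 are taken to be the same as the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG), and the gene sequence is taken to be already in the coding orientation, since it begins with ATG and ends with TAG even though the strand is "-". The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table built from the compact TCAG ordering (assumed to match table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in the record.
first_blocks = "ATGAGACGAT TTCTGACGAC GCTGATGATA"
print(translate(first_blocks))  # MRRFLTTLMI, the first block of the protein sequence
print(round(gc_fraction(first_blocks), 2))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields of the record.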