Gene Information |
Locus tag | EcHS_A2139 |
Symbol | |
ID | 5595409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2120165 |
End bp | 2121916 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640921272 |
Product | antigen 43, truncation |
Protein accession | YP_001458811 |
Protein GI | 157161493 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | |
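The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(2120165, 2121916)
protein = protein_length_aa(length)
print(length, protein)
```

Under these assumptions the result agrees with the listed gene length of 1752 bp and protein length of 583 aa.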
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACGAC ATCTGAACAC CAGCTACAGG CTGGTATGGA ATCACATTAC GGGCACCCTG GTGGTGGCCT CCGAACTGGC CCGCTCACGG GGAAAACGCG CCGGTGTGGC GGTTGCACTG TCTCTTGCTG CTGTCACATC AGTCCCGGCA CTGGCTGCTG ACACGGTTGT ACAGGCGGGA GAAACCGTGA ACGACGGAAC ACTGACAAAT CATGACAACC AGATTGTCCT CGGTACTGCC AACGGAATGA CCATCAGTAC CGGGCTGGAG TATGGGCCGG ATAACGAAGC CAATACCGGC GGACAATGGA TACAAAATGG CGGTATCGCC AACAACACTA CTGTCACCGG TGGTGGTCTT CAGAGAGTGA ATGCCGGAGG AAGCGTTTCA GACACGGTTA TCAGTGCCGG AGGAGGACAG AGCCTTCAGG GGCAGGCAGT GAACACCACT CTGAACGGCG GTGAGCAGTG GGTACATGAA GGCGGGATTG CAACGGGTAC CGTCATTAAT GAGAAGGGCT GGCAGGCCGT CAAATCCGGC GCAATGGCAA CCGACACGGT TGTGAATACC GGCGCGGAAG GGGGACCGGA TGCAGAAAAT GGTGATACCG GGCAGTTTGT TCGCGGAAAT GCCGTACGTA CCACTATCAA TGAGAATGGT CGTCAGATTG TGGCTGCTGA AGGAACGGCA AATACCACTG TGGTTTATGC CGGCGGCGAC CAGACTGTAC ATGGCTACGC GCTGGATACC ACACTGAACG GCGGTAACCA GTATGTGCAC AACGGCGGTA CAGCGTCTGG CACTGTTGTG AACAGTGACG GCTGGCAGAT TGTCAAGGAA GGTGGTCTGG CGGATTTCAC CATCGTTAAC CAGAAAGGCA AACTGCAGGT GAACGCCGGT GGTACAGCCA CGAATGTCAC CCTGAAGCAG GGAGGCGCAC TGGTCACCAG TACGGCGGCA ACCGTCACCG GCAGCAACCG TCTGGGCAAT TTCACAGTGG AAAACGGTAA TGCTGACGGT GTTGTTCTGG AGTCCGGTGG TCGCCTGGAT GTACTGGAGG GCCATTCAGC CTGGAAAACA CTGGTGGATG ACGGCGGTAC CCTGGCAGTG TCTGCCGGTG GTAAGGCAAC AGATGTCACC ATGACATCCG GTGGTGCCCT GATAGCAGAC AGTGGTGCCA CTGTTGAGGG GACCAATGCC AGCGGTAAGT TCAGTATTGA TGGCATATCC GGTCAGGCCA GCGGCCTGCT ACTGGAAAAT GGCGGCAGCT TTACGGTTAA TGCCGGGGGA CAGGCAGGCA ACACCACTGT CGGACATCGT GGAACACTGA CGCTGGCTGC CGGGGGAAGT CTGAGTGGCA GAACACAGCT CAGTAAAGGC GCCAGTATGG TACTGAATGG TGATGTGGTC AGTACCGGCG ATATTGTTAA CGCAGGAGAG ATTCACTTTG ATAATCAGAC GACACAGGAT GCCGTGCTGA GCCGTGCTGT TGCAAAAGGC GACTCCCCGG TAACGTTCCA TAAACTGACC ACCACCAACC TCACTGGTCA GGGCGGCACC ATCAATATGC GTGTTCGCCT TGATGGCAGC AATACCTCTG ACCAGCTGGT GATTAATGGT GGTCAGGCAA CCGGCAAAAC CTGGCTTGCG TTTACAAATG TCGGAAACAG TAACCTCGGG GTGGCAACCT CCGGACAGGG TACCCTGCAC AGTGACGCCC TGTTTCGGCG CCCGTCTGGT GCAGGAGGGT AA
Protein sequence | MKRHLNTSYR LVWNHITGTL VVASELARSR GKRAGVAVAL SLAAVTSVPA LAADTVVQAG ETVNDGTLTN HDNQIVLGTA NGMTISTGLE YGPDNEANTG GQWIQNGGIA NNTTVTGGGL QRVNAGGSVS DTVISAGGGQ SLQGQAVNTT LNGGEQWVHE GGIATGTVIN EKGWQAVKSG AMATDTVVNT GAEGGPDAEN GDTGQFVRGN AVRTTINENG RQIVAAEGTA NTTVVYAGGD QTVHGYALDT TLNGGNQYVH NGGTASGTVV NSDGWQIVKE GGLADFTIVN QKGKLQVNAG GTATNVTLKQ GGALVTSTAA TVTGSNRLGN FTVENGNADG VVLESGGRLD VLEGHSAWKT LVDDGGTLAV SAGGKATDVT MTSGGALIAD SGATVEGTNA SGKFSIDGIS GQASGLLLEN GGSFTVNAGG QAGNTTVGHR GTLTLAAGGS LSGRTQLSKG ASMVLNGDVV STGDIVNAGE IHFDNQTTQD AVLSRAVAKG DSPVTFHKLT TTNLTGQGGT INMRVRLDGS NTSDQLVING GQATGKTWLA FTNVGNSNLG VATSGQGTLH SDALFRRPSG AGG
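Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any further use. The following is a minimal sketch of that cleanup, together with a GC-content calculation of the kind behind the "GC content" field. The helper names are hypothetical, and the rounding used by the source database is not known. Only the first two blocks of the gene sequence are used here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence in the record above.
first_blocks = "ATGAAACGAC ATCTGAACAC"
seq = clean_sequence(first_blocks)
print(len(seq), gc_percent(seq))
```

Applying the same two helpers to the complete gene sequence should give a length that matches the "Gene Length" field.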