Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4892 |
Symbol | flu1 |
ID | 5587671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4879454 |
End bp | 4882300 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640928493 |
Product | antigen 43 |
Protein accession | YP_001465820 |
Protein GI | 157158925 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAC ATCTGAACAC CAGCTACAGG CTGGTATGGA ATCACATTAC GGGCACCCTG GTGGTGGCTT CCGAACTGGC GCGCTCACGG GGAAAACGCA CCGGTGTGGC GGTTGCGCTG TCTCTTGCTG CTGTCACGTC AGTCCCGGTA CTGGCTGCTG ACACGGTCGT ACAGGCGGGA GAAACCGTGA GCGGCGGAAC ACTGACAAAT CATGACAACC AGATTGTCTT CGGTACGGCC AACGGAATGA CCATCAGTAC CGGTCTGGAG TATGGGCCGG ATAACGAGGC CAATACCGGC GGACAATGGA TACAAAATGG CGGTATCGCC AACAACACTA CTGTCACCGG TGGTGGTCTT CAGAGAGTGA ATGCCGGAGG AAGCGTTTCA GACACGGTTA TCAGTGCCGG AGGCGGACAG AGCCTTCAGG GGCAGGCAGT GAACACCACT CTGAACGGCG GTGAGCAGTG GGTACATGAA GGCGGGATTG CAACGGGTAC CGTCATTAAT GAGAAGGGCT GGCAGGCCGT CAAATCCGGC GCAATGGCAA CCGACACGGT TGTGAATACC GGCGCGGAAG GGGGACCGGA TGCAGAAAAT GGTGATACCG GGCAGTTTGT TCGCGGAAAT GCCGTACGTA CCACTATCAA TGAAAATGGT CGTCAGATTG TGGCTGCTGA AGGAACAGCA AATACCACTG TGGTTTATGC CGGCGGCGAC CAGACGGTAC ACGGGCATGC GCTGGATACC ACACTGAATG GCGGTTACCA GTATGTGCAC AACGGAGGCA CAGCCTCTGA CACGGTTGTA AACAGTGACG GCTGGCAGAT TGTCAAGGAA GGTGGTCTGG CGGATTTCAC CACCGTTAAC CAGAAAGGCA AACTGCAGGT GAACGCCGGT GGTACAGCCA CGAATGTCAC CCTGAAGCAG GGAGGCGCAC TGGTCACCAG TACGGCGGCA ACCGTCACCG GCAGCAACCG TCTGGGCAAT TTCACTGTGG AAAACGGTAA TGCTGACGGT GTTGTTCTGG AGTCCGGTGG TCGCCTGGAT GTACTGGAGG GCCATTCAGC CTGGAAAACA CTGGTGGATG ACGGCGGAAC CCTGGCAGTG TCTGCCGGTG GTAAGGCAAC AGATGTCACC ATGACATCCG GTAGTGCCCT GATTGCAGAC AGTGGTGCCA CTGTTGAGGG GACCAATGCC AGCGGTAAGT TCAGTATTGA TGGCACATCC GGTCAGGCCA GCGGACTGCT GCTGGAAAAT GGCGGCAGCT TTACGGTTAA TGCCGGAGGA CTGGCCAGCA ACACCACTGT CGGACATCGT GGAACACTGA CGCTGGCTGC CGGGGGAAGT CTGAGTGGCA GAACACAGCT CAGTAAAGGT GCCAGCATGG TACTGAATGG CGATGTGGTC AGTACCGGCG ATATTGTTAA CGCAGGGGAG ATTCGCTTTG ATAATCAGAC GACACCGGAT GCCGCGCTGA GCCGTGCTGT TGCAAAAGGC GACTCCCCGG TAACGTTCCA TAAACTGACC ACCAGTAACC TCACCGGTCA GGGTGGCACC ATCAATATGC GTGTTCGCCT TGATGGCAGC AATACCTCTG ACCAGCTGGT GATTAATGGT GGTCAGGCAA CCGGCAAAAC CTGGCTTGCG TTTACAAATG TCGGAAACAG TAACCTCGGG GTGGCAACCT CCGGACAGGG TATCCGGGTT GTGGATGCAC AGAATGGCGC CACCACAGAA GAAGGTGCGT TTGCCCTGAG TCGCCCGCTT CAGGCCGGCG CCTTTAACTA CACCCTGAAC CGTGACAGCG ATGAAGACTG GTACCTGCGC AGTGAAAATG CTTATCGTGC TGAAGTCCCC CTGTATACAT CCATGCTGAC ACAGGCAATG GACTATGACC GGATTCTGGC AGGCTCCCGC AGCCATCAGA CCGGTGTAAA CGGTGAAAAT AACAGCGTCC GTCTCAGCAT TCAGGGCGGT CATCTCGGTC ACGATAACAA CGGCGGTATT GCCCGTGGAG CCACGCCGGA AAGCAGCGGC AGCTATGGCT TCGTCCGTCT GGAGGGTGAC CTGCTCAGAA CAGAGGTTGC CGGTATGTCT CTGACGACAG GGGTGCATGG TGCCGCAGGC CATTCTTCCG TTGATGTTAA GGATGATGAC GGTTCCCGCG CCGGCACGGT CCGGGATGAT GCCGGCAGCC TGGGCGGATA CCTGAATCTG ACACACACGT CCTCCGGCCT GTGGGCTGAC ATTGTGGCAC AGGGAACCCG CCACAGCATG AAAGCGTCAT CGGACAATAA CGACTTCCGC GCCCGCGGCT GGGGCTGGCT GGGCTCACTG GAAACCGGTC TGCCCTTCAG TATCACTGAC AACCTGATGC TGGAGCCACA ACTGCAGTAT ACCTGGCAGG GACTTTCCCT GGATGACGGC CAGGATAACG CCGGTTATGT GAAGTTCGGG CATGGCAGTG CACAACATGT GCGTGCCGGT TTCCGTCTGG GCAGCCACAA CGATATGAGC TTTGGTGAAG GCACCTCATC CCGTGACACC CTGCGCGACA GTGCAAAACA CCGTGTGCGT GAACTGCCGG TGAACTGGTG GGTACAGCCT TCTGTTATCC GCACCTTCAG TTCCCGGGGT GACATGAGCA TGGGGACAGC CGCCGCCGGC AGTAACATGA CGTTCTCACC GTCCCGGAAT GGCACGTCAC TGGACCTGCA GGCCGGACTG GAAGCCCGTG TCCGGGAAAA TATCACCCTG GGCGTTCAGG CCGGTTATGC CCACAGTGTC AGCGGCAGCA GCGCTGAAGG TTATAACGGT CAGGCCACGC TGAATGTGAC TTTCTGA
|
Protein sequence | MKRHLNTSYR LVWNHITGTL VVASELARSR GKRTGVAVAL SLAAVTSVPV LAADTVVQAG ETVSGGTLTN HDNQIVFGTA NGMTISTGLE YGPDNEANTG GQWIQNGGIA NNTTVTGGGL QRVNAGGSVS DTVISAGGGQ SLQGQAVNTT LNGGEQWVHE GGIATGTVIN EKGWQAVKSG AMATDTVVNT GAEGGPDAEN GDTGQFVRGN AVRTTINENG RQIVAAEGTA NTTVVYAGGD QTVHGHALDT TLNGGYQYVH NGGTASDTVV NSDGWQIVKE GGLADFTTVN QKGKLQVNAG GTATNVTLKQ GGALVTSTAA TVTGSNRLGN FTVENGNADG VVLESGGRLD VLEGHSAWKT LVDDGGTLAV SAGGKATDVT MTSGSALIAD SGATVEGTNA SGKFSIDGTS GQASGLLLEN GGSFTVNAGG LASNTTVGHR GTLTLAAGGS LSGRTQLSKG ASMVLNGDVV STGDIVNAGE IRFDNQTTPD AALSRAVAKG DSPVTFHKLT TSNLTGQGGT INMRVRLDGS NTSDQLVING GQATGKTWLA FTNVGNSNLG VATSGQGIRV VDAQNGATTE EGAFALSRPL QAGAFNYTLN RDSDEDWYLR SENAYRAEVP LYTSMLTQAM DYDRILAGSR SHQTGVNGEN NSVRLSIQGG HLGHDNNGGI ARGATPESSG SYGFVRLEGD LLRTEVAGMS LTTGVHGAAG HSSVDVKDDD GSRAGTVRDD AGSLGGYLNL THTSSGLWAD IVAQGTRHSM KASSDNNDFR ARGWGWLGSL ETGLPFSITD NLMLEPQLQY TWQGLSLDDG QDNAGYVKFG HGSAQHVRAG FRLGSHNDMS FGEGTSSRDT LRDSAKHRVR ELPVNWWVQP SVIRTFSSRG DMSMGTAAAG SNMTFSPSRN GTSLDLQAGL EARVRENITL GVQAGYAHSV SGSSAEGYNG QATLNVTF
|
| |