Gene Information |
Locus tag | EcSMS35_2249 |
Symbol | flu1 |
ID | 6143968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2267288 |
End bp | 2270134 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641617125 |
Product | antigen 43 |
Protein accession | YP_001744298 |
Protein GI | 170680707 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
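The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function name `feature_lengths` is hypothetical and used only for illustration.

```python
from typing import Tuple


def feature_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the stop codon encodes no residue
    return gene_len, protein_len


# Coordinates taken from the record above (Start bp, End bp).
gene_len, protein_len = feature_lengths(2267288, 2270134)
print(gene_len, protein_len)  # 2847 948
```

Under these assumptions the result agrees with the Gene Length (2847 bp) and Protein Length (948 aa) rows.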
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAC ATCTGAACAC CAGCTACAGG CTGGTATGGA ATCACATTAC GGGCACCCTG GTGGTGGCTT CCGAACTGGC CCGCTCACGG GGAAAACGCG CCGGTGTGGC GGTTGCGCTG TCTCTTGCTG CTGTCACGCC AGTCCCGGCA CTGGCTGCTG ACACGGTTGT AGAGGCGGGA GAAACCGTGA ACGGCGGAAC ACTGACAAAT CATGACAACC AGATTGTCTT CGGTACGACC AACGGAATGA CCATCAGTAC CGGGCTGGAG TATGGGACGG ATAACGAGGC CAATACCGGC GGGCAATGGG TACAGGATGG CGGCACAGCC AGTAATACCA CCATCAGCAG CGGCGGGCTC CAGTTTGTAG GTGCCGGGGG AAAGGCTACA GACACAATAA TCAATGAAGG AGGCGGACAA AGCCTGAAGG GACTGGCACT GAACACCACG CTGAATGGTG GCGAGCAGTG GATGCATGAA GGTGCGATAG CCACAGGAAC CGTCATTAAT GATAAGGGCT GGCAGGTCGT CAAACCCGGC GCAGTGGCAA CAGACACCGT TGTGAATACC GGCGCAGAAG GAGGACCGGA TGCAGAAAAT GGTGATACCG GGCAGTTTGT TCGCGGAAAT GCCGTACGTA CCACCATCAA TAAAAATGGT CGTCAGATTG TGACTGTTGA AGGAACAGCA AATACCACTG TGGTTTATGC CGGCGGCGAC CAGACGGTAC ATGGCCACGC ACTGGATACC ACGCTGAATG GCGGTAACCA GTATGTACAC AACGGCGGTA CAACGTCTGA CACTGTTGTG AACAGTGACG GCTGGCAGAT TATCAAGGAA GGTGGTCTGG CGGATTTCAC CACCGTTAAC CAGAAAGGCA AACTGCAGGT GAACGCCGGT GGTACAGCCA CGAATGTCAC CCTGAAGCAG GGAGGCGCAC TGGTCACCAG TACGGCGGCA ACCGTCACCG GCAGCAACCG TCTGGGCAAT TTCGCGGTGG AAAACGGTAA GGCTGACGGT GTTGTTCTGG AGTCCGGCGG TCGTCTGGAT GTACTGGAGG GTCATTCAGC GCAGAAAACA CGGGTGGATG ACGGCGGTAC CCTGGCAGTG TCTGCCGGTG GTAAGGCGAC AGGTGTCACC ATGACATCCG GTGGTGCCCT GATTGCAGAC AGTGGTGCCA CTGTTGAGGG GACCAATGCC AGCGGTAAGT TCAGTATTGA TGGCATATCC GGTCAGGCCA GCGGCCTGCT GCTGGAAAAT GGCGGCAGCT TTACGGTTAA TGCCGGGGGA CAGGCTGGCA ACACCACTGT CGGACATCGT GGAACACTGA CGCTGGCTGC CGGGGGAAGT CTGAGTGGCA GAACACAGCT CAGTAAAGGC GCCAGCATGG TACTGAATGG TGATGTGGTC AGTACCGGCG ATATTGTTAA CGCAGGGGAG ATTCGCTTTG ATAATCAGAC GACACAGGAT GCCGTCCTGA GCCGTGCTGT TGCAAAAGGC GACGCCCCGG TAACGTTCCA TAAACTGACC ACCAGTAACC TCACCGGTCA GGGTGGCACC ATCAATATGC GTGTTCGCCT TGATGGCAGC AATGCCTCTG ACCAGCTGGT GATTAATGGT GGTCAGGCAA CCGGCAAAAC CTGGCTTGCG TTCACAAATG TCGGAAACAG TAACCTCGGG GTGGCAACTT CCGGACAGGG TATCCGGGTT GTGGATGCAC AGAATGGTGC CACCACAGAA GAAGGTGCGT TTGCCCTGAG TCGCCCGCTT CAGGCCGGTG CCTTTAACTA CACCCTGAAC 
CGTGACAGCG ATGAAGACTG GTACCTGCGC AGTGAAAATG CTTATCGTGC TGAAGTCCCC CTGTATGCCT CCATGCTGAC ACAGGCAATG GACTATGACC GGATTCTGGC AGGCTCCCGC AGCCATCAGA CCGGTGTAAA CGGTGAAAAT AACAGCGTCC GTCTCAGCAT TCAGGGCGGT CATCTCGGTC ACGATAACAA CGGCGGTATT GCCCGTGGGG CCACGCCGGA AAGCAGCGGC AGCTATGGCT TCGTCCGTCT GGAGAGTGAC CTGCTGAGAA CAGAGGTTGC CGGTATGTCT GTGACCGCGG GGGTATATAG TGCTGCTGGC CATTCTTCCG TTGATGTTAA GGATGATGAC GGCTCCCGCG CCGGCACGGT CCGGGATGAT GCCGGCAGCC TGGGCGGATA CCTGAATCTG GTACACACGT CCTCCGGCCT GTGGGCTGAC ATTATGGCAC AGGGAACCCG CCACAGCATG AAAGCGTCAT CGGACAATAA CGACTTCCGC GCCCGGGGCT GGGGCTGGCT GGGCTCACTG GAAACCGGTC TGCCCTTCAG TATCACTGAC AACCTGATGC TGGAACCACA ACTGCAGTAC ACCTGGCAGG GACTTTCCCT GGATGACGGC CAGGATAACG CCGGTTATGT GAAGTTCGGG CATGGCAGTG CACAACATAT GCGTGCCGGC TTCCGTCTGG GCAGCCACAA CGATATGAGC TTTGGTGAAG GCACCTCATC CCGTGACACC CTGCGCGACA GTGCAAAACA CCGTGTGCGT GAACTGCCGG TGAACTGGTG GGTACAGCCT TCTGTTATCC GCACCTTCAG TTCCCGGGGT GACATGAGCA TGGGGACAGC CGCCGCCGGC AGTAACATGA CGTTCTCACC GTCCCAGAAT GGCACGTCAC TGGACCTGCA GGCCGGACTG GAAGCCCGTG TCCGGGAAAA TATCACCCTG GGCGTTCAGG CCGGTTATGC CCACAGCGTC AGCGGCAGCA GCGCCGAAGG CTATAACGGT CAGGCCACGC TGAATGTGAC TTTCTGA
|
Protein sequence | MKRHLNTSYR LVWNHITGTL VVASELARSR GKRAGVAVAL SLAAVTPVPA LAADTVVEAG ETVNGGTLTN HDNQIVFGTT NGMTISTGLE YGTDNEANTG GQWVQDGGTA SNTTISSGGL QFVGAGGKAT DTIINEGGGQ SLKGLALNTT LNGGEQWMHE GAIATGTVIN DKGWQVVKPG AVATDTVVNT GAEGGPDAEN GDTGQFVRGN AVRTTINKNG RQIVTVEGTA NTTVVYAGGD QTVHGHALDT TLNGGNQYVH NGGTTSDTVV NSDGWQIIKE GGLADFTTVN QKGKLQVNAG GTATNVTLKQ GGALVTSTAA TVTGSNRLGN FAVENGKADG VVLESGGRLD VLEGHSAQKT RVDDGGTLAV SAGGKATGVT MTSGGALIAD SGATVEGTNA SGKFSIDGIS GQASGLLLEN GGSFTVNAGG QAGNTTVGHR GTLTLAAGGS LSGRTQLSKG ASMVLNGDVV STGDIVNAGE IRFDNQTTQD AVLSRAVAKG DAPVTFHKLT TSNLTGQGGT INMRVRLDGS NASDQLVING GQATGKTWLA FTNVGNSNLG VATSGQGIRV VDAQNGATTE EGAFALSRPL QAGAFNYTLN RDSDEDWYLR SENAYRAEVP LYASMLTQAM DYDRILAGSR SHQTGVNGEN NSVRLSIQGG HLGHDNNGGI ARGATPESSG SYGFVRLESD LLRTEVAGMS VTAGVYSAAG HSSVDVKDDD GSRAGTVRDD AGSLGGYLNL VHTSSGLWAD IMAQGTRHSM KASSDNNDFR ARGWGWLGSL ETGLPFSITD NLMLEPQLQY TWQGLSLDDG QDNAGYVKFG HGSAQHMRAG FRLGSHNDMS FGEGTSSRDT LRDSAKHRVR ELPVNWWVQP SVIRTFSSRG DMSMGTAAAG SNMTFSPSQN GTSLDLQAGL EARVRENITL GVQAGYAHSV SGSSAEGYNG QATLNVTF
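The gene sequence above is printed in 10-base blocks, begins with ATG, and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch of how such a listing could be cleaned, translated, and checked for GC content follows. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical; the codon assignments are the ones shared by NCBI translation tables 1 and 11, and alternative start codons are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in TCAG codon order (same for NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
prefix = "ATGAAACGAC ATCTGAACAC CAGCTACAGG"
print(translate(prefix))  # MKRHLNTSYR
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_percent` would be one way to compare against the GC content row, although how the database rounds that value is not stated in the record.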
|
| |