Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3258 |
Symbol | |
ID | 6066834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3565225 |
End bp | 3568227 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641602673 |
Product | outer membrane autotransporter |
Protein accession | YP_001726207 |
Protein GI | 170021253 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.372144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTCCT GGAAAAAGAA ACTTGTAGTA TCACAATTAG CATTGGCTTG CACTCTGGCT ATCACCTCTC AGGCTAATGC AGCGACCAAC GATATTTCTG GTCAAACTTA CAATACTTTC CATCACTACA ACGACGCCAC CTATGCTGAT GACGTTTACT ATGATGGTTA TGTAGGCTGG AACAACTATG CCGCTGATAG CTATTACAAC GGCGATATCT ACCCGGTCAT TAATAACGCT ACCGTTAACG GCGTGATTTC TACCTACTAT CTGGACGACG GTATTTCTAC CAATACCAAC GCCAATAGTC TGACAATCAA AAACAGCACT ATTCACGGTA TGATTTATTC CGAGTGCATG ACAACTGATT GTGCTGATCG TGCGGATGAT TACTACCATG ACCGTCTGGC GCTGACTGTT GATAATTCAA CGATCGACGA CAACTACGAA CACTACACAT ATAACGGCAC TTACAATAAT GCCGCTGATA CTCATGTTGT AGACGTTTAC AACATTGGTA CGGCTATTAC TCTGGATCAG GAAGTTGATC TGTCCATCAC TAATAACTCT CACGTTGCGG GTATTACGTT GACTCAGGGT TATGAGTGGG AAGATATTGA CGACAACACT GTCAGCACTG GCGTAAACAG CAGCGAAGTG TTTAATAACA CCATCACTGT TAAAGATTCT ACCGTAACCT CTGGTTCATG GACTGATGAA GGTACTACTG GTTGGTTCGG TAATACAGGT AATGCCAGCG ATTATAGTGG TAAATCTAAC TTTGTCACCG TAGATACTGA CGGAGATGGT GTTGCTGATA GCACTATTGC AAGCTGGGAT GATGTTGCAT TAGCTGTAGT TGCACATCCG AATGCTGATA ATGCTATGCA GACCACCGCT GACTTTAGTA ACTCTACTTT GATGGGCGAT GTAATCTTCT CTAGCAACTT TGATGAAAAC TTCTTCCCAC GTGGTGCAGA TAGTTATCGC GATGCTGATG GTGAAGTAGA CACCAACGGT TGGGATGGAA CAGATCGCCT GGATCTGACA TTAAATAACG GTAGTAAATG GGTAGGGGCA GCACAATCTG TTCATCAAAC TGGTTCTATT GATGTTGATG GAGATGGTAA AGGTGACATC GCTACATACG GTGTTGGCAC CGAGGCAACT GCAACTCTGA TTGATATTGA GGATAATAGC CTGTGGCCGT TATCAACCGT TGGTGTTGAA AACGATGATA CAAGTTACAG TGAATTCGAT CATATTACTG GTAATCAGGT TTACCAGAGC GGTCTGTTCA ATGTGACCCT GAATACCGGT TCACAGTGGG ATACCACCAA AACTTCTCTG ATTGATACTC TGAGCATCAA CAGTGGTTCA ACTGTTAATG TTGCTGATTC AACGCTGATT TCTGACTCTA TCTCTCTGAC AGGTCTTTCT GCGCTGAACA TCAACGAAGA TGGTCATGTT GCAACTGATT CACTGACTGT CGACAACAGT ACCGTAACCA TTTCTGATGA AGTTTCTGCT GGTTGGGCTG TAGGTGATGC TGCTCTGTAT GCCAACAACA TCAAAGTGAC TAACGACGGT ATTCTGGATG TAGGTAACAC TGCGGCGAAT GCTCTGCAGG TTGATACTCT GAACCTGACC AGCACTACTG ATACCAGTGG TAACATTCAC GCTGGTGTGT TCAACATCGA AAGCAACCGC TTCGTACTTG ATGCAGACCT GACCAACGAC CGTACCAACG ATACTACCAA GTCAAACTAC GGTTATGGCT TAATCGCAAT GAACTCTGAT GGTCACCTGA CCATTAACGG TAACGGCGAT AACGACAACA CTGCTTCTAT CGAAGCTGGT CAGAACGAAG TTGATAACAA CGGTGACCAT GTTGCAGCTG CAACCGGTAA CTACAAAGTT CGTATCGACA ACGCTACTGG TGCTGGTTCT ATCGCTGACT ACAACGGCAA CGAGCTGATC TACGTCAACG ACAAAAATAG CAACGCGACC TTCTCTGCTG CTAACAAAGC TGACCTGGGT GCATACACCT ATCAGGCTGA ACAGCGCGGT AACACCGTTG TTCTGCAACA GATGGAGCTG ACCGACTACG CTAACATGGC GCTGAGCATC CCTTCTGCGA ACACCAATAT CTGGAACCTG GAACAAGACA CCGTTGGTAC TCGTCTGACC AACTCTCGTC ATGGCCTGGC TGATAACGGC GGCGCATGGG TAAGCTACTT CGGTGGTAAC TTCAACGGCG ACAACGGCAC CATCAACTAT GATCAGGATG TTAACGGCAT CATGGTCGGT GTTGATACCA AAATTGACGG TAACAACGCT AAGTGGATCG TCGGTGCGGC TGCAGGCTTC GCTAAAGGTG ACATGAATGA CCGTTCTGGT CAGGTAGATC AAGACAGCCA GACTGCCTAC ATCTACTCTT CTGCTCACTT CGCGAACAAC GTCTTTGTTG ATGGTAGCTT GAGCTACTCT CACTTCAACA ACGACCTGTC TGCAACCATG AGCAACGGTA CTTACGTTGA CGGTAGCACC AACTCCGACG CTTGGGGCTT CGGCTTGAAA GCCGGTTACG ACTTCAAACT GGGTGATGCT GGTTATGTGA CTCCTTACGG CAGCATTTCT GGTCTGTTCC AGTCTGGTGA TGACTACCAG CTGAGCAACG ACATGAAAGT TGACGGTCAG TCTTACGACA GCATGCGTTA TGAACTGGGT GTAGATGCAG GTTATACCTT CACCTACAGC GAAGACCAGG CTCTGACTCC GTACTTCAAA CTGGCTTACG TCTACGACGA CTCTAACAAC GATAACGATG TGAACGGTGA TTCCATCGAT AACGGTACTG AAGGGTCTGC GGTACGTGTT GGTCTGGGTA CTCAGTTCAG CTTCACCAAG AACTTCAGCG CCTATACCGA TGCTAACTAC CTCGGTGGTG GTGACGTAGA TCAAGATTGG TCCGCGAACG TGGGTGTTAA ATATACCTGG TAA
|
Protein sequence | MHSWKKKLVV SQLALACTLA ITSQANAATN DISGQTYNTF HHYNDATYAD DVYYDGYVGW NNYAADSYYN GDIYPVINNA TVNGVISTYY LDDGISTNTN ANSLTIKNST IHGMIYSECM TTDCADRADD YYHDRLALTV DNSTIDDNYE HYTYNGTYNN AADTHVVDVY NIGTAITLDQ EVDLSITNNS HVAGITLTQG YEWEDIDDNT VSTGVNSSEV FNNTITVKDS TVTSGSWTDE GTTGWFGNTG NASDYSGKSN FVTVDTDGDG VADSTIASWD DVALAVVAHP NADNAMQTTA DFSNSTLMGD VIFSSNFDEN FFPRGADSYR DADGEVDTNG WDGTDRLDLT LNNGSKWVGA AQSVHQTGSI DVDGDGKGDI ATYGVGTEAT ATLIDIEDNS LWPLSTVGVE NDDTSYSEFD HITGNQVYQS GLFNVTLNTG SQWDTTKTSL IDTLSINSGS TVNVADSTLI SDSISLTGLS ALNINEDGHV ATDSLTVDNS TVTISDEVSA GWAVGDAALY ANNIKVTNDG ILDVGNTAAN ALQVDTLNLT STTDTSGNIH AGVFNIESNR FVLDADLTND RTNDTTKSNY GYGLIAMNSD GHLTINGNGD NDNTASIEAG QNEVDNNGDH VAAATGNYKV RIDNATGAGS IADYNGNELI YVNDKNSNAT FSAANKADLG AYTYQAEQRG NTVVLQQMEL TDYANMALSI PSANTNIWNL EQDTVGTRLT NSRHGLADNG GAWVSYFGGN FNGDNGTINY DQDVNGIMVG VDTKIDGNNA KWIVGAAAGF AKGDMNDRSG QVDQDSQTAY IYSSAHFANN VFVDGSLSYS HFNNDLSATM SNGTYVDGST NSDAWGFGLK AGYDFKLGDA GYVTPYGSIS GLFQSGDDYQ LSNDMKVDGQ SYDSMRYELG VDAGYTFTYS EDQALTPYFK LAYVYDDSNN DNDVNGDSID NGTEGSAVRV GLGTQFSFTK NFSAYTDANY LGGGDVDQDW SANVGVKYTW
|
| |