Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1488 |
Symbol | |
ID | 5591261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1488585 |
End bp | 1494440 |
Gene Length | 5856 bp |
Protein Length | 1951 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640920645 |
Product | autotransporter (AT) family porin |
Protein accession | YP_001458201 |
Protein GI | 157160883 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAGGA AAACTCTATT GTCGGCCTGT ATTGCATTAG CTCTGAGTGG TCAGGGTTGG GCGGCAGATA TCACAGAGGT AGAAACCACC ACAGGTGAAA AGAAAAATAC CAATGTGACT TGTCCGGCAG ACCCAGGAAA ACTCAGTCCG GAAGAGCTTA AACGCTTACC CTCTGAATGC TCTCCTTTAG TCGAACAAAA CCTGATGCCA TGGCTTGCCA CAGGCGCTGC TGCGTTAATC ACGGCCTTAG CCGTAGTGGA ACTAAACGAC GATGATGATC ATCATCATCG CAACAATTCT CCACTCCCAC CGACACCCCC TGATGATGAA TCAGACGACA CTCCAGTTCC CCCAAGTCCT GGCGGAGATG AGATAATACC GGACGATCCG GATGATACGC CTACACCTCC CAAACCGATT TCGTTTAATA ATGACGTTAT TCTCGATAAA GCAGAAAAAA CGTTAACTAT TCGCGATTCA GTTTTTACTT ATACCGAGAA TGCTGACGGG ACTATTTCTC TGCAAGATAG CAATGGTCGT AAGGCAACGA TTAATCTTTG GCAGATTGAT GAAGCGAATA ACACTGTTGC CCTTGAAGGG GTGAGCGCAG ATGGCGCAAC GAAGTGGCAA TATAATCATA AAGGTGAGCT TGTTATTACG GGTGATAATG CCACAGTAAA CAACAATGGC AAAACCACCG TTGACGGCAA GGATTCCACC GGTACGGAAA TCAACGGTAA TAACGGGAAA GTGATTCAGG ACGGCGATCT GGATGTCAGC GGCGGCGGTC ACGGTATTGA TATCACCGGT GACAGCGCGA CGGTAGATAA CAAGGGCACC ATGACCGTTA CCGATCCGGA GTCCATCGGT ATCCAGATTG ACGGCGACAA GGCGGTTGTT AATAACGAAG GCGAGAGCAC CATCACCAAC GGCGGCACCG GCACGCAGAT TAACGGTGAC GACGCCACGG CGAATAACAG CGGCAAAACC ACCGTTGACG GCAAGGATTC CACTGGCACG GAAATCAACG GTAATAACGG GAAAGTTATC CAGGACGGCG ATCTGGATGT CAGCGGCGGC GGTCACGGTA TTGATATCAC CGGTGACAGC GCGACGGTAG ATAACAAGGG CACCATGACC GTTACCGATC CGGAGTCCAT CGGTATCCAG ATTGACGGCG ACAAGGCGGT TGTTAATAAC GAAGGCGAGA GCACCATCAC CAACGGTGGC ACCGGCACGC AGATTAACGG TGATGACGCC ACGGCAAACA ACAACGGCAA AACCACCGTT GACGGCAAGG ATTCCACCGG TACGGAAATT GCTGGCAATA ACGGGAAGGT GATTCAGGAC GGCGATCTGG ATGTCAGCGG CGGCGGTCAC GGTATTGATA TCACCGGCGA CAGCGCAACG GTGGATAACA AGGGCACCAT GACCGTCACC GATCCGGAGT CCATCGGTAT CCAGGTTGAC GGCGACCAGG CCATCGTCAA TAACGAAGGC GAGAGCACTA TCACCAATGG CGGCACCGGC ACTCAGATCA ACGGTAACGA CGCCACCGCG AATAACAGTG GAAAAACCAC TGTTGATGGA AAAGATTCCA CGGGTACCAA AATCGCGGGC AATATCGGCA TTGTAAATCT GGATGGTAGC CTGACTGTTA CAGGCGGTGC GCATGGTGTT GAGAACATTG GTGACAACGG CACGGTTAAC AACAAAGGAG ATATTGTTGT TTCCGATACT GGATCGATTG GCGTGCTCAT CAACGGTGAG GGGGCAACAG TATCCAATAC GGGTGATGTT AACGTTAGCA ATGAAGCGAC AGGGTTCAGC ATCACAACCA ACAGTGGGAA GGTTTCGCTG GCAGGCAGTA TGCAGGTTGG CGATTTCTCG ACCGGGGTAG ATCTTAATGG CAACAATAAC AGCGTGACGC TGGCGGCAAA AGATCTAAAA GTGGTCGGGC AGAAAGCGAC GGGCATAAAC GTTTCTGGCG ATGCGAATAC AGTGAATATC ACTGGTAACG TTCTGGTTGA TAAGGATAAA ACCGCAGACA ATGCGGCGGA ATATTTCTTC GATCCATCCG TGGGTATCAA CGTTTACGGC AGTGATAATA ACGTGACGCT GGATGGAAAG TTAACTGTTG TATCAGACAG TGAGGTTACT TCTCGTCAGA GTAATTTATT TGATGGCAGC GCAGAGAAAA CGTCAGGTCT GGTTGTGATT GGCGATGGCA ATACCGTTAA TATGAATGGT GGACTTGAAC TGATTGGAGA GAAAAACGCG CTTGCAGATG GGTCGCAGGT TGCTTCCTTG CGCACAGGAT ATAGTTATAC CAGCGTTATT GTCGTTAGTG GTGAGTCGTC GGTATATCTG AATGGAGATA CGACAATCAG CGGAGAATTC CCTCTGGGGT TTGCCGGGGT TATTCGGGTA CAGGATAAAG CTTTGCTGGA AATTGGCAGT GGCGCTACGC TAACAATGCA GGATATTGAC AGTTTTGAAC ATCATGGGAC AAGAACCCCA GAACTTACTT ATGCTGATTC CGGTGCGAAA ATTGTTAATA AAGGTACTGT TGAAATTCAG AATTTAGGTT TTGCTTTTGT TACTGGTGAA AATACAACAG GTATAAATAG TGGCACGATC TCGTTATTAC AAAATGGTAA AGATCCGGCA CCGTCTCCCA TTGTTTTACT GGCTACTAAC GGAGGGAGCG CCACTAATGC AGGTACGATC ACAGGTAAAG TGACGGAACG ACATAGCGTA TTTAACAAGT ATTCAACGGG CACATCGAAT TCATTTATTT TTAATAACGA TGTCAGTAGC ATAACAGGGT TAGTCGCTCA ATCGAATAGC ACAATTATCA ATACTGACAG CGGCATCATT GATTTGTATG GTCGTGGTAG TGTCGGCATG CTTGCTATAG CAGATTCAAC AGCAGAAAAT CAGGGTAAAA TTACACTGGA TTCTATGTGG GTAGATGCAA ATGACACTAC CGCAATGCGA GATATAGCTA GCAACAGCGC CATTGACTTC GGTACAGGTG TGGGAGTTGG TACTGATAGT TATAGTGGTG CAGGGAAAAA TGCAACAGCA ATTAACCAAT TGGGCGGTGT TATAACTATT TATAACGCCG GCGCAGGTAT GGCGGCCTAT GGCGCCAGCA ATACAGTTAT TAACCAGGGG ACGATTAACC TCGAAAAAAA TGGTAATTAT GACGATAGTC TGGCAGCAAA TACTCTGGTA GGGATGGCTG TTTATGAGCA TGGTACTGCT ATCAACGACC AGACGGGTGT TATCAATATC AATGTTGGTA CTAGTCAGGC GTTTTATAAC GATGGCACAG GAACAATTGT TAACTATGGT ACAATCTGCA CTTTCGGCGT GTGCCAATCG GGGAATGAGT ACAATAACAC AGATGATTTC ACCTCACTGA TCTATACCGG TGGCGATACG ATTACACGAA GCGGAGAAAC TGTAACGCTA AATAATGCCG GAGAAATGAC TGCGCAAATT ACCATGAATG CTGGTGCTGA TAGTTCGTTA GTGAACAACA CCGGAACTAT CAATAAAATC GTGCAGAACG CGGGGGTATT CAATAATAGT GGCAGTGTAA CAGGGCGGAT GATGTCGGCT GGCGGGGTCT TTAATAATCA AACTGACGGG GCGATTATGA GAGGTGCTGC GCTGACAGGT ACTGCAGTGG CAAATAACGA AGGAACCTGG AACCTCGGAA GTAGCAGTGA GGGTAACAAC ACCGGGATGC TGGAAGTTAA TAATAATTCT GCTTTCAATA ACCGCGGCGA GTTTATTCTT GATAACGACA AGAATGCTGT GCACATCAAC CAGTCCGGTA CGCTTTATAA TACCGGTCAC ATGAACATCA GTAATTCTTC CCACAACGGA GCCGTTAATA TGTGGGGCGG AAATGGTCGT TTTATCAATG ACGGAACGAT TGATGTTTCT GCGAAGTCAC TGGTAGTCAG CGCTAATAAT GCCGGCGATC AGAATGCCTT CTTCTGGAAC CAGGATAACG GGGTCATCAA CTTCGATCAC GACAGCGCCA GTGCCGTGAA AGCCACCCAC AGCAACTTTA TTGCCCAGAA TGACGGCATC ATGAACATCA GCGGCACCGG TGCTGTGGCT ATGGAAGGTG ATAAGAACGC GCAGCTGGTT AACAATGGCA CCATCAACCT CGGTACCGCA GGCACTACTG ACACGGATAT GATCGGTATG CAACTCAATG CCAACGCCAC GGCGGATGCG GTGATCGAGA ACAACGGCAC CATCAATATC TTCGCTAATA ACTCGTTTGC ATTTAGCGTA CTGGGTACAG TAGGTCATGT GGTTAACAAC GGCACGGTGG TGATTGCCGA TGGGGTTACG GGTTCTGGCC TGATCAAGCA GGGCGACAGC ATCAATGTTG AAGGTATGAA CGGTAACAAC GGTAATAGCA GCGAAGTGCA TTATGGCGAC TATACGTTGC CGGATGTGCC GAAGCCCAAT ACGGTTAGTG TAACGTCGGG AAGTGATGAG GCTGGTGGCA GCATGAACAA CCTCAACGGC TATGTCGTCG GTACCAACGT TAACGGCAGC GCCGGGAAGC TGAAGGTTAA CAATGCCAGC ATGAACGGCG TGGAGATTAA CACGGGCTTT ACCGCTGGTA CGGCAGACAC CATTGTGAGT TTTGATAACG TAGTGGAAGG TAGCAACCTG ACCGACGCTG ACGCCATCAC CTCAACGTCC GTGGTATGGA CTGCCAAAGG CAGCACCGAT GCCAGCGGTA ACGTTGACGT CACCATGAGC AAAAACGCTT ATACCGATGT GGCAACAGAT GCCTCGGTGA ATGACATCGC GAAAGCACTG GATGCGGGTT ACACCAACAA CGAACTGTTT ACCAGCCTGA ACGTCGGCAC GACTGCTGAA CTGAACAGTG CTCTGAAACA GGTCAGCGGT AGCCAGGCGA CCACGGTATT CCGCGAAGCG CGCGTGTTAA GCAACCGCTT TAGTATGCTG GCAGATGCCG CGCCGAAAGT GGGTAACGGT CTGGCGTTCA ACGTTGTCGC GAAAGGCGAT CCGCGTGCCG AGTTAGGTAA TAATACCGAA TACGACATGC TGGCATTGCG TAAAACTATC GACCTGAGCG AAAGCCAGAC GATGAGTCTG GAGTACGGTA TCGCTCGTCT CGATGGTGAT GGTGCGCAGA AAGCGGGTGA TAATGGCGTT ACAGGCGGTT ATAGCCAGTT CTTTGGCCTG AAACATCAGA TGTCGTTCGA TAACGGCATG AACTGGAATA ACGCCTTGCG TTACGACGTT CACAACCTTG ACAGCAGCCG CTCGATTGCA TTTGGCAACA CGAACAAAAC GGCTGATACC GACGTGAAAC AGCAGTACCT GGAGTTCCGC AGCGAAGGGG CGAAGACTTT CGAACCGAGC GAAGGACTGA AGGTTACGCC ATATGCGGGT GTAAAACTGC GTCACACACT GGAAGGTGGC TATCAGGAGC GCAATGCCGG AGACTTTAAC CTGAATATGA ACAGTGGCAG CGAAACGGCG GTGGACAGCA TCGTCGGGCT GAAACTGGAC TACGCAGGTA AAGACGGCTG GAGCGCTAGC GCTACGCTGG AAGGCGGGCC GAACCTGAGC TACGCGAAGA GCCAGCGTAC GGCAAGCCTG GCAGGCGCAG GTAGTCAGCA CTTTAACGTC GATGACGGTC AGAAGGGCGG CGGCATCAAT AGCCTGACAA GCGTCGGCGT GAAGTACAGC AGCAAAGAAA GTTCGCTGAA TCTGGATGCG TACAACTGGA AAGAGGATGG CATCAGCGAT AAAGGCGTGA TGCTGAACTT CAAGAAAACG TTCTAA
|
Protein sequence | MQRKTLLSAC IALALSGQGW AADITEVETT TGEKKNTNVT CPADPGKLSP EELKRLPSEC SPLVEQNLMP WLATGAAALI TALAVVELND DDDHHHRNNS PLPPTPPDDE SDDTPVPPSP GGDEIIPDDP DDTPTPPKPI SFNNDVILDK AEKTLTIRDS VFTYTENADG TISLQDSNGR KATINLWQID EANNTVALEG VSADGATKWQ YNHKGELVIT GDNATVNNNG KTTVDGKDST GTEINGNNGK VIQDGDLDVS GGGHGIDITG DSATVDNKGT MTVTDPESIG IQIDGDKAVV NNEGESTITN GGTGTQINGD DATANNSGKT TVDGKDSTGT EINGNNGKVI QDGDLDVSGG GHGIDITGDS ATVDNKGTMT VTDPESIGIQ IDGDKAVVNN EGESTITNGG TGTQINGDDA TANNNGKTTV DGKDSTGTEI AGNNGKVIQD GDLDVSGGGH GIDITGDSAT VDNKGTMTVT DPESIGIQVD GDQAIVNNEG ESTITNGGTG TQINGNDATA NNSGKTTVDG KDSTGTKIAG NIGIVNLDGS LTVTGGAHGV ENIGDNGTVN NKGDIVVSDT GSIGVLINGE GATVSNTGDV NVSNEATGFS ITTNSGKVSL AGSMQVGDFS TGVDLNGNNN SVTLAAKDLK VVGQKATGIN VSGDANTVNI TGNVLVDKDK TADNAAEYFF DPSVGINVYG SDNNVTLDGK LTVVSDSEVT SRQSNLFDGS AEKTSGLVVI GDGNTVNMNG GLELIGEKNA LADGSQVASL RTGYSYTSVI VVSGESSVYL NGDTTISGEF PLGFAGVIRV QDKALLEIGS GATLTMQDID SFEHHGTRTP ELTYADSGAK IVNKGTVEIQ NLGFAFVTGE NTTGINSGTI SLLQNGKDPA PSPIVLLATN GGSATNAGTI TGKVTERHSV FNKYSTGTSN SFIFNNDVSS ITGLVAQSNS TIINTDSGII DLYGRGSVGM LAIADSTAEN QGKITLDSMW VDANDTTAMR DIASNSAIDF GTGVGVGTDS YSGAGKNATA INQLGGVITI YNAGAGMAAY GASNTVINQG TINLEKNGNY DDSLAANTLV GMAVYEHGTA INDQTGVINI NVGTSQAFYN DGTGTIVNYG TICTFGVCQS GNEYNNTDDF TSLIYTGGDT ITRSGETVTL NNAGEMTAQI TMNAGADSSL VNNTGTINKI VQNAGVFNNS GSVTGRMMSA GGVFNNQTDG AIMRGAALTG TAVANNEGTW NLGSSSEGNN TGMLEVNNNS AFNNRGEFIL DNDKNAVHIN QSGTLYNTGH MNISNSSHNG AVNMWGGNGR FINDGTIDVS AKSLVVSANN AGDQNAFFWN QDNGVINFDH DSASAVKATH SNFIAQNDGI MNISGTGAVA MEGDKNAQLV NNGTINLGTA GTTDTDMIGM QLNANATADA VIENNGTINI FANNSFAFSV LGTVGHVVNN GTVVIADGVT GSGLIKQGDS INVEGMNGNN GNSSEVHYGD YTLPDVPKPN TVSVTSGSDE AGGSMNNLNG YVVGTNVNGS AGKLKVNNAS MNGVEINTGF TAGTADTIVS FDNVVEGSNL TDADAITSTS VVWTAKGSTD ASGNVDVTMS KNAYTDVATD ASVNDIAKAL DAGYTNNELF TSLNVGTTAE LNSALKQVSG SQATTVFREA RVLSNRFSML ADAAPKVGNG LAFNVVAKGD PRAELGNNTE YDMLALRKTI DLSESQTMSL EYGIARLDGD GAQKAGDNGV TGGYSQFFGL KHQMSFDNGM NWNNALRYDV HNLDSSRSIA FGNTNKTADT DVKQQYLEFR SEGAKTFEPS EGLKVTPYAG VKLRHTLEGG YQERNAGDFN LNMNSGSETA VDSIVGLKLD YAGKDGWSAS ATLEGGPNLS YAKSQRTASL AGAGSQHFNV DDGQKGGGIN SLTSVGVKYS SKESSLNLDA YNWKEDGISD KGVMLNFKKT F
|
| |