Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1395 |
Symbol | |
ID | 6967004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1387736 |
End bp | 1390585 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643385369 |
Product | pertactin family protein |
Protein accession | YP_002269864 |
Protein GI | 209399047 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.830364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.238303 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAC ATCTGAACAC CAGCTACAGG CTGGTATGGA ATCACATTAC GGGCACCCTG GTGGTGGCCT CCGAACTGGC CCGCTCACGG GGAAAACGCG CCGGTGTGGC GGTTGCGCTG TCTCTTGCTG CTGTCACATC AGTCCCGGCA CTGGCTGCTG ACAAGGTTGT ACAGGCGGGA GAAACCGTGA ACGATGGAAC ACTGACAAAT CATGACAACC AGATTGTCTT CGGTACGGCC AACGGAATGA CCATCAGTAC CGGGCTGGAA CTGGGGCCGG ACAGTGAAGA AAACACCGGT GGGCAATGGA TACAGAATGG CGGGATAGCC GGAAACACCA CTGTCACCAC AAATGGTCGT CAGGTCGTGC TGGAGGGGGG AACAGCCAGT GATACGGTTA TTCGTGACGG CGGGGGACAG AGCCTGAACG GACTGGCGGT GAACACCACA CTGAATAACA GAGGCGAGCA GTGGGTGCAT GAGGGCGGGG TTGCCACCGG TACAATTATC AACCGCGACG GTTACCAGAG CGTTAAAAGT GGCGGGCTGG CAACAGGAAC CATCATCAAC ACCGGCGCAG AAGGCGGCCC TGATTCTGAC AACTCGTATA CGGGTCAGAA GGTCCAGGGA ACAGCAGAAT CCACCACCAT CAACAAAAAT GGACGGCAGA TTATCTTATT TTCCGGGCTA GCCCGTGACA CTCTCATTTA CGCAGGTGGT GACCAGTCGG TACACGGAAG GGCCCTGAAT ACCACACTGA ATGGCGGTTA CCAATATGTG CACAGGGACG GACTTGCGCT GAACACGGTA ATTAACGAGG GGGGCTGGCA GGTTGTTAAG GCAGGTGGCG CTGCCGGTAA CACCACCATA AATCAGAACG GTGAACTGAG GGTACATGCC GGCGGGGAAG CCACTGCAGT CACCCAGAAC ACGGGCGGTG CACTGGTTAC CAGTACTGCT GCAACTGTCA TCGGCACAAA CCGTCTGGGG AATTTCACGG TGGAAAACGG TAAGGCTGAC GGTGTTGTTC TGGAATCCGG CGGTCGTCTG GATGTACTGG AGAGCCATTC AGCACAGAAT ACCCTAGTGG ATGACGGCGG TACCCTGGCA GTGTCTGCCG GCGGTAAGGC GACAAGTGTC ACCATAACAT CCGGTGGTGC CCTGATTGCA GACAGTGGTG CCACTGTTGA GGGGACCAAT GCCAGCGGTA AGTTCAGTAT TGATGGCACA TCCGGTCAGG CCAGCGGCCT GCTGCTGGAA AATGGCGGCA GCTTTACGGT TAATGCCGGG GGACAGGCTG GCAACACCAC TGTCGGACAT CGTGGAACAC TGACGCTGGC TGCCGGGGGA AGTCTGAGTG GCAGAACACA GCTCAGTAAA GGCGCCAGTA TGGTACTGAA TGGTGATGTG GTCAGTACCG GCGATATTGT TAACGCAGGG GAGATTCGCT TTGATAATCA GACGACACCG AATGCCGCGC TGAGCCGTGC TGTTGCAAAA AGTAACTCCC CGGTAACGTT CCATAAACTG ACCACCACGA ACCTCACCGG CCAGGGCGGC ACCATCAATA TGCGTGTTCG CCTTGATGGC AGCAATGCCT CTGACCAGCT GGTGATTAAT GGTGGTCAGG CAACCGGCAA AACCTGGCTT GCGTTTACAA ATGTCGGAAA CAGCAACCTC GGGGTGGCAA CCACCGGACA GGGTATCCGG GTTGTGGATG CACAGAATGG CGCCACCACA GAAGAAGGTG CGTTTGCCCT GAGTCGCCCG CTTCAGGCCG GCGCCTTTAA CTACACCCTG AACCGTGACA GCGATGAAGA CTGGTACCTG CGCAGTGAAA ATGCTTATCG TGCTGAAGTC CCCCTGTATA CATCCATGTT GACACAGGCA ATGGACTATG ACCGGATTCT GGCAGGCTCC CGCAGCCATC AGACCGGTGT AAACGGTGAA AATAACAGCG TCCGTCTCAG CATTCAGGGC GGTCATCTCG GTCACGATAA CAACGGCGGT ATTGCCCGTG GAGCCACGCC GGAAAGCAGC GGCAGCTATG GCTTCGTCCG TCTGGAGGGT GACCTGCTCA GAACAGAGGT TGCCGGTATG TCTCTGACGA CAGGGGTGTA TGGTGCTGCA GGCCATTCTT CCGTTGATGT TAAGGATGAT GACGGTTCCC GCGCCGGCAC GGTCCGGGAT GATGCCGGCA GTCTGGGCGG ATACCTGAAT CTGGTACACA CATCCTCCGG CCTGTGGGCT GACATTGTGG CCCAGGGAAC CCGTCACAGC ATGAAAGCGT CATCGGACAA TAACGACTTC CGCGCCCGGG GCTGGGGCTG GCTGGGCTCA CTGGAAACCG GTCTGCCCTT CAGTATCACT GACAATCTGA TGCTGGAGCC ACAACTGCAG TACACCTGGC AGGGACTCTC CCTGGATGAC GGCCAGGATA ACGCCGGTTA TGTGAAGTTC GGGCATGGCA GTGCACAACA TGTGCGTGCC GGTTTCCGTC TGGGCAGCCA CAACGATATG ACCTTTGGTG AAGGCACCTC ATCCCGTGAC ACCCTGCGCG ACAGTGCAAA ACACAGTGTG AGTGAACTGC CGGTGAACTG GTGGGTACAG CCTTCTGTTA TCCGCACCTT CAGCTCCCGG GGTGACATGA GCATGGGGAC AGCCGCAGCC GGCAGTAACA TGACGTTCTC ACCGTCCCGG AATGGCACGT CACTGGACCT GCAGGCCGGA CTGGAAGCCC GTATCCGGGA AAATATCACC CTGGGCGTTC AGGCCGGTTA TGCCCACAGC GTCAGCGGCA GCAGCGCTGA AGGCTATAAC GGTCAGGCTA CGCTGAATAT GACTTTCTGA
|
Protein sequence | MKRHLNTSYR LVWNHITGTL VVASELARSR GKRAGVAVAL SLAAVTSVPA LAADKVVQAG ETVNDGTLTN HDNQIVFGTA NGMTISTGLE LGPDSEENTG GQWIQNGGIA GNTTVTTNGR QVVLEGGTAS DTVIRDGGGQ SLNGLAVNTT LNNRGEQWVH EGGVATGTII NRDGYQSVKS GGLATGTIIN TGAEGGPDSD NSYTGQKVQG TAESTTINKN GRQIILFSGL ARDTLIYAGG DQSVHGRALN TTLNGGYQYV HRDGLALNTV INEGGWQVVK AGGAAGNTTI NQNGELRVHA GGEATAVTQN TGGALVTSTA ATVIGTNRLG NFTVENGKAD GVVLESGGRL DVLESHSAQN TLVDDGGTLA VSAGGKATSV TITSGGALIA DSGATVEGTN ASGKFSIDGT SGQASGLLLE NGGSFTVNAG GQAGNTTVGH RGTLTLAAGG SLSGRTQLSK GASMVLNGDV VSTGDIVNAG EIRFDNQTTP NAALSRAVAK SNSPVTFHKL TTTNLTGQGG TINMRVRLDG SNASDQLVIN GGQATGKTWL AFTNVGNSNL GVATTGQGIR VVDAQNGATT EEGAFALSRP LQAGAFNYTL NRDSDEDWYL RSENAYRAEV PLYTSMLTQA MDYDRILAGS RSHQTGVNGE NNSVRLSIQG GHLGHDNNGG IARGATPESS GSYGFVRLEG DLLRTEVAGM SLTTGVYGAA GHSSVDVKDD DGSRAGTVRD DAGSLGGYLN LVHTSSGLWA DIVAQGTRHS MKASSDNNDF RARGWGWLGS LETGLPFSIT DNLMLEPQLQ YTWQGLSLDD GQDNAGYVKF GHGSAQHVRA GFRLGSHNDM TFGEGTSSRD TLRDSAKHSV SELPVNWWVQ PSVIRTFSSR GDMSMGTAAA GSNMTFSPSR NGTSLDLQAG LEARIRENIT LGVQAGYAHS VSGSSAEGYN GQATLNMTF
|
| |