Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_B0104 |
Symbol | |
ID | 6966457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011350 |
Strand | + |
Start bp | 64310 |
End bp | 68212 |
Gene Length | 3903 bp |
Protein Length | 1300 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643384001 |
Product | immunoglobulin A1 protease domain protein |
Protein accession | YP_002268480 |
Protein GI | 209395572 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.270044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA TATACTCTCT TAAATACAGC CATATTACAG GAGGGTTAAT CGCTGTTTCT GAATTATCCG GCAGAGTATC ATCAAGAGCA ACTGGTAAGA AAAAACACAA ACGCATACTT GCATTATGTT TTTTAGGCTT ATTACAATCC TCATATTCTT TTGCGTCACA GATGGATATT TCAAATTTCT ACATCCGTGA CTATATGGAT TTTGCACAGA ACAAGGGCAT ATTTCAGGCT GGCGCAACAA ATATTGAAAT AGTGAAGAAA GATGGCTCCA CCCTGAAACT ACCGGAAGTA CCATTTCCTG ACTTCTCACC GGTTGCAAAC AAAGGGTCAA CCACATCTAT TGGTGGTGCA TACAGTATCA CAGCCACACA CAATACGAAA AACCACCACT CAGTTGCGAC GCAAAACTGG GGGAACAGCA CGTACAAACA AACTGACTGG AATACTTCAC ATCCTGATTT TGCAGTATCC CGACTTGACA AGTTTGTTGT TGAGACCCGA GGTGCGACTG AAGGCGCAGA TATTTCGTTA TCAAAACAGC AGGCACTTGA ACGTTACGGG GTTAATTATA AAGGAGAAAA GAAACTTATC GCATTCAGAG CCGGCTCTGG TGTGGTATCC GTTAAAAAAA ATGGACGCAT AACTCCATTT AATGAGGTTT CTTATAAGCC AGAAATGTTA AATGGCTCTT TCGTTCACAT TGATGACTGG AGTGGATGGC TGATATTAAC CAACAACCAG TTTGATGAGT TTAATAACAT TGCCTCTCAG GGTGACAGCG GTTCAGCACT GTTCGTCTAT GATAACCAAA AGAAAAAGTG GGTTGTCGCT GGAACTGTCT GGGGGATTTA TAATTACGCC AATGGCAAAA ACCACGCAGC ATACAGTAAA TGGAACCAGA CAACCATTGA CAACCTGAAG AACAAGTATT CTTACAACGT GGATATGTCA GGGGCTCAGG TTGCAACCAT TGAAAATGGA AAACTGACAG GCACTGGCTC AGACACCACC GATATAAAAA ATAAGGACTT AATATTTACT GGCGGTGGAG ATATCCTCCT GAAATCCTCT TTTGATAATG GTGCTGGCGG TCTTGTCTTT AATGATAAAA AGACCTATCG AGTAAACGGG GATGATTTCA CCTTTAAAGG TGCCGGTGTT GATACAAGAA ACGGCAGCAC CGTTGAGTGG AATATCCGGT ATGATAATAA AGACAACCTT CACAAAATTG GTGATGGCAC ATTAGATGTC CGAAAAACCC AGAACACCAA CCTGAAAACA GGTGAGGGTC TTGTCATTCT TGGAGCTGAA AAAACATTCA ATAATATCTA CATAACCAGT GGTGATGGAA CTGTCCGACT GAATGCAGAA AATGCACTGT CTGGCGGTGA ATACAACGGT ATTTTCTTTG CGAAAAATGG CGGAACTCTT GACCTGAACG GATATAATCA GTCTTTCAAT AAAATTGCTG CAACTGATTC AGGTGCTGTA ATAACCAATA CGTCAACCAA AAAATCCATT TTATCCCTGA ATAATACTGC TGACTATATC TATCACGGCA ACATAAACGG GAATCTGGAC GTACTTCAGC ATCATGAGAC GAAAAAAGAG AACCGTCGTC TTATTCTTGA TGGGGGCGTG GACACAACAA ATGATATAAG CCTGCGTAAT ACACAACTGT CCATGCAGGG ACATGCCACT GAACATGCCA TTTATCGGGA TGGAGCTTTC TCTTGTTCAC TACCAGCTCC TATGCGCTTT TTGTGTGGCA GTGATTATGT TGCAGGAATG CAAAATACAG AAGCTGATGC TGTAAAACAA AACGGAAATG CCTATAAAAC CAACAATGCT GTCTCTGATT TATCGCAGCC AGACTGGGAA ACCGGAACAT TCAGATTTGG AACGCTACAT CTTGAAAATT CCGATTTTTC TGTTGGTCGT AATGCAAATG TAATCGGGGA CATTCAGGCC AGTAAATCAA ACATTACTAT TGGTGACACT ACAGCATATA TTGATTTGCA TGCTGGTAAA AATATTACCG GTGATGGTTT TGGCTTCCGC CAGAATATTG TGCGTGGAAA CTCACAAGGA GAAACGCTGT TTACAGGAGG GATCACAGCA GAAGACAGCA CTATCGTTAT TAAAGATAAA GCAAAAGCAT TATTTTCAAA TTATGTATAC CTGCTGAACA CAAAAGCAAC CATAGAGAAC GGTGCTGATG TGACAACTCA AAGTGGTATG TTCTCCACGA GCGATATCAG CATCTCTGGT AATCTGTCCA TGACAGGCAA TCCCGACAAA GACAATAAAT TCGAGCCCTC AATATATCTG AATGATGCTT CTTATCTACT GACTGACGAC TCCGCCAGAC TCGTTGCCAA AAATAAAGCA TCTGTGGTGG GAGATATACA CTCCACTAAA AGTGCATCCA TCATGTTTGG TCATGATGAA AGCGACCTCT CGCAGTTGTC TGACAGAACC TCAAAAGGGC TTGCACTTGG TCTTTTAGGT GGCTTTGATG TCTCATATCG CGGTTCAGTC AATGCCCCGT CAGCATCTGC CACTATGAAC AACACCTGGT GGCAACTAAC CGGAGATTCT GCGCTGAAAA CACTGAAAAG TACAAACAGC ATGGTCTATT TCACTGACAG CGCAAACAAT AAGAAATTCC ATACGCTGAC GGTCGATGAG CTGGCAACCA GCAACAGCGC CTATGCGATG CGTACAAACC TTTCTGAATC AGACAAACTG GAGGTCAAAA AACACTTGTC TGGTGAGAAC AATATTTTAC TCGTTGATTT CCTTCAGAAA CCAACGCCTG AAAAACAACT GAATATTGAA CTGGTAAGCG CGCCAAAAGA CACCAATGAA AATGTCTTTA AAGCCAGTAA ACAAACCATT GGTTTCAGTG ATGTAACGCC GGTCATTACA ACCAGGGAAA CCGATGACAA AATAACATGG TCACTGACAG GCTATAACAC GGTAGCAAAC AAGGAAGCAA CCCGGAATGC CGCCGCCCTG TTCTCTGTTG ACTATAAAGC GTTTCTGAAC GAGGTCAACA ACCTGAACAA ACGTATGGGT GACCTGCGTG ATATCAACGG CGAAGCCGGT GCATGGGCAC GCATCATGAG CGGTACCGGC TCTGCCAGTG GTGGTTTCAG TGACAACTAC ACGCACGTTC AGGTCGGGGT CGACAAAAAA CACGAGCTGG ACGGACTGGA TTTGTTTACC GGTTTCACTG TCACACACAC TGACAGCAGT GCCTCCGCCG ATGTTTTCAG TGGTAAAACG AAGTCTGTGG GGGCTGGCCT GTATGCTTCC GCCATGTTTG ATTCCGGTGC CTATATCGAC CTGATTGGCA AGTATGTTCA CCATGATAAT GAGTACACTG CAACCTTTGC CGGACTCGGA ACCCGTGATT ACAGCACGCA TTCATGGTAT GCCGGTGCAG AAGCGGGCTA CCGCTATCAT GTCACTGAGG ATGCCTGGAT TGAGCCACAG GCTGAGCTGG TTTACGGTTC TGTATCCGGT AAACAGTTTG CATGGAAGGA CCAGGGAATG CATCTGTCCA TGAAGGACAA GGACTACAAT CCGCTGATTG GCCGAACGGG TGTGGATGTG GGTAAATCCT TCTCTGGTAA GGACTGGAAA GTGACAGCCC GTGCCGGTCT GGGCTACCAG TTCGACCTGC TGGCTAACGG CGAAACCGTA TTGCGGGATG CATCTGGTGA AAAACGCATC AAAGGTGAAA AGGACAGCCG TATGCTGATG TCCGTTGGCC TGAATGCAGA AATCAGGGAT AACGTCCGCT TTGGACTGGA GTTTGAGAAA TCCGCCTTTG GTAAGTACAA CGTTGATAAT GCTGTCAACG CTAATTTCCG TTACTCGTTC TGA
|
Protein sequence | MNKIYSLKYS HITGGLIAVS ELSGRVSSRA TGKKKHKRIL ALCFLGLLQS SYSFASQMDI SNFYIRDYMD FAQNKGIFQA GATNIEIVKK DGSTLKLPEV PFPDFSPVAN KGSTTSIGGA YSITATHNTK NHHSVATQNW GNSTYKQTDW NTSHPDFAVS RLDKFVVETR GATEGADISL SKQQALERYG VNYKGEKKLI AFRAGSGVVS VKKNGRITPF NEVSYKPEML NGSFVHIDDW SGWLILTNNQ FDEFNNIASQ GDSGSALFVY DNQKKKWVVA GTVWGIYNYA NGKNHAAYSK WNQTTIDNLK NKYSYNVDMS GAQVATIENG KLTGTGSDTT DIKNKDLIFT GGGDILLKSS FDNGAGGLVF NDKKTYRVNG DDFTFKGAGV DTRNGSTVEW NIRYDNKDNL HKIGDGTLDV RKTQNTNLKT GEGLVILGAE KTFNNIYITS GDGTVRLNAE NALSGGEYNG IFFAKNGGTL DLNGYNQSFN KIAATDSGAV ITNTSTKKSI LSLNNTADYI YHGNINGNLD VLQHHETKKE NRRLILDGGV DTTNDISLRN TQLSMQGHAT EHAIYRDGAF SCSLPAPMRF LCGSDYVAGM QNTEADAVKQ NGNAYKTNNA VSDLSQPDWE TGTFRFGTLH LENSDFSVGR NANVIGDIQA SKSNITIGDT TAYIDLHAGK NITGDGFGFR QNIVRGNSQG ETLFTGGITA EDSTIVIKDK AKALFSNYVY LLNTKATIEN GADVTTQSGM FSTSDISISG NLSMTGNPDK DNKFEPSIYL NDASYLLTDD SARLVAKNKA SVVGDIHSTK SASIMFGHDE SDLSQLSDRT SKGLALGLLG GFDVSYRGSV NAPSASATMN NTWWQLTGDS ALKTLKSTNS MVYFTDSANN KKFHTLTVDE LATSNSAYAM RTNLSESDKL EVKKHLSGEN NILLVDFLQK PTPEKQLNIE LVSAPKDTNE NVFKASKQTI GFSDVTPVIT TRETDDKITW SLTGYNTVAN KEATRNAAAL FSVDYKAFLN EVNNLNKRMG DLRDINGEAG AWARIMSGTG SASGGFSDNY THVQVGVDKK HELDGLDLFT GFTVTHTDSS ASADVFSGKT KSVGAGLYAS AMFDSGAYID LIGKYVHHDN EYTATFAGLG TRDYSTHSWY AGAEAGYRYH VTEDAWIEPQ AELVYGSVSG KQFAWKDQGM HLSMKDKDYN PLIGRTGVDV GKSFSGKDWK VTARAGLGYQ FDLLANGETV LRDASGEKRI KGEKDSRMLM SVGLNAEIRD NVRFGLEFEK SAFGKYNVDN AVNANFRYSF
|
| |