Gene ECH74115_B0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_B0104 
Symbol 
ID6966457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011350 
Strand
Start bp64310 
End bp68212 
Gene Length3903 bp 
Protein Length1300 aa 
Translation table11 
GC content44% 
IMG OID643384001 
Productimmunoglobulin A1 protease domain protein 
Protein accessionYP_002268480 
Protein GI209395572 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.270044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAA TATACTCTCT TAAATACAGC CATATTACAG GAGGGTTAAT CGCTGTTTCT 
GAATTATCCG GCAGAGTATC ATCAAGAGCA ACTGGTAAGA AAAAACACAA ACGCATACTT
GCATTATGTT TTTTAGGCTT ATTACAATCC TCATATTCTT TTGCGTCACA GATGGATATT
TCAAATTTCT ACATCCGTGA CTATATGGAT TTTGCACAGA ACAAGGGCAT ATTTCAGGCT
GGCGCAACAA ATATTGAAAT AGTGAAGAAA GATGGCTCCA CCCTGAAACT ACCGGAAGTA
CCATTTCCTG ACTTCTCACC GGTTGCAAAC AAAGGGTCAA CCACATCTAT TGGTGGTGCA
TACAGTATCA CAGCCACACA CAATACGAAA AACCACCACT CAGTTGCGAC GCAAAACTGG
GGGAACAGCA CGTACAAACA AACTGACTGG AATACTTCAC ATCCTGATTT TGCAGTATCC
CGACTTGACA AGTTTGTTGT TGAGACCCGA GGTGCGACTG AAGGCGCAGA TATTTCGTTA
TCAAAACAGC AGGCACTTGA ACGTTACGGG GTTAATTATA AAGGAGAAAA GAAACTTATC
GCATTCAGAG CCGGCTCTGG TGTGGTATCC GTTAAAAAAA ATGGACGCAT AACTCCATTT
AATGAGGTTT CTTATAAGCC AGAAATGTTA AATGGCTCTT TCGTTCACAT TGATGACTGG
AGTGGATGGC TGATATTAAC CAACAACCAG TTTGATGAGT TTAATAACAT TGCCTCTCAG
GGTGACAGCG GTTCAGCACT GTTCGTCTAT GATAACCAAA AGAAAAAGTG GGTTGTCGCT
GGAACTGTCT GGGGGATTTA TAATTACGCC AATGGCAAAA ACCACGCAGC ATACAGTAAA
TGGAACCAGA CAACCATTGA CAACCTGAAG AACAAGTATT CTTACAACGT GGATATGTCA
GGGGCTCAGG TTGCAACCAT TGAAAATGGA AAACTGACAG GCACTGGCTC AGACACCACC
GATATAAAAA ATAAGGACTT AATATTTACT GGCGGTGGAG ATATCCTCCT GAAATCCTCT
TTTGATAATG GTGCTGGCGG TCTTGTCTTT AATGATAAAA AGACCTATCG AGTAAACGGG
GATGATTTCA CCTTTAAAGG TGCCGGTGTT GATACAAGAA ACGGCAGCAC CGTTGAGTGG
AATATCCGGT ATGATAATAA AGACAACCTT CACAAAATTG GTGATGGCAC ATTAGATGTC
CGAAAAACCC AGAACACCAA CCTGAAAACA GGTGAGGGTC TTGTCATTCT TGGAGCTGAA
AAAACATTCA ATAATATCTA CATAACCAGT GGTGATGGAA CTGTCCGACT GAATGCAGAA
AATGCACTGT CTGGCGGTGA ATACAACGGT ATTTTCTTTG CGAAAAATGG CGGAACTCTT
GACCTGAACG GATATAATCA GTCTTTCAAT AAAATTGCTG CAACTGATTC AGGTGCTGTA
ATAACCAATA CGTCAACCAA AAAATCCATT TTATCCCTGA ATAATACTGC TGACTATATC
TATCACGGCA ACATAAACGG GAATCTGGAC GTACTTCAGC ATCATGAGAC GAAAAAAGAG
AACCGTCGTC TTATTCTTGA TGGGGGCGTG GACACAACAA ATGATATAAG CCTGCGTAAT
ACACAACTGT CCATGCAGGG ACATGCCACT GAACATGCCA TTTATCGGGA TGGAGCTTTC
TCTTGTTCAC TACCAGCTCC TATGCGCTTT TTGTGTGGCA GTGATTATGT TGCAGGAATG
CAAAATACAG AAGCTGATGC TGTAAAACAA AACGGAAATG CCTATAAAAC CAACAATGCT
GTCTCTGATT TATCGCAGCC AGACTGGGAA ACCGGAACAT TCAGATTTGG AACGCTACAT
CTTGAAAATT CCGATTTTTC TGTTGGTCGT AATGCAAATG TAATCGGGGA CATTCAGGCC
AGTAAATCAA ACATTACTAT TGGTGACACT ACAGCATATA TTGATTTGCA TGCTGGTAAA
AATATTACCG GTGATGGTTT TGGCTTCCGC CAGAATATTG TGCGTGGAAA CTCACAAGGA
GAAACGCTGT TTACAGGAGG GATCACAGCA GAAGACAGCA CTATCGTTAT TAAAGATAAA
GCAAAAGCAT TATTTTCAAA TTATGTATAC CTGCTGAACA CAAAAGCAAC CATAGAGAAC
GGTGCTGATG TGACAACTCA AAGTGGTATG TTCTCCACGA GCGATATCAG CATCTCTGGT
AATCTGTCCA TGACAGGCAA TCCCGACAAA GACAATAAAT TCGAGCCCTC AATATATCTG
AATGATGCTT CTTATCTACT GACTGACGAC TCCGCCAGAC TCGTTGCCAA AAATAAAGCA
TCTGTGGTGG GAGATATACA CTCCACTAAA AGTGCATCCA TCATGTTTGG TCATGATGAA
AGCGACCTCT CGCAGTTGTC TGACAGAACC TCAAAAGGGC TTGCACTTGG TCTTTTAGGT
GGCTTTGATG TCTCATATCG CGGTTCAGTC AATGCCCCGT CAGCATCTGC CACTATGAAC
AACACCTGGT GGCAACTAAC CGGAGATTCT GCGCTGAAAA CACTGAAAAG TACAAACAGC
ATGGTCTATT TCACTGACAG CGCAAACAAT AAGAAATTCC ATACGCTGAC GGTCGATGAG
CTGGCAACCA GCAACAGCGC CTATGCGATG CGTACAAACC TTTCTGAATC AGACAAACTG
GAGGTCAAAA AACACTTGTC TGGTGAGAAC AATATTTTAC TCGTTGATTT CCTTCAGAAA
CCAACGCCTG AAAAACAACT GAATATTGAA CTGGTAAGCG CGCCAAAAGA CACCAATGAA
AATGTCTTTA AAGCCAGTAA ACAAACCATT GGTTTCAGTG ATGTAACGCC GGTCATTACA
ACCAGGGAAA CCGATGACAA AATAACATGG TCACTGACAG GCTATAACAC GGTAGCAAAC
AAGGAAGCAA CCCGGAATGC CGCCGCCCTG TTCTCTGTTG ACTATAAAGC GTTTCTGAAC
GAGGTCAACA ACCTGAACAA ACGTATGGGT GACCTGCGTG ATATCAACGG CGAAGCCGGT
GCATGGGCAC GCATCATGAG CGGTACCGGC TCTGCCAGTG GTGGTTTCAG TGACAACTAC
ACGCACGTTC AGGTCGGGGT CGACAAAAAA CACGAGCTGG ACGGACTGGA TTTGTTTACC
GGTTTCACTG TCACACACAC TGACAGCAGT GCCTCCGCCG ATGTTTTCAG TGGTAAAACG
AAGTCTGTGG GGGCTGGCCT GTATGCTTCC GCCATGTTTG ATTCCGGTGC CTATATCGAC
CTGATTGGCA AGTATGTTCA CCATGATAAT GAGTACACTG CAACCTTTGC CGGACTCGGA
ACCCGTGATT ACAGCACGCA TTCATGGTAT GCCGGTGCAG AAGCGGGCTA CCGCTATCAT
GTCACTGAGG ATGCCTGGAT TGAGCCACAG GCTGAGCTGG TTTACGGTTC TGTATCCGGT
AAACAGTTTG CATGGAAGGA CCAGGGAATG CATCTGTCCA TGAAGGACAA GGACTACAAT
CCGCTGATTG GCCGAACGGG TGTGGATGTG GGTAAATCCT TCTCTGGTAA GGACTGGAAA
GTGACAGCCC GTGCCGGTCT GGGCTACCAG TTCGACCTGC TGGCTAACGG CGAAACCGTA
TTGCGGGATG CATCTGGTGA AAAACGCATC AAAGGTGAAA AGGACAGCCG TATGCTGATG
TCCGTTGGCC TGAATGCAGA AATCAGGGAT AACGTCCGCT TTGGACTGGA GTTTGAGAAA
TCCGCCTTTG GTAAGTACAA CGTTGATAAT GCTGTCAACG CTAATTTCCG TTACTCGTTC
TGA
 
Protein sequence
MNKIYSLKYS HITGGLIAVS ELSGRVSSRA TGKKKHKRIL ALCFLGLLQS SYSFASQMDI 
SNFYIRDYMD FAQNKGIFQA GATNIEIVKK DGSTLKLPEV PFPDFSPVAN KGSTTSIGGA
YSITATHNTK NHHSVATQNW GNSTYKQTDW NTSHPDFAVS RLDKFVVETR GATEGADISL
SKQQALERYG VNYKGEKKLI AFRAGSGVVS VKKNGRITPF NEVSYKPEML NGSFVHIDDW
SGWLILTNNQ FDEFNNIASQ GDSGSALFVY DNQKKKWVVA GTVWGIYNYA NGKNHAAYSK
WNQTTIDNLK NKYSYNVDMS GAQVATIENG KLTGTGSDTT DIKNKDLIFT GGGDILLKSS
FDNGAGGLVF NDKKTYRVNG DDFTFKGAGV DTRNGSTVEW NIRYDNKDNL HKIGDGTLDV
RKTQNTNLKT GEGLVILGAE KTFNNIYITS GDGTVRLNAE NALSGGEYNG IFFAKNGGTL
DLNGYNQSFN KIAATDSGAV ITNTSTKKSI LSLNNTADYI YHGNINGNLD VLQHHETKKE
NRRLILDGGV DTTNDISLRN TQLSMQGHAT EHAIYRDGAF SCSLPAPMRF LCGSDYVAGM
QNTEADAVKQ NGNAYKTNNA VSDLSQPDWE TGTFRFGTLH LENSDFSVGR NANVIGDIQA
SKSNITIGDT TAYIDLHAGK NITGDGFGFR QNIVRGNSQG ETLFTGGITA EDSTIVIKDK
AKALFSNYVY LLNTKATIEN GADVTTQSGM FSTSDISISG NLSMTGNPDK DNKFEPSIYL
NDASYLLTDD SARLVAKNKA SVVGDIHSTK SASIMFGHDE SDLSQLSDRT SKGLALGLLG
GFDVSYRGSV NAPSASATMN NTWWQLTGDS ALKTLKSTNS MVYFTDSANN KKFHTLTVDE
LATSNSAYAM RTNLSESDKL EVKKHLSGEN NILLVDFLQK PTPEKQLNIE LVSAPKDTNE
NVFKASKQTI GFSDVTPVIT TRETDDKITW SLTGYNTVAN KEATRNAAAL FSVDYKAFLN
EVNNLNKRMG DLRDINGEAG AWARIMSGTG SASGGFSDNY THVQVGVDKK HELDGLDLFT
GFTVTHTDSS ASADVFSGKT KSVGAGLYAS AMFDSGAYID LIGKYVHHDN EYTATFAGLG
TRDYSTHSWY AGAEAGYRYH VTEDAWIEPQ AELVYGSVSG KQFAWKDQGM HLSMKDKDYN
PLIGRTGVDV GKSFSGKDWK VTARAGLGYQ FDLLANGETV LRDASGEKRI KGEKDSRMLM
SVGLNAEIRD NVRFGLEFEK SAFGKYNVDN AVNANFRYSF