Gene ECH74115_0351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0351 
Symbol 
ID6969469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp357811 
End bp362064 
Gene Length4254 bp 
Protein Length1417 aa 
Translation table11 
GC content52% 
IMG OID643384412 
Productputative invasin 
Protein accessionYP_002268927 
Protein GI209400580 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGTT ATAAAACAGG TCATAAACAA CCACGATTTC GTTATTCAGT TCTGGCCCGC 
TGCGTGGCGT GGGCAAATAT CTCTGTTCAG GTTCTTTTTC CACTCGCTGT CACCTTTACC
CCAGTAATGG CGGCACGTGC GCAGCATGCG GTTCAGCCAC GGTTGAGCAT GGGAAATACT
ACGGTAACTG CTGATAATAA CGTGGAGAAA AATGTCGCGT CGTTTGCCGC AAATGCCGGG
ACATTTTTAA GCAGTCAGCC AGATAGCGAT GCGACACGTA ATTTTATTAC CGGAATGGCC
ACCGCTAAAG CTAACCAGGA AATACAGGAG TGGCTCGGGA AATATGGTAC AGCGCGCGTC
AAACTGAATG TCGATAAAGA TTTCTCGCTG AAGGATTCTT CGCTGGAAAT GCTTTATCCG
ATTTATGATA CGCCGACAAA TATGTTGTTC ACTCAGGGGG CAATACATCG TACAGACGAT
CGTACTCAGT CAAATATTGG TTTTGGCTGG CGTCATTTTT CAGGAAATGA CTGGATGGCG
GGGGTGAACA CCTTTATCGA CCATGATTTA TCCCGTAGTC ATACCCGCAT TGGTGTTGGT
GCGGAATACT GGCGCGATTA TCTGAAACTG AGCGCCAATG GTTATATTCG GGCTTCTGGC
TGGAAAAAAT CGCCGGATAT TGAGGATTAT CAGGAACGCC CGGCGAATGG TTGGGATATC
CGCGCAGAGG GCTATTTACC TGCCTGGCCG CAGCTTGGCG CAAGCCTGAT GTATGAACAG
TATTATGGCG ATGAAGTCGG GCTGTTTGGT AAAGATAAGC GCCAGAAAGA CCCGCATGCT
ATTTCTGCCG AGGTGACCTA TACGCCAGTG CCTCTTCTGA CACTGAGCGC CGGGCATAAG
CAGGGCAAGA GCGGTGAGAA TGACACTCGC TTTGGCCTGG AAGTTAACTA CCGAATTGGC
GAACCTTTGG CGAAACAACT CGATACGGAT AGCATTCGCG AGCGTCGGGT ACTGGCAGGC
AGCCGCTATG ACCTGGTTGA GCGTAATAAC AACATCGTTC TTGAGTACCG CAAATCTGAA
GTGATCCGTA TTGCTCTGCC TGAGCGTATT GAAGGTAAGG GCGGTCAGAC ACTTTCCCTG
GGGCTTGTGG TCAGCAAAGC AACTCACGGA CTGAAAAATG TGCAGTGGGA AGCGCCGTCA
TTACTGGCTG AAGGTGGCAA AATTACCGGT CAGGGTAGTC AGTGGCAAGT AACGCTCCCG
GCTTATCGTC CAGGCAAAGA CAATTATTAT GCGATTTCAG CAGTTGCCTA CGATAACAAA
GGCAATACCT CAAAACGCGT GCAGACAGAG GTGGTCATTA CCGGAGCTGG TATGAGCGCC
GATCGCACGG CGTTAACGCT TGACGGTCAG AGCCGTATTC AAATGCTTGC TAACGGTAAT
GAGCAAAAAC CGCTGGTGCT GTCTCTGCGC GACGCCGAGG GCCAGCCAGT CACGGGCATG
AAAGATCAGA TCAAGACTGA ACTAACTTTC AAACCGGCTG GAAATATTGT GACTCGTTCC
CTGAAGGCCA CTAAATCACA GGCAAAGCCA ACACTGGGTG AGTTCACCGA AACTGAAGCA
GGGGTGTATC AGTCTGTCTT TACTACCGGA ACGCAGTCAG GTGAGGCAAC GATTACTGTT
AGCGTTGATG GCATGAGCAA AACCGTCACT GCAGAACTGC GGGCCACGAT GATGGATGTG
GCAAACTCCA CCCTGAGCGC TAACGAGCCG TCAGGTGACG TGGTTGCTGA TGGTCAGCAA
GCCTATACGT TGACGTTGAC TGCGGTGGAC TCCGAGGGTA ATCCGGTGAC GGGAGAAGCC
AGCCGCTTGC GATTTGTTCC GCAAGACACT AATGGTGTAA CCGTTGGTGC CATTTCGGAA
ATAAAACCAG GCGTTTACAG CGCCGCGGTT TCTTCGACCC GTGCCGGAAA CGTTGTTGTG
CGTGCTTTCA GCGAGCAGTA TCAGCTGGGC ACATTACAAC AAACGCTGAA GTTTGTTGCC
GGGCCGCTTG ATGCAGCACA TTCGTCCATC ACCCTGAATC CTGATAAACC GGTGGTTGGG
GGGACAGTTA CGGCAATCTG GACGGTAAAA GATGCCTATG ACAACCCTGT GACCAGCCTC
ACGCCGGAAG CGCCGTCATT AGCGGGTGCC GCTGCTGAAG GTTCTACGGC ATCGGGCTGG
ACAAATAATG GTGATGGGAC GTGGACTGCG CAGATTACTC TCGGCTCTAC GGCGGGTGAA
TTAGAAGTTA TGCCGAAGCT AAATGGACAG AATGCGGCAG CAAATGCGGC AAAAGTAACC
GTGGTGGCTG ATGCGTTATC TTCAAACCAG TCGAAAGTCT CTGTCGCAGA AGATCACGTA
AAAGCCGGCG AAAGCACAAC CGTGACGCTG GTGGCGAAAG ATGCGCATGG CAACGCTATC
AGTGGTCTTG CGTTGTCGGC AAGTTTGACG GGGACCGCCT CTGAAGGGGC GACCGTTTCC
AGTTGGACCG AAAAAGGTAA CGGTTCCTAT GTTGCTACGT TGACTACAGG TGGAAAGACG
GGCGAGCTTC GCGTCATGCC TCTCTTCAAC GGCCAGCCAG CAGCCACCGA AGCCGCGCAG
TTGACGGTCA TCGCCGGAGA GATGTCATCA GCGAACTCTA CGCTTGTTGC GGACAATAAG
GCTCCGACCG TCAAAACGAC GACGGAACTC ACCTTCACCG TGAAGGATGC GTACGGGAAC
CCGGTCACCG GGCTGAAGCC AGATGCACCA GTGTTTAGCG GTGCCGCCAG CACGGGGAGT
GAGCGTCCTT CAGCAGGAAA CTGGACAGAG AAAGGTAATG GGGTCTACGT GTCGACCTTA
ACGCTGGGAT CTGCCGCGGG TCAGTTGTCT GTGATGCCGC GAGTGAACGG CCAAAATGCC
GTTGCTCAGC CACTGGTGCT GAACGTTGCA GGTGACGCAT CTAAGGCTGA GATTCGTGAT
ATGACAGTGA AGGTTAATAA CCAACTGGCT AATGGACAGT CTGCTAACCA GATAACCCTG
ACCGTTGTGG ACACCTATGG TAACCCGTTG CAGGGGCAGG AAGTTACGCT GACTTTACCG
CAGGGTGTGA CCAGCAAGAC GGGGAATACA GTAACAACTA ATGCGGCAGG TAAAGCGGAC
ATTGAGCTTA TGTCAACGGT TGCGGGAGAA CACAATATTT CCGCTTCGGT GAATGGTGCT
CAGAAGACGG TCACGGTGAA ATTCAACGCG GATGCCAGCA CCGGTCAGGC AAACCTGCAG
GTAGACGCCG CTGCTCAAAA AGTGGCAAAC GGCAAAGATG CCTTTACGCT GACGGCGAAC
GTTGAGGATA AAAATGGTAA CCCTGTTCCA GGGAGCCTGG TGACCTTTAA TCTGCCCCGG
GGTGTCAAGC CGCTTACAGG CGATAATGTC TGGGTGAAAG CCAACGATGA GGGGAAAGCA
GAGTTGCAGG TGGTTTCAGT GACTGCCGGA ACGTATGAGA TCACGGCATC GGCAGGGAAT
AGCCAGCCTT CGAATACGCA GACTATAACG TTTGTAGCCG ATAAGGCTAC CGCAACCGTC
TCCGGTATTG AGGTGATTGG CAACTATGCA CTGGCGGACG GCAATGCCAA ACAGACGTAT
AAAGTTACGG TGACTGATGC CAATAACAAC CTGTTGAAAG ATAGCGAAGT GACGCTGACT
GCCAGCCCGG CAAATTTAGT TCTGACTCCC AATGGGACGG CGAAAACTAA TGAGCAAGGA
CAGGCTATTT TCACCGCCAC GACCACTGTC GCAGCGAAAT ATACACTCAC GGCGAAAGTG
AGTCAGGCCG ACGGTCAGGA ATCGACGAAA ACTGCCGAAT CTAAATTCGT CGCGGATGAT
AAAAATGCAG TACTCACCGC ATCATCTGAT GTGACTTCTC TGGTGGCGGA TGGGATATCG
ACTGCGAAGC TGGAGGTGAC ACTGATGTCG GCAAATAACC CCGTTGGGGG GAATATGTGG
GTCGACATTA AGACGCCAGA AGGGGTGACG GAGAAGGATT ATCAGTTCCT GCCGTCGAAA
AATGACCATT TCGTGAGCGG AAAAATCACG CGTACATTTA GTACCAGCAA GCCTGGTGTC
TATACGTTCA CATTTAACGC CCTGACGTAT GGCGGGTACG AAATGAAGCC AGTGACGGTG
ACCATTACCG CGGTGGATGC CGATACGGCA AAGGGCGAGG AGGCGATGAA CTAA
 
Protein sequence
MSRYKTGHKQ PRFRYSVLAR CVAWANISVQ VLFPLAVTFT PVMAARAQHA VQPRLSMGNT 
TVTADNNVEK NVASFAANAG TFLSSQPDSD ATRNFITGMA TAKANQEIQE WLGKYGTARV
KLNVDKDFSL KDSSLEMLYP IYDTPTNMLF TQGAIHRTDD RTQSNIGFGW RHFSGNDWMA
GVNTFIDHDL SRSHTRIGVG AEYWRDYLKL SANGYIRASG WKKSPDIEDY QERPANGWDI
RAEGYLPAWP QLGASLMYEQ YYGDEVGLFG KDKRQKDPHA ISAEVTYTPV PLLTLSAGHK
QGKSGENDTR FGLEVNYRIG EPLAKQLDTD SIRERRVLAG SRYDLVERNN NIVLEYRKSE
VIRIALPERI EGKGGQTLSL GLVVSKATHG LKNVQWEAPS LLAEGGKITG QGSQWQVTLP
AYRPGKDNYY AISAVAYDNK GNTSKRVQTE VVITGAGMSA DRTALTLDGQ SRIQMLANGN
EQKPLVLSLR DAEGQPVTGM KDQIKTELTF KPAGNIVTRS LKATKSQAKP TLGEFTETEA
GVYQSVFTTG TQSGEATITV SVDGMSKTVT AELRATMMDV ANSTLSANEP SGDVVADGQQ
AYTLTLTAVD SEGNPVTGEA SRLRFVPQDT NGVTVGAISE IKPGVYSAAV SSTRAGNVVV
RAFSEQYQLG TLQQTLKFVA GPLDAAHSSI TLNPDKPVVG GTVTAIWTVK DAYDNPVTSL
TPEAPSLAGA AAEGSTASGW TNNGDGTWTA QITLGSTAGE LEVMPKLNGQ NAAANAAKVT
VVADALSSNQ SKVSVAEDHV KAGESTTVTL VAKDAHGNAI SGLALSASLT GTASEGATVS
SWTEKGNGSY VATLTTGGKT GELRVMPLFN GQPAATEAAQ LTVIAGEMSS ANSTLVADNK
APTVKTTTEL TFTVKDAYGN PVTGLKPDAP VFSGAASTGS ERPSAGNWTE KGNGVYVSTL
TLGSAAGQLS VMPRVNGQNA VAQPLVLNVA GDASKAEIRD MTVKVNNQLA NGQSANQITL
TVVDTYGNPL QGQEVTLTLP QGVTSKTGNT VTTNAAGKAD IELMSTVAGE HNISASVNGA
QKTVTVKFNA DASTGQANLQ VDAAAQKVAN GKDAFTLTAN VEDKNGNPVP GSLVTFNLPR
GVKPLTGDNV WVKANDEGKA ELQVVSVTAG TYEITASAGN SQPSNTQTIT FVADKATATV
SGIEVIGNYA LADGNAKQTY KVTVTDANNN LLKDSEVTLT ASPANLVLTP NGTAKTNEQG
QAIFTATTTV AAKYTLTAKV SQADGQESTK TAESKFVADD KNAVLTASSD VTSLVADGIS
TAKLEVTLMS ANNPVGGNMW VDIKTPEGVT EKDYQFLPSK NDHFVSGKIT RTFSTSKPGV
YTFTFNALTY GGYEMKPVTV TITAVDADTA KGEEAMN