Gene Information |
Locus tag | ECH74115_0351 |
Symbol | |
ID | 6969469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 357811 |
End bp | 362064 |
Gene Length | 4254 bp |
Protein Length | 1417 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384412 |
Product | putative invasin |
Protein accession | YP_002268927 |
Protein GI | 209400580 |
COG category | |
COG ID | |
TIGRFAM ID | |
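The start, end, gene length, and protein length fields above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
START_BP, END_BP = 357811, 362064

length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 4254 1417
```

Both results match the Gene Length (4254 bp) and Protein Length (1417 aa) fields of this record.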
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACGTT ATAAAACAGG TCATAAACAA CCACGATTTC GTTATTCAGT TCTGGCCCGC TGCGTGGCGT GGGCAAATAT CTCTGTTCAG GTTCTTTTTC CACTCGCTGT CACCTTTACC CCAGTAATGG CGGCACGTGC GCAGCATGCG GTTCAGCCAC GGTTGAGCAT GGGAAATACT ACGGTAACTG CTGATAATAA CGTGGAGAAA AATGTCGCGT CGTTTGCCGC AAATGCCGGG ACATTTTTAA GCAGTCAGCC AGATAGCGAT GCGACACGTA ATTTTATTAC CGGAATGGCC ACCGCTAAAG CTAACCAGGA AATACAGGAG TGGCTCGGGA AATATGGTAC AGCGCGCGTC AAACTGAATG TCGATAAAGA TTTCTCGCTG AAGGATTCTT CGCTGGAAAT GCTTTATCCG ATTTATGATA CGCCGACAAA TATGTTGTTC ACTCAGGGGG CAATACATCG TACAGACGAT CGTACTCAGT CAAATATTGG TTTTGGCTGG CGTCATTTTT CAGGAAATGA CTGGATGGCG GGGGTGAACA CCTTTATCGA CCATGATTTA TCCCGTAGTC ATACCCGCAT TGGTGTTGGT GCGGAATACT GGCGCGATTA TCTGAAACTG AGCGCCAATG GTTATATTCG GGCTTCTGGC TGGAAAAAAT CGCCGGATAT TGAGGATTAT CAGGAACGCC CGGCGAATGG TTGGGATATC CGCGCAGAGG GCTATTTACC TGCCTGGCCG CAGCTTGGCG CAAGCCTGAT GTATGAACAG TATTATGGCG ATGAAGTCGG GCTGTTTGGT AAAGATAAGC GCCAGAAAGA CCCGCATGCT ATTTCTGCCG AGGTGACCTA TACGCCAGTG CCTCTTCTGA CACTGAGCGC CGGGCATAAG CAGGGCAAGA GCGGTGAGAA TGACACTCGC TTTGGCCTGG AAGTTAACTA CCGAATTGGC GAACCTTTGG CGAAACAACT CGATACGGAT AGCATTCGCG AGCGTCGGGT ACTGGCAGGC AGCCGCTATG ACCTGGTTGA GCGTAATAAC AACATCGTTC TTGAGTACCG CAAATCTGAA GTGATCCGTA TTGCTCTGCC TGAGCGTATT GAAGGTAAGG GCGGTCAGAC ACTTTCCCTG GGGCTTGTGG TCAGCAAAGC AACTCACGGA CTGAAAAATG TGCAGTGGGA AGCGCCGTCA TTACTGGCTG AAGGTGGCAA AATTACCGGT CAGGGTAGTC AGTGGCAAGT AACGCTCCCG GCTTATCGTC CAGGCAAAGA CAATTATTAT GCGATTTCAG CAGTTGCCTA CGATAACAAA GGCAATACCT CAAAACGCGT GCAGACAGAG GTGGTCATTA CCGGAGCTGG TATGAGCGCC GATCGCACGG CGTTAACGCT TGACGGTCAG AGCCGTATTC AAATGCTTGC TAACGGTAAT GAGCAAAAAC CGCTGGTGCT GTCTCTGCGC GACGCCGAGG GCCAGCCAGT CACGGGCATG AAAGATCAGA TCAAGACTGA ACTAACTTTC AAACCGGCTG GAAATATTGT GACTCGTTCC CTGAAGGCCA CTAAATCACA GGCAAAGCCA ACACTGGGTG AGTTCACCGA AACTGAAGCA GGGGTGTATC AGTCTGTCTT TACTACCGGA ACGCAGTCAG GTGAGGCAAC GATTACTGTT AGCGTTGATG GCATGAGCAA AACCGTCACT GCAGAACTGC GGGCCACGAT GATGGATGTG GCAAACTCCA CCCTGAGCGC TAACGAGCCG TCAGGTGACG TGGTTGCTGA TGGTCAGCAA 
GCCTATACGT TGACGTTGAC TGCGGTGGAC TCCGAGGGTA ATCCGGTGAC GGGAGAAGCC AGCCGCTTGC GATTTGTTCC GCAAGACACT AATGGTGTAA CCGTTGGTGC CATTTCGGAA ATAAAACCAG GCGTTTACAG CGCCGCGGTT TCTTCGACCC GTGCCGGAAA CGTTGTTGTG CGTGCTTTCA GCGAGCAGTA TCAGCTGGGC ACATTACAAC AAACGCTGAA GTTTGTTGCC GGGCCGCTTG ATGCAGCACA TTCGTCCATC ACCCTGAATC CTGATAAACC GGTGGTTGGG GGGACAGTTA CGGCAATCTG GACGGTAAAA GATGCCTATG ACAACCCTGT GACCAGCCTC ACGCCGGAAG CGCCGTCATT AGCGGGTGCC GCTGCTGAAG GTTCTACGGC ATCGGGCTGG ACAAATAATG GTGATGGGAC GTGGACTGCG CAGATTACTC TCGGCTCTAC GGCGGGTGAA TTAGAAGTTA TGCCGAAGCT AAATGGACAG AATGCGGCAG CAAATGCGGC AAAAGTAACC GTGGTGGCTG ATGCGTTATC TTCAAACCAG TCGAAAGTCT CTGTCGCAGA AGATCACGTA AAAGCCGGCG AAAGCACAAC CGTGACGCTG GTGGCGAAAG ATGCGCATGG CAACGCTATC AGTGGTCTTG CGTTGTCGGC AAGTTTGACG GGGACCGCCT CTGAAGGGGC GACCGTTTCC AGTTGGACCG AAAAAGGTAA CGGTTCCTAT GTTGCTACGT TGACTACAGG TGGAAAGACG GGCGAGCTTC GCGTCATGCC TCTCTTCAAC GGCCAGCCAG CAGCCACCGA AGCCGCGCAG TTGACGGTCA TCGCCGGAGA GATGTCATCA GCGAACTCTA CGCTTGTTGC GGACAATAAG GCTCCGACCG TCAAAACGAC GACGGAACTC ACCTTCACCG TGAAGGATGC GTACGGGAAC CCGGTCACCG GGCTGAAGCC AGATGCACCA GTGTTTAGCG GTGCCGCCAG CACGGGGAGT GAGCGTCCTT CAGCAGGAAA CTGGACAGAG AAAGGTAATG GGGTCTACGT GTCGACCTTA ACGCTGGGAT CTGCCGCGGG TCAGTTGTCT GTGATGCCGC GAGTGAACGG CCAAAATGCC GTTGCTCAGC CACTGGTGCT GAACGTTGCA GGTGACGCAT CTAAGGCTGA GATTCGTGAT ATGACAGTGA AGGTTAATAA CCAACTGGCT AATGGACAGT CTGCTAACCA GATAACCCTG ACCGTTGTGG ACACCTATGG TAACCCGTTG CAGGGGCAGG AAGTTACGCT GACTTTACCG CAGGGTGTGA CCAGCAAGAC GGGGAATACA GTAACAACTA ATGCGGCAGG TAAAGCGGAC ATTGAGCTTA TGTCAACGGT TGCGGGAGAA CACAATATTT CCGCTTCGGT GAATGGTGCT CAGAAGACGG TCACGGTGAA ATTCAACGCG GATGCCAGCA CCGGTCAGGC AAACCTGCAG GTAGACGCCG CTGCTCAAAA AGTGGCAAAC GGCAAAGATG CCTTTACGCT GACGGCGAAC GTTGAGGATA AAAATGGTAA CCCTGTTCCA GGGAGCCTGG TGACCTTTAA TCTGCCCCGG GGTGTCAAGC CGCTTACAGG CGATAATGTC TGGGTGAAAG CCAACGATGA GGGGAAAGCA GAGTTGCAGG TGGTTTCAGT GACTGCCGGA ACGTATGAGA TCACGGCATC GGCAGGGAAT AGCCAGCCTT CGAATACGCA GACTATAACG TTTGTAGCCG ATAAGGCTAC CGCAACCGTC TCCGGTATTG 
AGGTGATTGG CAACTATGCA CTGGCGGACG GCAATGCCAA ACAGACGTAT AAAGTTACGG TGACTGATGC CAATAACAAC CTGTTGAAAG ATAGCGAAGT GACGCTGACT GCCAGCCCGG CAAATTTAGT TCTGACTCCC AATGGGACGG CGAAAACTAA TGAGCAAGGA CAGGCTATTT TCACCGCCAC GACCACTGTC GCAGCGAAAT ATACACTCAC GGCGAAAGTG AGTCAGGCCG ACGGTCAGGA ATCGACGAAA ACTGCCGAAT CTAAATTCGT CGCGGATGAT AAAAATGCAG TACTCACCGC ATCATCTGAT GTGACTTCTC TGGTGGCGGA TGGGATATCG ACTGCGAAGC TGGAGGTGAC ACTGATGTCG GCAAATAACC CCGTTGGGGG GAATATGTGG GTCGACATTA AGACGCCAGA AGGGGTGACG GAGAAGGATT ATCAGTTCCT GCCGTCGAAA AATGACCATT TCGTGAGCGG AAAAATCACG CGTACATTTA GTACCAGCAA GCCTGGTGTC TATACGTTCA CATTTAACGC CCTGACGTAT GGCGGGTACG AAATGAAGCC AGTGACGGTG ACCATTACCG CGGTGGATGC CGATACGGCA AAGGGCGAGG AGGCGATGAA CTAA
|
Protein sequence | MSRYKTGHKQ PRFRYSVLAR CVAWANISVQ VLFPLAVTFT PVMAARAQHA VQPRLSMGNT TVTADNNVEK NVASFAANAG TFLSSQPDSD ATRNFITGMA TAKANQEIQE WLGKYGTARV KLNVDKDFSL KDSSLEMLYP IYDTPTNMLF TQGAIHRTDD RTQSNIGFGW RHFSGNDWMA GVNTFIDHDL SRSHTRIGVG AEYWRDYLKL SANGYIRASG WKKSPDIEDY QERPANGWDI RAEGYLPAWP QLGASLMYEQ YYGDEVGLFG KDKRQKDPHA ISAEVTYTPV PLLTLSAGHK QGKSGENDTR FGLEVNYRIG EPLAKQLDTD SIRERRVLAG SRYDLVERNN NIVLEYRKSE VIRIALPERI EGKGGQTLSL GLVVSKATHG LKNVQWEAPS LLAEGGKITG QGSQWQVTLP AYRPGKDNYY AISAVAYDNK GNTSKRVQTE VVITGAGMSA DRTALTLDGQ SRIQMLANGN EQKPLVLSLR DAEGQPVTGM KDQIKTELTF KPAGNIVTRS LKATKSQAKP TLGEFTETEA GVYQSVFTTG TQSGEATITV SVDGMSKTVT AELRATMMDV ANSTLSANEP SGDVVADGQQ AYTLTLTAVD SEGNPVTGEA SRLRFVPQDT NGVTVGAISE IKPGVYSAAV SSTRAGNVVV RAFSEQYQLG TLQQTLKFVA GPLDAAHSSI TLNPDKPVVG GTVTAIWTVK DAYDNPVTSL TPEAPSLAGA AAEGSTASGW TNNGDGTWTA QITLGSTAGE LEVMPKLNGQ NAAANAAKVT VVADALSSNQ SKVSVAEDHV KAGESTTVTL VAKDAHGNAI SGLALSASLT GTASEGATVS SWTEKGNGSY VATLTTGGKT GELRVMPLFN GQPAATEAAQ LTVIAGEMSS ANSTLVADNK APTVKTTTEL TFTVKDAYGN PVTGLKPDAP VFSGAASTGS ERPSAGNWTE KGNGVYVSTL TLGSAAGQLS VMPRVNGQNA VAQPLVLNVA GDASKAEIRD MTVKVNNQLA NGQSANQITL TVVDTYGNPL QGQEVTLTLP QGVTSKTGNT VTTNAAGKAD IELMSTVAGE HNISASVNGA QKTVTVKFNA DASTGQANLQ VDAAAQKVAN GKDAFTLTAN VEDKNGNPVP GSLVTFNLPR GVKPLTGDNV WVKANDEGKA ELQVVSVTAG TYEITASAGN SQPSNTQTIT FVADKATATV SGIEVIGNYA LADGNAKQTY KVTVTDANNN LLKDSEVTLT ASPANLVLTP NGTAKTNEQG QAIFTATTTV AAKYTLTAKV SQADGQESTK TAESKFVADD KNAVLTASSD VTSLVADGIS TAKLEVTLMS ANNPVGGNMW VDIKTPEGVT EKDYQFLPSK NDHFVSGKIT RTFSTSKPGV YTFTFNALTY GGYEMKPVTV TITAVDADTA KGEEAMN
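The sequences above are printed in space-separated blocks of 10 characters and wrapped over several lines, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-content calculation and a rough CDS sanity check. The helper names are hypothetical, and the start-codon test is a simplification: translation table 11 also permits alternative start codons such as GTG and TTG, which this check would reject.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: length divisible by 3, ATG start, recognised stop codon.

    Simplification: only ATG is accepted as a start codon.
    """
    seq = normalize(seq)
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stops


# The first two blocks of the gene sequence, as printed in the record.
first_blocks = "ATGTCACGTT ATAAAACAGG"
print(normalize(first_blocks))   # ATGTCACGTTATAAAACAGG
print(gc_percent(first_blocks))  # 35.0
```

Applied to the full gene sequence, `gc_percent` can be compared with the GC content field and `len(normalize(...))` with the Gene Length field; the short fragment above is only used to keep the example checkable by hand.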