Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3508 |
Symbol | |
ID | 6967267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3252488 |
End bp | 3254425 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643387309 |
Product | tail fiber protein |
Protein accession | YP_002271772 |
Protein GI | 209398916 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.283696 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGTTG TTGTTTCGGG GACGCTGAAA TCTCCTGATG GTGAGGCGAT ATCAGGAGCA AATATTACCC TGACGGCGCT GACAGTTTCA CCGGATGCGC TCAGCGGCAC CAGTGCGTCG GCAGTGACCC GTGAAGGTGG ATATTACGGA ATGACGATGG ATCCGGGGGA GTATGCGGTT TCGGTGACGG TGAAAGGGAA GACTGTTGTC TACGGACGTG TGCGTATTGA GGGGACCGAA AGTACGGTGA CGCTCAATAT GCTGTTACGC CGCAGTCTTG TTGAGGTTAG CATACCCGGA GAACTGCTGA CAGATTTCCG GCAGATACAG AATAATGTGG CTGATGACCT TGCCACTATT CGTCGCCTGA ATGAAGACAC GGCGACAAAA AACACTCAGG CCACACAGTC AAAAGAAAGT GCAGCAGCCA GTGCGAAGAG TGCATCTGAC AGTGCAAAGA CGGCAACCAG CAGGGCGGCT GAAGCCGGAC AAAAAGCGAC TGATGCCACT GAGGCTGCGA CCCGTGCAGT CACAGCAGCG GGGAATGCAG AGGAAAGCTC GACCCGTGCC GGAGAGTCTG AAAAAGCCGC CGGAGCTGAT GCAGAAAAAG CCAGACAGCA TGCTGAAAAG GCCAGGCTGG CGCAGGAGAG CGCCGGAGAG ATCCTTAAGC GGGCAGAGGC TGCCACTGTC AGTGCTGAAG AGGCCAGACG TATGGCTGAG AATGCACGGG GGCCCCGGGG GCCTCAGGGA GAAACTGGTC CGAAGGGGGA TGTCGGTCCT AAAGGCGAAA CAGGTCCAGT GGGCCCTCAA GGGCCCGCAG GGCCGAAAGG TGAGCGTGGT GACGTTGGTG CTCAGGGGGC TGTAGGGCCT GCTGGTCCGC GTGGTGAGAA GGGCGAACAG GGGGAGCGAG GACCGCAGGG AATACCAGGC CTGAAGGGGG ATACCGGAGA GCGGGGGCCT AAAGGGGACC AGGGGGATAT GGGGCCAAAA GGCGAGAAAG GTGATCCGGG AGGTCCTGCA GGCCCGCAAG GTCCTAAAGG CGAACGAGGA GAAGCCGGAC CACAGGGACC GATGGGAGCA CGAGGTGAGC GTGGGGAGAC TGGCCCCCGA GGTGAACCTG GTCCTGCAGG TCCGAGAGGC GAACGAGGAG AGACCGGACC TCAGGGACCT CGTGGAGAGC CAGGTCCGGC AGGCAGCGCT GCAAATGTGG CTGATGCAAC GACGGCACAG AAGGGAATTG TGCAGTTAAG CAGCGCAACG GACAGTGATG ATGAAACGAA GGCTGCCACC CCGAAAGCGG TGAAAGCGGC AATGGATGTG GCAAATGAAG CGAAAACAAA GGCAGAAGAG GCTGCAGCAG GAGGTGGTGT TCCCGGTCCG AAAGGAGATA AAGGGGACAC GGGGCCAGCA GGTCCGGCTG GGCCGAAGGG TGATAAGGGA GAGCGCGGTG ACACCGGCCC TGTCGGGGCA ACCGGCGAAC GGGGACCGGC AGGTGATGCT GGTCCGGCAG GCCCGCAGGG GCCGAAAGGT GACAGGGGAG AGCGGGGAGA GACCGGTCTG ACGGGAAATG CAGGTCCACA GGGTCCAAAG GGAGATACCG GTGCGGCAGG CCCGGCAGGC CCACAGGGAC CGAAAGGAGA AACAGGTGCG GCAGGCCCGG TGGGGGCGAC CGGACCTCAG GGGCCGAAGG GCGACCCGGG GGAGACGCAA ATACGGTTCC GTCTGGGGCC GGGAAACATT ATTGAGACAA ACAGCCATGG CTGGTTCCCG GATACAGATG GCGCACTCAT CACCGGACTG ACCTTTCTTG ACCCCAAAGA TGCCACACGG GTTCAGGGTT TTTTTCAGCA TTTGCAGGTC AGGTTTGGTG ACGGGCCGTG GCAGGATGTC AAGGGGCTGG ATGAAGTGGG CAGTGATACA GGCAGAACAG GAGAATGA
|
Protein sequence | MSVVVSGTLK SPDGEAISGA NITLTALTVS PDALSGTSAS AVTREGGYYG MTMDPGEYAV SVTVKGKTVV YGRVRIEGTE STVTLNMLLR RSLVEVSIPG ELLTDFRQIQ NNVADDLATI RRLNEDTATK NTQATQSKES AAASAKSASD SAKTATSRAA EAGQKATDAT EAATRAVTAA GNAEESSTRA GESEKAAGAD AEKARQHAEK ARLAQESAGE ILKRAEAATV SAEEARRMAE NARGPRGPQG ETGPKGDVGP KGETGPVGPQ GPAGPKGERG DVGAQGAVGP AGPRGEKGEQ GERGPQGIPG LKGDTGERGP KGDQGDMGPK GEKGDPGGPA GPQGPKGERG EAGPQGPMGA RGERGETGPR GEPGPAGPRG ERGETGPQGP RGEPGPAGSA ANVADATTAQ KGIVQLSSAT DSDDETKAAT PKAVKAAMDV ANEAKTKAEE AAAGGGVPGP KGDKGDTGPA GPAGPKGDKG ERGDTGPVGA TGERGPAGDA GPAGPQGPKG DRGERGETGL TGNAGPQGPK GDTGAAGPAG PQGPKGETGA AGPVGATGPQ GPKGDPGETQ IRFRLGPGNI IETNSHGWFP DTDGALITGL TFLDPKDATR VQGFFQHLQV RFGDGPWQDV KGLDEVGSDT GRTGE
|
| |