Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3030 |
Symbol | |
ID | 6967305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2812984 |
End bp | 2814174 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643386862 |
Product | phage tail sheath protein |
Protein accession | YP_002271330 |
Protein GI | 209397008 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.00296103 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGATT ATCATCACGG CGTGCAGGTG CTGGAGATTA ACGACGGCAC CCGCGTCATT TCCACCGTAT CCACCGCCAT TGTCGGCATG GTCTGCACGG CCAGCGATGC GGATGCGGAA ACCTTCCCCC TCAATAAACC TGTGCTGATT ACCAATGTGC AGAGCGCAAT TGCAAAGGCC GGTAAAAAAG GCACGCTGGC GGCATCGTTG CAGGCCATCG CTGACCAGTC AAAACCGGTT ACCGTTGTCG TGCGTGTTGA AGACGGCACC GGCGACGACG AAGAAACGAA ACTCGCGCAG ACCGTTTCCA ATATCATCGG CACCACTGAC GAAAACGGTC AGTATACCGG ACTGAAAGCC CTGCTGGCGG CAGAGTCGGT AACCGGTGTT AAACCGCGTA TTCTCGGCGT GCCGGGACTG GACACCAAAG AGGTGGCTGT TGCACTGGCA TCAGTCTGTC AGAAGCTGCG CGCTTTCGGG TATATCAGCG CATGGGGCTG TAAAACCATT TCCGAGGTGA AAGCCTACCG CCAGAATTTC AGCCAGCGTG AGCTGATGGT CATCTGGCCG GATTTCCTCG CATGGGATAC GGTCAGCAGC ACCACCGCCA CCGCGTATGC CACCGCCCGT GCGCTGGGTC TGCGCGCTAA AATCGACCAG GAGCAGGGCT GGCATAAAAC GCTGTCCAAC GTCGGGGTAA ACGGTGTTAA CGGCATCAGC GCATCTGTAT TCTGGGATTT GCAGGAGTCC GGCACCGATG CTGACCTGCT TAACGAGTCA GGCGTCACTA CGCTGATTCG CCGCGACGGT TTCCGCTTCT GGGGTAACCG TACCTGCTCT GATGACCCGC TGTTCCTCTT TGAAAACTAC ACCCGCACCG CGCAGGTGCT GGCCGACACG ATGGCTGAGG CGCACATGTG GGCGGTGGAC AAGCCCATCA CCGCAACGCT GATTCGCGAC ATCGTTGACG GCATCAATGC CAAATTCCGT GAGCTGAAAA CAAACGGCTA TATCGTGGAT GCGACCTGCT GGTTCAGCGA AGAATCCAGC GATGCGGAAA CCCTCAAGGC CGGAAAACTG TATATCGACT ACGACTATAC ACCGGTACCT CCTCTTGAAA ACCTGACCCT GCGCCAGCGT ATCACCGATA AATACCTGGC AAATCTGGTC ACCTCGGTTA ACAGCAATTA A
|
Protein sequence | MSDYHHGVQV LEINDGTRVI STVSTAIVGM VCTASDADAE TFPLNKPVLI TNVQSAIAKA GKKGTLAASL QAIADQSKPV TVVVRVEDGT GDDEETKLAQ TVSNIIGTTD ENGQYTGLKA LLAAESVTGV KPRILGVPGL DTKEVAVALA SVCQKLRAFG YISAWGCKTI SEVKAYRQNF SQRELMVIWP DFLAWDTVSS TTATAYATAR ALGLRAKIDQ EQGWHKTLSN VGVNGVNGIS ASVFWDLQES GTDADLLNES GVTTLIRRDG FRFWGNRTCS DDPLFLFENY TRTAQVLADT MAEAHMWAVD KPITATLIRD IVDGINAKFR ELKTNGYIVD ATCWFSEESS DAETLKAGKL YIDYDYTPVP PLENLTLRQR ITDKYLANLV TSVNSN
|
| |