Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1788 |
Symbol | |
ID | 6971438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1707272 |
End bp | 1708297 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643385735 |
Product | phage major capsid protein E |
Protein accession | YP_002270225 |
Protein GI | 209400938 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.123536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGT ACACAACCGC CCAACTGCTG GCGGCAAATG AGCAGAAATT TAAGTTTGAT CCGCTGTTTC TGCGTCTCTT TTTCCGTGAG AGCTATCCCT TCACCACGGA GAAAGTCTAT CTCTCACAAA TTCCGGGACT GGTAAACATG GCGCTGTACG TTTCGCCGAT TGTTTCTGGT GAGGTTATCC GTTCCCGTGG CGGCTCCACC TCTGAATTTA CGCCGGGATA TGTCAAGCCG AAGCATGAAG TGAATCCGCA GATGACCCTG CGTCGCCTGC CGGATGAAGA TCCGCAGAAT CTGGCGGACC CGGCTTACCG CCGCCGTCGC ATCATCATGC AGAACATGCG TGACGAAGAG CTGGCCATTG CTCAGGTCGA AGAGATGCAG GCAGTTTCTG CCGTGCTCAA GGGCAAATAC ACCATGACCG GTGAAGCCTT CGATCCGGTT GAGGTGGATA TGGGCCGCAG TGCGGCGAAC AACATCACGC AGTCCGGCGG CACGGAGTGG AGCAAGCGTG ACAAGTCCAC GTATGACCCG ACCGACGATA TCGAAGCCTA CGCGCTGAAC GCCAGCGGCG TGGTGAATAT CATCGTGTTT GACCCGAAAG GCTGGGCGCT GTTCCGTTCC TTCAAAGCCG TCAGGGAGAA GCTGGATACC CGTCGCGGCT CTCATTCCGA ACTGGAGACA GCGGTAAAAG ACCTGGGCAA AGCGGTGTCT TATAAGGGAA TGTATGGCGA TGTGGCCATC GTCGTGTATT CCGGACAGTA CGTGGAAAAC GGCGTCAAAA AGAACTTCCT GCCGGACAAC ACGATGGTGC TGGGGAACAC TCAGGCACGC GGTCTGCGTA CCTATGGCTG CATTCAGGAT GCGGACGCAC AGCGCGAAGG CATTAACGCC TCTGCCCGTT ACCCGAAAAA CTGGGTGACC ACCGGCGATC CGGCGCGTGA GTTCACCATG ATTCAGTCAG CACCGCTGAT GCTGCTGGCT GACCCTGATG AGTTCGTGTC CGTACAACTG GCGTAA
|
Protein sequence | MSMYTTAQLL AANEQKFKFD PLFLRLFFRE SYPFTTEKVY LSQIPGLVNM ALYVSPIVSG EVIRSRGGST SEFTPGYVKP KHEVNPQMTL RRLPDEDPQN LADPAYRRRR IIMQNMRDEE LAIAQVEEMQ AVSAVLKGKY TMTGEAFDPV EVDMGRSAAN NITQSGGTEW SKRDKSTYDP TDDIEAYALN ASGVVNIIVF DPKGWALFRS FKAVREKLDT RRGSHSELET AVKDLGKAVS YKGMYGDVAI VVYSGQYVEN GVKKNFLPDN TMVLGNTQAR GLRTYGCIQD ADAQREGINA SARYPKNWVT TGDPAREFTM IQSAPLMLLA DPDEFVSVQL A
|
| |