Gene Information |
Locus tag | ECH74115_1583 |
Symbol | |
ID | 6967183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1543001 |
End bp | 1544056 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643385546 |
Product | phage major capsid protein E |
Protein accession | YP_002270040 |
Protein GI | 209396882 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.000137547 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAGGCA AAGCAACGGC ACTTAACACT AACCAGCTTT TCATGTACCT GAATCGCGGG GATATTGCGG ATTTTAAATT CAGCCCTCTG TTTACCACGC TGTTTTTCCC GAACGTGGCG ACATTCAGCA CGCAAAACAT CATGCTGGAT ACCCTGGACA TTGAAGAAGT CACCATGTCG GCGTTTTGTT CGCCTATGGT GGGTAGCCAG GTTCAGCGCG ATAAAGGGTA CGAAACCAGC ACAATCAAAC CTGGCTACAT GAAGCCAAAG CACGAAATCG ATCCAACGAA AACCATCATG CGCATGGCTG GAGAAGATCC GGCACAGCTT AACGACCCTA CCTATCGCCG TATGCGCCTG ATTACTGGCA ACATGCGCCG CCAGATAAAC GCCATTAAGG CACGCGTGGA ATGGCTGGCG GTGAATGCGG TAACGACCGG AAAAAACATC ATTGAGGGCG AAGGCATAGA ACGCTATGAA ATCGACTGGA AAATACCGGA AAACTGCATC ATAGAGCAGG CCAAGGGTAA AAAATGGTCC GAGCAGGATA AAGACATGCA CGACCCAATC TATGACATCG AGCTTTATGC TGATCAGGCT GGTTGCCCCG CAAACGTCAT GATTATGGGC GCTGAGGTAT GGCGCACATT ACGCAGCTTT AAAAAATTCC GTGAGCTGTA CGATCTTTCC CGTGGTTCAG AATCCGCCGC CGAACTGGCC TGTAAAAACC TGGGCGAAGT GGTGAGCTTT AAAGGCTATC TTGGTGATCT GGCCCTTATC GTCTATTCCG GCAAATACAC TGACAGCGAT GGTACCGAAA AATATTTCCT TGAGCCTGAT TTGCTGGTCC TGGGTAACAC CAACAATAAA GGGCTGGTGG CCTATGGTGC GATAATGGAT CAGGAAGCGG TAAGAACGGG CGCAACACAA AACATGTTTT ACCCGAAAAA CTGGATTGAG GACGGCGATC CGGCGATTGA GTACGTGCAG ACACACAGTG CACCGCAGCC GGTACCGGCA GATATTCGCA AATTTGTTAC CGTCAAAATT GGTTAA
|
Protein sequence | MAGKATALNT NQLFMYLNRG DIADFKFSPL FTTLFFPNVA TFSTQNIMLD TLDIEEVTMS AFCSPMVGSQ VQRDKGYETS TIKPGYMKPK HEIDPTKTIM RMAGEDPAQL NDPTYRRMRL ITGNMRRQIN AIKARVEWLA VNAVTTGKNI IEGEGIERYE IDWKIPENCI IEQAKGKKWS EQDKDMHDPI YDIELYADQA GCPANVMIMG AEVWRTLRSF KKFRELYDLS RGSESAAELA CKNLGEVVSF KGYLGDLALI VYSGKYTDSD GTEKYFLEPD LLVLGNTNNK GLVAYGAIMD QEAVRTGATQ NMFYPKNWIE DGDPAIEYVQ THSAPQPVPA DIRKFVTVKI G
|
| |
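The coordinate, length, and GC fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA). All function names are hypothetical helpers and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces between the 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


if __name__ == "__main__":
    # Values taken from the Start bp, End bp, and Gene Length fields above.
    print(gene_length(1543001, 1544056))  # 1056
    print(protein_length(1056))           # 351
```

Passing the full Gene sequence string to `gc_fraction` would give a value to compare against the reported GC content; the result is not shown here because it has not been computed for this record.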