Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3050 |
Symbol | |
ID | 6971965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2824399 |
End bp | 2825472 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643386882 |
Product | phage major capsid protein GpN |
Protein accession | YP_002271350 |
Protein GI | 209400016 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01551] phage major capsid protein, P2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0000148658 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCCAGG AAACCCGCTT TAAATTTAAT GCCTACCTGT CCCGTGTTGC CGAACTGAAC GGCATCGACG CCGGTGATGT GTCGAAAAAA TTCACCGTTG AACCGTCGGT CACCCAGACC CTGATGAACA CCATGCAGGA GTCCTCTGAC TTTCTGACCC GCATCAACAT TGTGCCGGTC AGCGAAATGA AAGGGGAAAA AATTGGTATT GGTGTCACCG GCTCCATCGC CAGCACCACA GACACCGCCG GTGGCACCGA GCGTCAGCCG AAGGACTTCT CGAAGCTGGC GTCAAACAAG TACGAATGCG ACCAGATTAA CTTCGATTTT TATATCCGCT ACAAAACGCT TGACCTGTGG GCGCGTTATC AGGATTTCCA GCTCCGTGTC CGTAACGCCA TTATCAAACG CCAGTCCCTT GATTTAATCA TGGCCGGTTT TAACGGCGTG AGGCGTGCCG AAACCTCTGA CCGCAGCAGT AACCAGATGC TGCAGGATGT GGCGGTCGGC TGGCTGCAGA AATACCGCAA TGAAGCCCCG GCGCGCGTGA TGAGCAAGGT TACTGACGAG GAAGGTCACA CGACCTCTGA GGTCATCCGC GTGGGTAAGG GCGGTGATTA TGCCAGCCTC GATGCACTGG TGATGGATGC GACCAACAAC CTGATTGAGC CGTGGTATCA GGAAGACCCT GACCTTGTGG TGATTGTGGG GCGTCAGCTG CTGGCGGACA AGTATTTTCC CATCGTCAAC AGGGAGCAGG ACAACAGCGA GATGCTGGCC GCTGACGTCA TCATCAGCCA GAAACGCATC GGTAACCTGC CGGCGGTACG CGTCCCGTAC TTCCCGGCGG ATGCGATGCT CATCACGAAG CTGGAAAACC TGTCCATCTA CTACATGGAT GACAGCCATC GCCGCGTGAT TGTGGAAAAC CCGAAACTCG ACCGCGTGGA GAACTACGAG TCAATGAACA TTGATTACGT GGTGGAAGAC TACGCCGCCG GTTGTCTGGT GGAAAAAATT AAGGTCGGTG ACTTCTCCAC ACCGACTAAA GTGACCGCAG AGCCGGGAGC GTAA
|
Protein sequence | MRQETRFKFN AYLSRVAELN GIDAGDVSKK FTVEPSVTQT LMNTMQESSD FLTRINIVPV SEMKGEKIGI GVTGSIASTT DTAGGTERQP KDFSKLASNK YECDQINFDF YIRYKTLDLW ARYQDFQLRV RNAIIKRQSL DLIMAGFNGV RRAETSDRSS NQMLQDVAVG WLQKYRNEAP ARVMSKVTDE EGHTTSEVIR VGKGGDYASL DALVMDATNN LIEPWYQEDP DLVVIVGRQL LADKYFPIVN REQDNSEMLA ADVIISQKRI GNLPAVRVPY FPADAMLITK LENLSIYYMD DSHRRVIVEN PKLDRVENYE SMNIDYVVED YAAGCLVEKI KVGDFSTPTK VTAEPGA
|
| |