Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3052 |
Symbol | |
ID | 6972087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2826559 |
End bp | 2828331 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643386884 |
Product | phage large terminase subunit GpP |
Protein accession | YP_002271352 |
Protein GI | 209400321 |
COG category | [S] Function unknown |
COG ID | [COG5484] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.680947 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.000000111956 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCATCA CCACAGACAC CACTCTTTTA CACGACCCGC GTCGTCAGGC GGCGCTGCTG TACTGGCAGG GATTTTCCGT GCCGCAGATT GCCGCCATGT TGCAGATGAA ACGCCCGACG GTGCAGAGCT GGAAACAGCG CGACGGCTGG GACAGCGTTG CCCCCATCAG CCGTGTCGAA ATGAGTCTGG AAGCGCGGCT GACCCAGCTC ATCATCAAAC CGCAGAAAAC CGGCGGTGAC TTCAAGGAAA TTGACCTGCT CGGACGCCAG ATTGAACGAC TGGCACGGGT AAACCGCTAC AGCCAGACCG GCAACGAGGC AGACCTTAAT CCGAACATCG CTAACCGCAA CAAAGGCGGG CGGCGCAAAC CGAAAAAGAA TTTTTTCAGT GACGAGGCTA TCGAAAAGCT GGAGCAGATT TTCTTTGAGC AGTCTTTCGA ATATCAGTTG CACTGGTATC GCGCCGGGCT TGAGCACCGC ATCCGCGATA TCCTGAAATC CCGCCAGATT GGCGCGACGT TTTATTTTTC CCGCGAGGCG CTGCTGCGCG CCCTGAAAAC CGGTCATAAC CAGATTTTTC TGTCGGCCAG TAAAACGCAG GCGTATGTGT TCCGCGAATA CATCATCGCC TTTGCCCGGC TGGTTGACGT TGACCTGACC GGTGACCCGA TTGTCCTGGG CAATAACAGC GCAAAACTGA TTTTTCTCGG CACCAACTCC AACACCGCGC AGAGCCATAA CGGCGACCTG TACGTCGACG AGATTTTCTG GATACCGAAT TTTCAGGTAC TGCGTAAGGT GGCATCAGGT ATGGCCTCAC AGAGTCACCT GCGCTCGACC TATTTCTCCA CCCCGTCCAC GCTGGCGCAC GACGCCTACC CGTTCTGGTC GGGTGAACTG TTCAACCGGG GACGCGCCAG CGCCGCCGAA CGCGTGGAAA TCGACGTCAG TCATAACGCT CTTGCCGGTG GGCTTCTCTG TGCGGACGGT CAGTGGCGGC AGATTGTCAC CATTGAGGAC GCCCTGAAAG GTGGCTGCAC GCTGTTCGAC ATTGAGCAGC TTAAACGCGA AAACAGCGCC GACGATTTTA AAAACCTGTT CATGTGTGAA TTTGTTGACG ACAAGGCGTC GGTGTTCCCG TTCGAGGAGC TGCAACGCTG CATGGTCGAC ACGCTGGAAG AATGGGAAGA CTATGCGCCG TTTGCCGCGA ATCCGTTCGG CTCCCGCCCG GTCTGGATTG GTTACGACCC GTCACACCGT GGCGACAGTG CCGGATGCGT GGTGCTGGCA CCGCCGGTGG TGGCCGGTGG CAAATTCAGA ATACTTGAGC GTCACCAGTG GAAAGGCATG GACTTTGCCA CCCAGGCTGA ATCCATCCGC AAACTCACCG AAAAATACAA CGTCGAATAC ATCGGAATTG ATGCCACCGG CCTCGGTGTC GGCGTGTTCC TGCTCGTTCG CTCGTTCTAT CCCGCCGCAC GCGATATCCG CTACACGCCG GAAATGAAAA CCGCAATGGT GCTCAAGGCA AAAGACGTTA TTCGCCGTGG CTGTCTGGAA TATGACGTCA GCGCCACCGA CATCACCAGC TCGTTTATGG CTATCCGCAA GACCATGACC AGCAGCGGAC GCAGCGCCAC CTATGAGGCC AGCCGCAGCG AGGAAGCCAG CCACGCCGAC CTCGCCTGGG CGACCATGCA CGCCCTGTTA AATGAGCCAC TCACCGCCGG TATCAGCACC CCGCTGACAT CCACCATTCT GGAGTTTTAC TGA
|
Protein sequence | MTITTDTTLL HDPRRQAALL YWQGFSVPQI AAMLQMKRPT VQSWKQRDGW DSVAPISRVE MSLEARLTQL IIKPQKTGGD FKEIDLLGRQ IERLARVNRY SQTGNEADLN PNIANRNKGG RRKPKKNFFS DEAIEKLEQI FFEQSFEYQL HWYRAGLEHR IRDILKSRQI GATFYFSREA LLRALKTGHN QIFLSASKTQ AYVFREYIIA FARLVDVDLT GDPIVLGNNS AKLIFLGTNS NTAQSHNGDL YVDEIFWIPN FQVLRKVASG MASQSHLRST YFSTPSTLAH DAYPFWSGEL FNRGRASAAE RVEIDVSHNA LAGGLLCADG QWRQIVTIED ALKGGCTLFD IEQLKRENSA DDFKNLFMCE FVDDKASVFP FEELQRCMVD TLEEWEDYAP FAANPFGSRP VWIGYDPSHR GDSAGCVVLA PPVVAGGKFR ILERHQWKGM DFATQAESIR KLTEKYNVEY IGIDATGLGV GVFLLVRSFY PAARDIRYTP EMKTAMVLKA KDVIRRGCLE YDVSATDITS SFMAIRKTMT SSGRSATYEA SRSEEASHAD LAWATMHALL NEPLTAGIST PLTSTILEFY
|
| |