Gene Information |
Locus tag | EcolC_2775 |
Symbol | |
ID | 6064857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3041798 |
End bp | 3042856 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641602181 |
Product | P2 family phage major capsid protein |
Protein accession | YP_001725730 |
Protein GI | 170020776 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01551] phage major capsid protein, P2 family |
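The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. A minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (both are assumptions about this record's conventions, not stated in it). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3041798, 3042856)
print(length)                  # 1059
print(protein_length(length))  # 352
```

Both values agree with the Gene Length and Protein Length rows of the record.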
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.765557 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000193345 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGAAGAAGA ATACCCGCTT TGCTTTTAAC GCTTACCTGC AGCAGCTGGC GCGTCTGAAC GGTGTGGCAG TTGAAGAACT GTCCAGCAAG TTCACCGTAG AGCCGTCTGT ACAGCAGACG CTGGAAGACC AGATTCAGCA ATCCGCCGCT TTCCTGACGC TGATTAACGT CACGCCAGTG ACTGAGCAGT CTGGTCAGTT GCTGGGGCTG GGTGTTGGCA GCACCATTGC CGGAACCACT GACACCACCG CGAAAGAGCG TGAACCTGTC GATCCGACGC TGATGGTCGA TGTGGAATAC AAATGCGAGC AGACCAACTT TGACACGGTG CTGACCTACG CGAAGCTGGA CTTGTGGGCG AAGTTTCAGG ATTTCCAGGT GCGTATCCGT GACGCCATCG TGAAACGTCA GGCACTGGAC CGCATCATGA TCGGCTTTAA CGGCGTGAAG CGTGCGAAAA CCTCCAACCG TAGCGAAAAC CCGCTGCTGC AGGATGTGAA CAAAGGCTGG CTGCAGAAAA TCCGTGAGGA TGCACCGGAT CATGTCATGG GCAGCACCAC CACGGGCGGT GAAACCACAC CGGGTGCGGT GAAAGTCGGG AAAGGTGGCG AATATGCCAA CCTGGACGCC GTGGTGATGG ATGCGGTCAA TGAGCTTATC GACGTGGTCT ACCAGGACGA TGACGATCTG GTGGTGATTT GCGGTCGTGA ACTGTTGTCT GACAAGTATT TCCCGCTGGT CAACAAAGAG CAGGAAAACA GTGAAAAACT GGCTGCCGAT ATGATCATCA GCCAGAAACG CATGGGTGGC CTGCAGGCCG TGCGTGCGCC GTTCTTCCCG CCGAATGCGC TGCTGATCAC CCGTCTGGAT AACCTGTCCA TCTACTGGCA GGAAGATACC CGCCGCCGTT CAGTTATTGA CAACCCGAAA CGTGACCGGA TTGAAAACTT TGAATCCGTT AACGAAGCCT ACGTGGTTGA GGACTACCGT TGCGCTGCAC TGGTGGAAAA CATCCAGATT GGCGATTTCA GCGCCGCCGC AGCAGAAGCC GGAGCGTAA
Protein sequence | MKKNTRFAFN AYLQQLARLN GVAVEELSSK FTVEPSVQQT LEDQIQQSAA FLTLINVTPV TEQSGQLLGL GVGSTIAGTT DTTAKEREPV DPTLMVDVEY KCEQTNFDTV LTYAKLDLWA KFQDFQVRIR DAIVKRQALD RIMIGFNGVK RAKTSNRSEN PLLQDVNKGW LQKIREDAPD HVMGSTTTGG ETTPGAVKVG KGGEYANLDA VVMDAVNELI DVVYQDDDDL VVICGRELLS DKYFPLVNKE QENSEKLAAD MIISQKRMGG LQAVRAPFFP PNALLITRLD NLSIYWQEDT RRRSVIDNPK RDRIENFESV NEAYVVEDYR CAALVENIQI GDFSAAAAEA GA
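The sequence fields above are displayed in space-separated blocks of 10 characters. The following is a minimal Python sketch of how such a field might be cleaned up, checked for GC content, and translated. The helper names (`ungroup`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is built from the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (standard genetic
# code; translation table 11 uses the same codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(field: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = ungroup("ATGAAGAAGA ATACCCGCTT TGCTTTTAAC")
print(len(prefix))        # 30
print(translate(prefix))  # MKKNTRFAFN
```

Applied to the complete Gene sequence field, the same helpers could be used to compare the translation with the Protein sequence field and the GC fraction with the reported GC content.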