Gene Information |
Locus tag | EcE24377A_2223 |
Symbol | |
ID | 5589449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2186358 |
End bp | 2187392 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640925891 |
Product | phage major capsid protein E |
Protein accession | YP_001463291 |
Protein GI | 157156927 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
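The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that contributes no residue. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag EcE24377A_2223.
length = gene_length(2186358, 2187392)
print(length)                  # 1035
print(protein_length(length))  # 344
```

Both results agree with the Gene Length (1035 bp) and Protein Length (344 aa) fields in the record.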
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00397353 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATT CGTTTACTAC GTCAGAACTG ATTACCGCAA CGCAGCAGGT ATTTAAGTTC AATCCGTTGT TTTTAAGATT GTTTTTCCGT GAGACCTACA CGTTTACGAG CGAAGAGGTT TTTCTGGATA AAATCCCGGG CAAAGTCAAT ATGGCGGTAT ATTGCGCCCC GATGATCACC GGCAAAGTTG ACCGCACTCG CGGCTATTCA ACGAACCACT TTAAACCGGG TTACACGAAA CCGAAACACA CGATCAATCC GAATATGAGC ATCAAGCGCG CCGCCGGTGA GCAAATTGGC CAGCCAGAAA CGCCGGTCGA ACGTCGTGCA AAAATCATCA TGCAGAATTT GCTCGACGAG GAGCTCAGCA TCAGCCAGCT CGAAGAGTTC CAGGCAGTGC AGGCGGTTCT GTACGGTAAA TACACCGTTT CCGGCAGCAA TATCGAGACC TATGAGATCG ACATGAGCCG CAGCGCGACG AATAACGTCA CTCAGTCCGG TTCGACCGCC TGGTCGACTC AGGACGCGGA AACGTATGAC CCGAGCGACG ATATTGAATC CTATGCAGAC CTCGCCTCCG GTGCGGTTAA CGTGATCATC ATGGACGGTA AAGCCTGGAA GCAGCTTAAA CGCTTTAAAA AATTCTGGAC GGCACTGGAT ACGCGCCGCG GCTCAAACAG TCAGCTCGAA GTCGCGCTGA AAAACCTGGG CGATGTCGTT AGCTTTAAGG GCTACTACGG CGACACGGCG CTGTTTGTCT ACAAAGGGCA ATACATCGAC CCGGTAACAG GCACTGAAAC GCGTTATATG CCGGATAACA CGATGATCCT TGGCAACACA AAAAACCGCG GGCTCCGCAC TTATGGCGCA ATTCAGGACG AAGACGCGCT GAAAGAGGGT ATTTGCGAAG CGACGCGCTA TCCAAAAGTC TGGACCACTA CCGGTGATCC GGCAGTGACG CAAACAATGA CGCAATCCGC GCCAGCAATG GTCCTCACGG ACGCCGACGC GTTCGTTGTC GTAAAAATCG CGTAA
|
Protein sequence | MSDSFTTSEL ITATQQVFKF NPLFLRLFFR ETYTFTSEEV FLDKIPGKVN MAVYCAPMIT GKVDRTRGYS TNHFKPGYTK PKHTINPNMS IKRAAGEQIG QPETPVERRA KIIMQNLLDE ELSISQLEEF QAVQAVLYGK YTVSGSNIET YEIDMSRSAT NNVTQSGSTA WSTQDAETYD PSDDIESYAD LASGAVNVII MDGKAWKQLK RFKKFWTALD TRRGSNSQLE VALKNLGDVV SFKGYYGDTA LFVYKGQYID PVTGTETRYM PDNTMILGNT KNRGLRTYGA IQDEDALKEG ICEATRYPKV WTTTGDPAVT QTMTQSAPAM VLTDADAFVV VKIA
|
| |