Gene Information |
Locus tag | EcE24377A_2230 |
Symbol | |
ID | 5589405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2193020 |
End bp | 2194984 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640925896 |
Product | phage terminase large subunit (GpA) |
Protein accession | YP_001463296 |
Protein GI | 157155074 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
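The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2193020
end_bp = 2194984

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an uninterrupted reading frame (not spliced,
# not a pseudogene) whose final codon is a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length of 1965 bp and protein length of 654 aa.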
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTGGC AGACATACGA GTCGACTTCA GCGATTACGA TGTCAGCGAT ATCGAACGCG ATCTCGACAG GGTTGAATCC CTTGCGCGTG ACGATACCGA TGACGGGGGT TGAGTGGGCT GATAAATATT TTTATCTCCC TGAGGGCTCC AGCCACATCG CTGGCCACTG GACGACTCAG CCGGTCCAGG TAGTGATGCT CAATATGATG ACTAACGACG CGATAAAAAT CGTGTCTGTT CGCAAATCAG CTCGTCTCGG TTATACAAAA ATACTCGTCG CGGCGCTGCT CTATTTCGCT GAGCACAAAA AACGTAGTGC CGTGGTCTAT CAGCCTATCG ATGACGAATC GGATGGATTT GTTGCCGACG AGGTTGACCC CGCTATCGCC GAAATGCCGG TGATTCAGAA AATTTTCCCC GACTGGGATA AAAGCAACGA GCGTAACAAT CTCCAGCGTA AAGAAATGAG CGGCGCGATT ATTGATTTTC GCGGCGCGAG TGCACCAGGA AATTTCCGGC GACTAACGAA ACAGGTTGTC GAGGGTGACG AAGTTGACGG CTGGCCGCTG GAAGTTGCCA AAAAAGGCAA AGGCGAGGGC TCGCCCATCG AACTGGCGCT CGTTCGAATT AAAGGGGCAG CGTACCCGAA GGCGATTTTC GGCTCGACGC CAACCGTCAC CGGAAAAAGT CATATTGAAA TGCTGGAGGA TGCCGCTGAC CTGACGTTTC GTTTTTACCT GAAATGTCCG CATTGTGGCG AGGAGCAGGT CCTGGTATTT GGTTTCGACG GCATCGAATA TGGCCTCAAA TGGGATAATA GCCTGCAGAC AAATGAGGCG AAATCGTCGT CCGCGTATTA CCAGTGCTGC CACTGTCCTG AGCATTTTTA CTATCGCGAT CTCGAAAAAA TGGAGCTCGC GGGGCGCTGG ATAGCAGAGG ACTGCACCTG GACGCGGGAC GGTATTCGTT TTTTTGATCA CGACGGTGGT GTCGTTCGCG CGCCGAAACA CGCGGCGATC GTGATAAACG CCATGTATTC GCTGAATCTC GACGGCTGGG GCGAGATTGT CAGCGAGTGG CTGAAAGCGA AGGGCGATCC GCTCAAAGAA AAGACGTTTC ATAACACGAC GCTCGGCGAA CTCTGGAGTG ACGTAGCCAG CGAGCAGCTG GAGCACGATA TTCTGGTTAA TCGCCGGGAA AAATACGCCA GCCAGGTTCC TGACGGTGTT GTTTATCTGA CCGGCGGCAT CGACTCTCAG ACGTCCGGTC GCTACGAGTG TTACGTGTGG GGCTGGGGAG TGGAGGAGGA GTGCTGGCTG ATTGATAAAA CAATCGTCCT CGGTCGCTAC GACGAGGAGG ACACGCTGCA GCGCGTCGAC GGAGTGATTC GCAAACAATA CCGGCGCAGC GACGGGACCA CAATCGGCGT CAGTCGCTGG GCGTGGGATA CCGGTGGTAT AGATGCGCAG GTCGTTTATA ACCGCTCGCT GAAACTCGGT CCGCTGTGGG TCATTCCAAT TAAAGGTGCG AGTTCATACG GTCAGCCAGT CGTAAATATG CCGCGTACAC GTAACGCGAA TAAAGTCTAT TTGTCGTTAA TCGGTACTGA TACGGCAAAA GATTTGCTCG CAATGCGCCT GCCGCTGGAA CCCGATTCTA AATCGGCGAC ACCAAGTGCG ATTCATTTTC CCAACGACGA CGAAATATTC GGCACGACAG AGGCAAAACA GCTCGTCTCT GAAGTTCTGA TCCCGAAACT GATTAACGGT CGCGTCGTTT ATCGCTGGGA CAACCAGGGC 
CGGCGAAATG AGGCGCTCGA CTGCTGGGTA TACGCGCTGG CAGCGCTACG TATCAGTAAA ATTCGTTTCC AGCTCAATCT CGAGACGCTC GCTGAGCAAC GGAAAAAATC ACAAAACAAA CTGTCTCTCG AGGAGATGGC CAGAATGCTC GGAGGGAGCT CATGA
|
Protein sequence | MNWQTYESTS AITMSAISNA ISTGLNPLRV TIPMTGVEWA DKYFYLPEGS SHIAGHWTTQ PVQVVMLNMM TNDAIKIVSV RKSARLGYTK ILVAALLYFA EHKKRSAVVY QPIDDESDGF VADEVDPAIA EMPVIQKIFP DWDKSNERNN LQRKEMSGAI IDFRGASAPG NFRRLTKQVV EGDEVDGWPL EVAKKGKGEG SPIELALVRI KGAAYPKAIF GSTPTVTGKS HIEMLEDAAD LTFRFYLKCP HCGEEQVLVF GFDGIEYGLK WDNSLQTNEA KSSSAYYQCC HCPEHFYYRD LEKMELAGRW IAEDCTWTRD GIRFFDHDGG VVRAPKHAAI VINAMYSLNL DGWGEIVSEW LKAKGDPLKE KTFHNTTLGE LWSDVASEQL EHDILVNRRE KYASQVPDGV VYLTGGIDSQ TSGRYECYVW GWGVEEECWL IDKTIVLGRY DEEDTLQRVD GVIRKQYRRS DGTTIGVSRW AWDTGGIDAQ VVYNRSLKLG PLWVIPIKGA SSYGQPVVNM PRTRNANKVY LSLIGTDTAK DLLAMRLPLE PDSKSATPSA IHFPNDDEIF GTTEAKQLVS EVLIPKLING RVVYRWDNQG RRNEALDCWV YALAALRISK IRFQLNLETL AEQRKKSQNK LSLEEMARML GGSS
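The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the amino-acid assignments of translation table 11, which for elongation codons are the same as the standard genetic code. The function name `translate` and the two short excerpts are illustrative; only the first 18 and last 15 bases of the gene are used here.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Translation table 11 shares these assignments with the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring spaces.

    Stop codons are rendered as '*'. Trailing bases that do not
    form a complete codon are dropped.
    """
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 18 bases and last 15 bases of the gene sequence above.
gene_start = "ATGAACTGGC AGACATAC"
gene_end = "GGAGGGAGCT CATGA"

print(translate(gene_start))  # N-terminal residues
print(translate(gene_end))    # C-terminal residues followed by the stop
```

The first excerpt translates to MNWQTY and the second to GGSS followed by a stop codon, matching the beginning and end of the listed protein sequence.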