Gene Information |
Locus tag | EcHS_A2094 |
Symbol | |
ID | 5594163 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2077958 |
End bp | 2080084 |
Gene Length | 2127 bp |
Protein Length | 708 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640921235 |
Product | Phage terminase large subunit (GpA) |
Protein accession | YP_001458779 |
Protein GI | 157161461 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
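The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the reported protein length excludes the stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(2077958, 2080084)
print(length, protein_length(length))
```

Under these assumptions the coordinates reproduce the listed Gene Length of 2127 bp and Protein Length of 708 aa.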
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.0401019 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACAA ACATATCTCA GTCCTCGGAG AACCAGAGTT TAACCCAGCG GAAGATTGAG CGTCTTCAAC TGAGTGTCCG GAAAGGGTGG ACACCGCCGC CGCGGATCAG CGTCCCGCAA TGGGCCGATG ACTACCGGAA GCTGGCGAAA GAAGCTGGCA GTACCTCCGG GAACTGGGAA ACATCAACGG TTGAAATTGC CCGCGGTCCT ATGCTGGCCG CGACGGAATC GGGCGTCCAC ATTATCACCG TGATGTGCTG TACCCAGTTA ATGAAAACCG CGCTGCTGGA AAACCTGTTT GGTTATTTTG CGCACCTCGA CCCATGTCCG ATTTTGCTCC TGCAGCCGAA GGAAGAGGCC GCTGAGCAGT TTTCCAAAGA ACGCATCAGC CCGCTGGTTA GGGTAACGCC AGTTCTGCGT AACATCATAG GTGACTCAAA GCAGAAGAGT TCAAAAGAAA CCATTCTGTA TAAAGCTTTC ACCGGCGGAT TTCTGGCGCT GGCCGGCGCC GGTAGTCCAG ATAACCTTGC GCGCCGTCCG ATCCGTGTTC TGCTGGCAGA TGAGGTGGAT AAATATCCGA TTACCCGTGA GGGCGATCCC ATCGCTCTGG CGGAAGAGCG AACCGCCACA TTTGGCCTTA ACTGGCTGTC CGTGCGGGCT TGTTCGCCGA CGATCGAAGA TGAAAGCCGG ATTGCTGACA GTTACGAAGA TTCAGATCAG CGGCGGGCCT CTGTAGTTTG CCCCCATTGC GGGCACCGAC AGTTCCTTGA TTTCTTCAAA CATGTTCAAT GGCCAAAAGA AGGTGATAAG CACCTGACCA AAGCGGCCAT GATCCATTGT GAATGTTGTG GTGCTGGCTG GTCAGAGGGT GAGCGTCTGC GGGCATTACA GACAATCCGC TGGCATCAGA CCAAACCGTT TGAATGTTGT GGTTCCCGCC ATTCACCATT AATGGAATAC GACCAGAAAT GGCATGAAGG AGACGAGGGC AGTGTTGATG CCGTCTGGCG CTGGTCAGAG TCGGAACGGC ATGCCGTATA CCGGGCGATT TGCCCGGACT GCGGGGCCGA GGCGCTGGAT AATCACCACG CCGGGTACCA GGCGTCAAAA CTCTTCAGTC CCTGGCAAAA AGACAAGCCA TCGGACATTG CAAAGAAATA CCTCGATGCG AAAGGGGATC CGGATAAGGA ACAGGCCTGG TGGAACACCC AGATGGGGTT GCCGCACCGG CCTAACCACG GGAAACAGCT CCCGGTTGAT GTCCTGCTGG CGCGCCGTGA AGTCTTCCCG GCCGTCGTTC CTGATGGCGT GGCATTGTTA ACTGCGGGCG TCGATACTCA GGATGACCGA TTCGAAATCA CGATCACTGG CTGGGGGCGG GACGAGGAAT CGTGGTCAGT TGCGCATGAC GTCATTTATG GCGATCTGGA GACTGAGGAA CCCTGGAAGC GCCTCGATGC GTACCTGAAA CAGATATGGC GACGCGGCGA CGGGCGAGGG CTGAATATTC TGGCTGCATG TATGGACTCC GGCGGTCACC ACACGCAAAA GGTTTATGAG TTCTGCAAAG ATCGCCTTGG GCGCCGCGTC TGGGCTATCA AGGGCGAATC TGCGCAGGGT GGGAAACGCA ACCCCGTCTG GCCAACCAAG CGACCGACAT CAAAAAGTAA AGCCAGCTTC AGGCCAATTA TACTTGGCGT GAACTCTGCG AAAGATGTTG TCCGTGGTCG TCTGCATCTT GAACCGCCAG CTTTAGGTAC TGCCGGTGCG GGCTATATGC ACTTCCCGGA TGATCGTGAC 
CTCGGCTATT TCAACCAGCT GCTGGCCGAG CGACTGGTTT ATAAAGTGGT GGCCGGTCAG CGATTCAGTG TCTGGGAGCC TATCCCCGGA CGGGCGAACG AAGCACTCGA CTGCCTCGTT TACAGCTATG CCGCGTTGTG TGGGCTGAAA CATATGGGAT TAAAACTCAA TGTTCGGGCC GCTAACCTTC AGGCCGATCC CGATAAGTTC CTGCCGGCGC CAGCCGAGCC AGAAGAAAAA ATCAATTACG AATTACCGGG TGCCATCGTG GATGAGGCTA TGGCTCCTGT TAAGCGTAAG AACATTTCTA AACTCCTGCC GCAATAA
|
Protein sequence | MSTNISQSSE NQSLTQRKIE RLQLSVRKGW TPPPRISVPQ WADDYRKLAK EAGSTSGNWE TSTVEIARGP MLAATESGVH IITVMCCTQL MKTALLENLF GYFAHLDPCP ILLLQPKEEA AEQFSKERIS PLVRVTPVLR NIIGDSKQKS SKETILYKAF TGGFLALAGA GSPDNLARRP IRVLLADEVD KYPITREGDP IALAEERTAT FGLNWLSVRA CSPTIEDESR IADSYEDSDQ RRASVVCPHC GHRQFLDFFK HVQWPKEGDK HLTKAAMIHC ECCGAGWSEG ERLRALQTIR WHQTKPFECC GSRHSPLMEY DQKWHEGDEG SVDAVWRWSE SERHAVYRAI CPDCGAEALD NHHAGYQASK LFSPWQKDKP SDIAKKYLDA KGDPDKEQAW WNTQMGLPHR PNHGKQLPVD VLLARREVFP AVVPDGVALL TAGVDTQDDR FEITITGWGR DEESWSVAHD VIYGDLETEE PWKRLDAYLK QIWRRGDGRG LNILAACMDS GGHHTQKVYE FCKDRLGRRV WAIKGESAQG GKRNPVWPTK RPTSKSKASF RPIILGVNSA KDVVRGRLHL EPPALGTAGA GYMHFPDDRD LGYFNQLLAE RLVYKVVAGQ RFSVWEPIPG RANEALDCLV YSYAALCGLK HMGLKLNVRA ANLQADPDKF LPAPAEPEEK INYELPGAIV DEAMAPVKRK NISKLLPQ
|
| |
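The sequence fields above can be related to the GC content and protein sequence with a short script. This is a minimal sketch using only the Python standard library; the helper names are hypothetical, and it assumes that the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, which this sketch does not model). The displayed sequences contain spaces every 10 characters, so whitespace is stripped first.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGTCCACAA ACATATCTCA GTCCTCGGAG"))
```

Passing the full Gene sequence value to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields of the record.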