Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0309 |
Symbol | |
ID | 5593467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 312144 |
End bp | 313559 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640919495 |
Product | PBSX family phage terminase large subunit |
Protein accession | YP_001457081 |
Protein GI | 157159763 |
COG category | [R] General function prediction only |
COG ID | [COG1783] Phage terminase large subunit |
TIGRFAM ID | [TIGR01547] phage terminase, large subunit, PBSX family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 61 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTCGA TTAATCCTAT CTTTGAACCG TTCATTGAGG CGCATCGCTA CAAAGTCGCC AAAGGCGGTC GAGGTAGCGG TAAGTCATGG GCAATTGCGA GGCTGCTTGT TGAAGCGGCG CGTCGGCAGC CAGTGCGTAT TCTTTGCGCT CGTGAACTGC AAAACAGTAT CAGCGATTCG GTAATCCGGT TGCTTGAAGA TACCATCGAG CGTGAAGGGT ATTCGGCTGA GTTTGAAATT CAGCGTTCCA TGATTCGTCA TCTCGGAACG AATGCTGAAT TCATGTTCTA CGGCATCAAA AACAACCCGA CGAAGATTAA ATCGCTCGAA GGCATTGATA TCTGCTGGGT GGAAGAAGCG GAAGCGGTAA CGAAGGAATC ATGGGATATC CTGATACCAA CCATCCGTAA GCCGTTCTCT GAAATATGGG TGAGCTTTAA CCCGAAGAAC ATACTCGACG ATACCTATCA GCGATTCGTT GTAAATCCTC CCGATGATAT TTGCCTGCTG ACGGTGAACT ACACCGACAA CCCGCACTTT CCTGAAGTTC TCCGTCTGGA GATGGAAGAG TGTAAACGCA GAAACCCGAC ACTGTATCGT CACATCTGGC TTGGTGAGCC AGTAAGCGCA AGTGATATGG CAATCATCAA ACGTGAATGG CTTGAAGCCG CAACCGATGC GCACAAGAAA CTCGGATGGA AAGCGAAAGG CGCTGTTGTC TCTGCACATG ATCCATCAGA TACAGGGCCA GATGCTAAAG GTTACGCATC GCGCCACGGT TCGGTAGTTA AGCGCATTGC CGAAGGTCTG CTGATGGACA TCAACGATGG TGCTGACTGG GCTACTTCGC TGGCGATTGA AGACGGCGCT GACCACTACC TGTGGGATGG TGATGGTGTT GGTGCCGGGC TACGCAGACA GACAACGGAA GCGTTCTCCG GCAAGAAAAT CACCGCCACG ATGTTCAAGG GCAGCGAATC GCCATTCGAT GAAGATGCAC CGTATCAGGC CGGAGCATGG GCCGATGAAG TCGTACAGGG CGACAACGTT CGCACTATTG GCGATGTATT CCGCAATAAG CGAGCGCAAT TCTATTACGC GCTGGCTGAC AGGCTGTATC TGACATATCG GGCGGTTGTT CATGGTGAGT ATGCAGACCC CGACGACATG CTGAGTTTCG ACAAAGAAGC GATAGGCGAG AAGATGCTGG AGAAGCTGTT TGCAGAACTG ACGCAGATTC AGCGCAAATT CAATAACAAC GGGAAGCTGG AGCTAATGAC TAAGGTCGAA ATGAAGCAGA AGCTCGGTAT TCCATCTCCT AACCTGGCTG ATGCGCTGAT GATGTGTATG CATTGCCCGG AGTCGGCTGC GCAACCCGAC TATTCCAGTT ACTCAATTCC TTGTGGTGTA GGTTGA
|
Protein sequence | MTSINPIFEP FIEAHRYKVA KGGRGSGKSW AIARLLVEAA RRQPVRILCA RELQNSISDS VIRLLEDTIE REGYSAEFEI QRSMIRHLGT NAEFMFYGIK NNPTKIKSLE GIDICWVEEA EAVTKESWDI LIPTIRKPFS EIWVSFNPKN ILDDTYQRFV VNPPDDICLL TVNYTDNPHF PEVLRLEMEE CKRRNPTLYR HIWLGEPVSA SDMAIIKREW LEAATDAHKK LGWKAKGAVV SAHDPSDTGP DAKGYASRHG SVVKRIAEGL LMDINDGADW ATSLAIEDGA DHYLWDGDGV GAGLRRQTTE AFSGKKITAT MFKGSESPFD EDAPYQAGAW ADEVVQGDNV RTIGDVFRNK RAQFYYALAD RLYLTYRAVV HGEYADPDDM LSFDKEAIGE KMLEKLFAEL TQIQRKFNNN GKLELMTKVE MKQKLGIPSP NLADALMMCM HCPESAAQPD YSSYSIPCGV G
|
| |