Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0924 |
Symbol | |
ID | 5594178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 928572 |
End bp | 930338 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640920094 |
Product | terminase, ATPase subunit |
Protein accession | YP_001457661 |
Protein GI | 157160343 |
COG category | [S] Function unknown |
COG ID | [COG5484] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 59 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCA CACTGACACC CGCAGATCTC GATCCCCGTC GGCAGGCCAT GCTGCTGTAC TTTCAGGGAT ACCGCGTAGC CCGCATTGCT GAAATGCTGG GCGAGAAAGT TGCAACCGTT CACAGCTGGA AAAAACGCGA CAAGTGGGGT GACTATGGGC CGCTGGATCA GATGCAGCTC ACCACCGCCG CACGCTACTG CCAGCTCATT ATGAAGGAGC ACAAAGAAGG GAAAGATTTC AAAGAGATTG ACCTGCTGGC GCGCCAGTCT GAGCGCCACG CGCGGATCGG CAAGTTTAAC AATGGCGGCA ACGAAGCCGA CTTGAACCCT AACGTCGCCA ACCGCAACAA AGGCCCGCGC CGTCAGCCGG AAAAAAATGT CTTCACCGAT GAACAGATTG AGAAGCTGGA AGAAATCTTC CATTCCTCCA TGTTCAACTA CCAGCGCCAC TGGTGGGAAG CCGGAAAAAC CAACCGCATC CGCAACCTGC TGAAGTCACG CCAGATCGGC GCGACCTTCT ATTTTGCCCG TGAAGCCCTG ATTGACGCCC TGCTGACCGG ACGTAACCAG ATTTTCCTTT CTGCCAGTAA GGCACAGGCC CACGTCTTCA AACAGTACAT CATCGACTTT GCCAAAGAAG TAGAGGTGGA GCTGAAAGGC GATCCGATGG TGCTTCCCAA CGGGGCCACA CTGTATTTCC TCGGCACCAA TGCCCGCACG GCCCAGAGTT ATCACGGCAA CCTGTATCTG GATGAATATT TCTGGATACC AAAATTCCAG GAGCTGCGCA AAGTGGCTTC CGGTATGGCT ATTCACAAAA AATGGCGACA AACCTATTTT TCCACGCCAT CCAGTCTGAC CCACAGTGCT TATCCGTTCT GGTCCGGTGC GCTGTTCAAC CGTGGGCGCA ACAAAGCTGA CAAGGTGGAC ATCGACCTGT CCCACAGCAA TCTGGCCCCC GGCCTGCTGT GCGCAGACGG GCAATACCGC CAGATAGTCA CCGTGGAAGA TGCGGTGCGC GGCGGCTGTA ACCTGTTCGA CCTCGACCAG TTGCGCATGG AGTACAGCCC GGACGAATAC CAGAACCTGC TGATGTGTGA GTTCGTGGAC GATCTCGCGT CCGTGTTCCC GCTCAGCGAG CTGCAGGCGT GCATGGTGGA CAGTTGGGAA GTCTGGACCG ACTTTCATGC ACTGGCCCTG CGCCCGTTTG GCTGGCGCGA AGTGTGGATC GGTTATGACC CGGCAAAAGG TACGCAAAAC GGCGACAGCG CCGGGTGCGT GGTGGTGGCA CCGCCAGCCG TGCCGGGCGG TAAGTTCCGC ATTCTTGAGC GTCACCAGTG GCGCGGGATG GACTTCCGCG CCCAGGCTGA CGCCATCAAA AAACTGACCG AACAGTACAA CGTGACATAC ATCGGCATTG ACTCGACAGG TGTCGGTCAC GGGGTTTACG AGAACGTGAA AGCGTTCTTT CCTGCCGTCC GGGAGTTTGT CTACAACCCC AACGTTAAAA ATGCCCTGGT ACTCAAAGCC TACGACATTA TCAGTCACCG TCGTCTGGAG TTTGACGCCG GACACACCGA CATAGCGCAG TCATTTATGG CAATCCGTCG CGCCACCACC GCCAGCGGCA ACCGCCCGAC CTATGAAGCC AGCCGCAGCG AAGAAGCCAG CCATGCCGAT CTGGCCTGGG CAACAATGCA CGCACTGTTT AACGAACCGC TGCAGGGCGA GTCCGCCAAT ACCAGCAATA TTGTGGAGAT TTTTTGA
|
Protein sequence | MNTTLTPADL DPRRQAMLLY FQGYRVARIA EMLGEKVATV HSWKKRDKWG DYGPLDQMQL TTAARYCQLI MKEHKEGKDF KEIDLLARQS ERHARIGKFN NGGNEADLNP NVANRNKGPR RQPEKNVFTD EQIEKLEEIF HSSMFNYQRH WWEAGKTNRI RNLLKSRQIG ATFYFAREAL IDALLTGRNQ IFLSASKAQA HVFKQYIIDF AKEVEVELKG DPMVLPNGAT LYFLGTNART AQSYHGNLYL DEYFWIPKFQ ELRKVASGMA IHKKWRQTYF STPSSLTHSA YPFWSGALFN RGRNKADKVD IDLSHSNLAP GLLCADGQYR QIVTVEDAVR GGCNLFDLDQ LRMEYSPDEY QNLLMCEFVD DLASVFPLSE LQACMVDSWE VWTDFHALAL RPFGWREVWI GYDPAKGTQN GDSAGCVVVA PPAVPGGKFR ILERHQWRGM DFRAQADAIK KLTEQYNVTY IGIDSTGVGH GVYENVKAFF PAVREFVYNP NVKNALVLKA YDIISHRRLE FDAGHTDIAQ SFMAIRRATT ASGNRPTYEA SRSEEASHAD LAWATMHALF NEPLQGESAN TSNIVEIF
|
| |