Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2091 |
Symbol | |
ID | 6067309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2286958 |
End bp | 2288883 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641601499 |
Product | terminase GpA |
Protein accession | YP_001725058 |
Protein GI | 170020104 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.11571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATATAT CGAACAGTCA GGTTAACAGG CTGCGGCATT TTGTCCGCGC CGGGCTTCGC TCACTGTTCA GGCCGGAGCC ACAGACCGCC GTTGAATGGG CGGATGCTAA TTACTATCTC CCGAAAGAAT CCGCATACCA GGAAGGGCGC TGGGAAACAC TGCCCTTTCA GCGGGCCATC ATGAATGCGA TGGGCAGCGA CTACATCCGT GAGGTGAATG TGGTGAAGTC TGCCCGTGTT GGTTATTCCA AAATGCTGCT GGGTGTTTAT GCCTACTTCA TAGAGCATAA GCAGCGCAAC ACCCTTATCT GGTTGCCGAC GGATGGTGAT GCCGAGAACT TTATGAAAAC TCACGTTGAG CCGACCATCC GTGATATTCC TTCGCTGCTT GCGCTGGCCC CGTGGTATGG CAAAAAGCAC CGGGATAACA CGCTCACCAT GAAGCGTTTC ACCAATGGTC GTGGCTTCTG GTGCCTGGGC GGTAAAGCGG CAAAAAACTA CCGTGAAAAG TCGGTGGATG TGGCGGGTTA TGATGAACTT GCTGCTTTTG ATGATGATAT TGAACAGGAA GGCTCTCCGA CGTTCCTGGG TGACAAGCGT ATTGAAGGCT CGGTCTGGCC AAAGTCCATC CGTGGCTCCA CGCCCAAAGT GAGAGGCACC TGCCAGATTG AGCGTGCAGC CAGTGAATCC CCGCATTTTA TGCGTTTTCA TGTTGCCTGC CCGCACTGCG GGGAGGAGCA GTACCTTAAA TTTGGCGATA AAGAGACGCC GTTTGGCCTC AAATGGACGC CGGATGATCC CTCCAGCGTG TTTTATCTCT GCGAGCATAA TGCCTGCGTC ATCCGCCAGC AGGAGCTGGA CTTTACTGAT GCCCGTTATA TCTGCGAAAA GACCGGGATC TGGACCCGTG ATGGCATTCT CTGGTTTTCG TCATCCGGTG AAGAGATTGA GCCGCCGGAC AGTGTGACCT TTCACATCTG GACGGCGTAC AGCCCGTTCA CCACCTGGGT GCAGATTGTC AAAGACTGGA TGAAAACGAA AGGAGATACG GGAAAACGTA AAACCTTCGT GAACACCACG CTCGGTGAGA CGTGGGAAGC GAAAATCGGC GAACGTCCGG ATGCTGAAGT GATGGCAGAG CGGAAAGAGC ATTATTCAGC GCCCGTTCCT GACCGTGTGG CTTACCTGAC CGCCGGTATC GACTCCCAGC TGGACCGCTA CGAAATGCGC GTATGGGGAT GGGGGCCGGG TGAGGAAAGC TGGCTGATTG ACCGGCAGAT TATTATGGGC CGCCACGACG ATGAACAGAC GCTGCTGCGT GTGGATGAGG CCATCAATAA AACCTATACC CGCCGGAATG GTGCAGAAAT GTCGATATCC CGTATCTGCT GGGATACTGG CGGGATTGAC CCGACCATTG TGTATGAACG CTCGAAAAAG CATGGGCTGT TCCGGGTGAT CCCCATTAAA GGGGCATCCG TCTACGGTAA GCCGGTGGCC AGCATGCCTC GTAAGCGAAA CAAAAACGGG GTTTACCTTA CCGAAATCGG TACGGATACC GCGAAAGAGC AGATTTATAA CCGCTTCACA CTGACGCCGG AAGGGGATGA ACCGCTTCCC GGTGCCGTTC ACTTCCCGAA TAACCCGGAT ATTTTTGATC TGACCGAAGC GCAGCAGCTG ACTGCTGAAG AGCAGGTCGA AAAATGGGTG GATGGCAGGA AAAAAATACT GTGGGACAGC AAAAAGCGAC GCAATGAGGC ACTCGACTGC TTCGTTTATG CGCTGGCGGC GCTGCGCATC AGTATTTCCC GCTGGCAGCT GAATCTCAGT GCACTGCTGG CGAGCCTGCA GGAAGAGGAT GGTGCAGCAA CCAACAAGAA AACACTGGCA GATTACGCCC GTGCCTTATC CGGAGAGAAT GAATGA
|
Protein sequence | MNISNSQVNR LRHFVRAGLR SLFRPEPQTA VEWADANYYL PKESAYQEGR WETLPFQRAI MNAMGSDYIR EVNVVKSARV GYSKMLLGVY AYFIEHKQRN TLIWLPTDGD AENFMKTHVE PTIRDIPSLL ALAPWYGKKH RDNTLTMKRF TNGRGFWCLG GKAAKNYREK SVDVAGYDEL AAFDDDIEQE GSPTFLGDKR IEGSVWPKSI RGSTPKVRGT CQIERAASES PHFMRFHVAC PHCGEEQYLK FGDKETPFGL KWTPDDPSSV FYLCEHNACV IRQQELDFTD ARYICEKTGI WTRDGILWFS SSGEEIEPPD SVTFHIWTAY SPFTTWVQIV KDWMKTKGDT GKRKTFVNTT LGETWEAKIG ERPDAEVMAE RKEHYSAPVP DRVAYLTAGI DSQLDRYEMR VWGWGPGEES WLIDRQIIMG RHDDEQTLLR VDEAINKTYT RRNGAEMSIS RICWDTGGID PTIVYERSKK HGLFRVIPIK GASVYGKPVA SMPRKRNKNG VYLTEIGTDT AKEQIYNRFT LTPEGDEPLP GAVHFPNNPD IFDLTEAQQL TAEEQVEKWV DGRKKILWDS KKRRNEALDC FVYALAALRI SISRWQLNLS ALLASLQEED GAATNKKTLA DYARALSGEN E
|
| |