Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0832 |
Symbol | |
ID | 5590205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 850435 |
End bp | 852024 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640924542 |
Product | phage terminase large subunit (GpA) |
Protein accession | YP_001461957 |
Protein GI | 157158554 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.325192 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATATAT CGAACAGTCA GGTTAACAGG CTGCGGCATT TTGTCCGCGC CGGGCTTCGC TCACTGTTCA GGCCGGAGCC ACAGACCGCC GTTGAATGGG CGGATGCTAA TTACTATCTC CCGAAAGAAT CCGCATACCA GGAAGGGCGC TGGGAAACAC TGCCCTTTCA GCGGGCCATC ATGAATGCGA TGGGCAGCGA CTACATCCGC GAGGTGAATG TGGTGAAGTC TGCCCGTGTT GGTTATTCCA AAATGCTGTT GGGTGTTTAT GCCTACTTCA TAGAGCATAA GCAGCGCAAC ACACTTATCT GGTTGCCGAC GGATGGTGAT GCCGAGAACT TTATGAAAAC CCACGTTGAG CCGACCATCC GCGATATTCC GTTGCTGCTG GCGCTGGCTC CGTGGTATGG CAAAAAGCAC CGGGATAACA CGCTCACCAT GAAGCGTTTT TCCAATGGTC GTGGCTTCTG GTGCCTGGGC GGTAAAGCGG CAAAAAACTA CCGTGAAAAG TCGGTGGATG TGGCGGGTTA TGATGAACTT GCTGCCTTTG ATGAGGATAT TGAACAGGAA GGCTCTCCGA CGTTCCTTGG CGACAAACGT ATTGAAGGCT CGGTCTGGCC AAAGTCCATC CGTGGCTCCA CCCCCAAAGT GAGAGGCACC TGCCAGATTG AGCGTGCAGC CAGTGAATCC CCGCATTTTA TGCGTTTTCA TGTTGCCTGT CCGCACTGCG GGGAGGAGCA GTACCTTAAA TTTGGCGATA AAGAGACGTC GTTTGGCCTC AAATGGACGC CGGATGATCC CTCCAGCGTG TTTTATCTCT GCGAACATAA TGCCTGCGTC ATCCGCCAGC AGGAACTGGA CTTCACTGAT GCCCGTTATA TCTGCGAAAA GACCGGGATC TGGACCCGTG ATGGCATTCT CTGGTTTTCG TCATCCGGTG AAGAGATTGA GCCGCCGGAC AGCGTGACCT TTCACATCTG GACGGCGTAC AGCCCGTTCA CCACCTGGGT GCAGATTGTC AAAGACTGGA TGAAAACGAA AGGGGATACG GGAAAACGTA AAACCTTCGT AAACACCACG CTCGGTGAGA CGTGGGAGGC GAAAATTGGC GAACGTCCGG ATGCTGAAGT GATGGCAGAG CGGAAAGAGC ATTATTCAGC GCCCGTTCCT GACCGTGTGG CTTACCTGAC CGCCGGTATC GACTCCCAGC TGGATCGCTA CGAAATGCGC GTATGGGGAT GGGGGCCGGG TGAGGAAAGC TGGCTGATTG ACCGGCAGAT TATTATGGGC CGCCACGACG ATGAACAGAC GCTGCTGCGT GTGGATGAGG CCATCAATAA AACCTATACC CGCCGGAATG GTGCAGAAAT GTCGGTATCC CGTATCTGCT GGGATACTGG CGGGATTGAT CCGACCATTG TGTATGAACG CTCGAAAAAA CATGGGCTGT TCCGGGTGAT CCCCATTAAA GGGGCATCCG TCTACGGTAA GCCTGTGGCC AGCATGCCAC GTAAGCGAAA CAAAAACGGG GTTTACCTTA CCGAAATCGG TACGGATATC CAACAGCGTA TGGAAATATC ATTCACCTGA
|
Protein sequence | MNISNSQVNR LRHFVRAGLR SLFRPEPQTA VEWADANYYL PKESAYQEGR WETLPFQRAI MNAMGSDYIR EVNVVKSARV GYSKMLLGVY AYFIEHKQRN TLIWLPTDGD AENFMKTHVE PTIRDIPLLL ALAPWYGKKH RDNTLTMKRF SNGRGFWCLG GKAAKNYREK SVDVAGYDEL AAFDEDIEQE GSPTFLGDKR IEGSVWPKSI RGSTPKVRGT CQIERAASES PHFMRFHVAC PHCGEEQYLK FGDKETSFGL KWTPDDPSSV FYLCEHNACV IRQQELDFTD ARYICEKTGI WTRDGILWFS SSGEEIEPPD SVTFHIWTAY SPFTTWVQIV KDWMKTKGDT GKRKTFVNTT LGETWEAKIG ERPDAEVMAE RKEHYSAPVP DRVAYLTAGI DSQLDRYEMR VWGWGPGEES WLIDRQIIMG RHDDEQTLLR VDEAINKTYT RRNGAEMSVS RICWDTGGID PTIVYERSKK HGLFRVIPIK GASVYGKPVA SMPRKRNKNG VYLTEIGTDI QQRMEISFT
|
| |