Gene Information |
Locus tag | Emin_0848 |
Symbol | |
ID | 6262569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 936793 |
End bp | 937986 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642611326 |
Product | PBSX family phage terminase large subunit |
Protein accession | YP_001875740 |
Protein GI | 187251258 |
COG category | [R] General function prediction only |
COG ID | [COG1783] Phage terminase large subunit |
TIGRFAM ID | [TIGR01547] phage terminase, large subunit, PBSX family |
|
|
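The coordinate and length fields in the Gene Information table are related by simple arithmetic. The sketch below re-derives the reported lengths. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon which is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 936793
end_bp = 937986

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(f"Gene length: {gene_length_bp} bp")
print(f"Protein length: {protein_length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1194 bp) and Protein Length (397 aa) fields.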
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00062306 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACAGA AAAACCTAAT TAAACCCAAA ATAGGACCTG TCTTTAAATT AAATAATAAA GCGGGCAAAA GAACCGTTAT AAACGTGGGC GGGGCAAGAA GCGGCAAAAG CCACGCCGTG GCGCAGCTTT TAATAATGCG CGCTTTAAAC CTGCAGGGTA TTAACGTGGG TATAACGCGC AAAACAATGC CCGCTTTAAA AATGACGGCG GCGCGGCTTG TAACGGACCT TCTTAAAGAA TATGGCCTTT ACTCCGAAAA AAACCATAAT AAAATGGAGC ATTATTATAA TTTGGGTAAA AGCAGAATAC AGTTTTTTTC TTTAGATAAC CCGGAGAAGA TAAAATCAGC TGAGTTTAAC TATATTTGGA TGGAGGAGGC AACGGAATTT ACGTATGAGG ACTACGTTAC CCTTCTTACC CGTCTTTCCG CTCCCATAAA AGAGCCTTAC AAAAACCAAA TATTTTTAAC GTTAAACCCT TCGGACTCAA ATTCTTGGAT AGCAAAAAAA CTGCTTTCAG CACAAAACAC GCAAATTATA AAAAGTTCTT ATAAGGACAA TCCTTTTTTA AGCAAAGATT ATATTAACAC TTTGCTGGGT TTAAAAGATA TTGACGAGAA TTATTACCGT GTTTTCGCTT TGGGCCAATG GGGCGCTAAC AAAAATATTG TTTATGACAA CTATACTTTT GTTGACGAAA TAAAAAACAC GGACAATGTT ATCTGGGGTC TTGATTTCGG GTTTAACAAC CCGTCTGCGC TTGTTAAACT TTATATATCG GACGAAGGTG TTTACACCGA GGAAAAACTT TACAAAAGTG GACTTACAAA CAGCGCGCTT ATAAAAAATT TAGCAGAAAT TATACCCCCC TCACAAAGGC ACGAAAGTAT TTACGCCGAC GCGGCCGAGC CTGCCCGCAT AGCCGAAATA AGTGAAGCCG GTTTTAACAT ACACCCGGCT TTGAAAGATG TAAAAGCGGG TATTTTAAGC GTAAAAACCA AAAAACTTTT TATAAACAAA AACTCATCAA ATCTTATAAA AGAGATTCAA GGTTACTGCT GGAAAACGGA CTTAAACGGC AATGCGCTTG AAGAAGCGGT TAAATTTAAC GACCACGCGC TTGACGCCTT ACGTTACGCT TTGCACACTC ATTTTTTTAT ATCTGGAAAG AAACCCGATG TAAGTTTTTT TTAA
|
Protein sequence | MKQKNLIKPK IGPVFKLNNK AGKRTVINVG GARSGKSHAV AQLLIMRALN LQGINVGITR KTMPALKMTA ARLVTDLLKE YGLYSEKNHN KMEHYYNLGK SRIQFFSLDN PEKIKSAEFN YIWMEEATEF TYEDYVTLLT RLSAPIKEPY KNQIFLTLNP SDSNSWIAKK LLSAQNTQII KSSYKDNPFL SKDYINTLLG LKDIDENYYR VFALGQWGAN KNIVYDNYTF VDEIKNTDNV IWGLDFGFNN PSALVKLYIS DEGVYTEEKL YKSGLTNSAL IKNLAEIIPP SQRHESIYAD AAEPARIAEI SEAGFNIHPA LKDVKAGILS VKTKKLFINK NSSNLIKEIQ GYCWKTDLNG NALEEAVKFN DHALDALRYA LHTHFFISGK KPDVSFF
|
| |
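The Sequence fields can be checked against the Translation table and GC content fields with a short script. This is a sketch under stated assumptions, not the database's own method: the helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, the codon table is built from the amino-acid assignments of NCBI translation table 11 (which match the standard code), and alternative start codons are not handled. Only the first three 10-base blocks of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# NCBI translation table 11 amino-acid assignments, in T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field (illustrative excerpt only).
sample = clean_sequence("ATGAAACAGA AAAACCTAAT TAAACCCAAA")

print(translate(sample))             # MKQKNLIKPK, the first block of the protein
print(round(gc_percent(sample), 1))  # GC percentage of this excerpt only
```

The translated excerpt matches the first 10 residues of the Protein sequence field. Running `gc_percent` on the complete gene sequence gives the figure to compare with the reported GC content.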