Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0697 |
Symbol | |
ID | 6263511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | - |
Start bp | 771318 |
End bp | 772517 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642611169 |
Product | HK97 family phage portal protein |
Protein accession | YP_001875589 |
Protein GI | 187251107 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family [TIGR01540] phage portal protein, PBSX family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 1.25481e-17 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATATAA TACAAAAAAT AGTAAAACAA TTAACCCGTG ATAAAGACGA AAAATCTTTC ACGGCGCCTA CGGTATTTGA ACTTACCGGT AAATACAGCC GCCTGCCTTC TCCTTCGGGA CGGAACTACG CTCCTTACGT AGAAGCTTAC GCCGATAAGC CTTGGATATA CTCAACAGTA TCGGTAATAG CTGAAACAGT ATCCTCCACT GAATTTCTTT TAAAAAACGC TAAAGGGGAA ATAATAACAA AACATCCTGT TCTTGAACTT ATGTATAAAC CAAACCCTCT TATGACGGGG CGCCAGTTAA GACAATGGAT AACGGCTAGC CTTGAACTTA CGGGCAACGC TTATATATTA AAAGACTCTT TACGCTCGGA CGGTTCCCCC GTGGAGCTGT TTCCTCTTTT AAGCCATTTG GTTGAGGTTG TACCCGGTAC CATGGCGGCC GAACCTGTAC AAGGGTATAA ATACAGAGTA GGCTCCAAAA CCGCTTATTA CAGGGCTAAG GATATTATTC ATATAAAATA TTTTAACCCT TTTGATTTTT TTTACGGATT GTCGCCTTTG GCGGCAGCAA GAGGAGCGGC CGACGCTATA GAATCAGCCG AAAATTATAA CAGAGCTTTT TTTGATAATT CCGCCACAAT ATCGGGTATA CTTTCAACCG AAAATAAATT AGACGACGCT ACCAGAACAC GCATAAGCAA AGCGTGGAAC GACAAATATA CTTCCGCCGC TAAAGCGCAT AAAGTAGCTT TATTAGAAGG CGGTTTAAAG TGGCAGTCTA TAGGAATGAG CCAAAAAGAT ATGGATTTTA TAAGCGGCGT TAAAATCAAT AGGGAAACAA TACTTTCCGT GTTTCATGTT CCTCCCGCGC TTGTAGGAAT TTTTGACCAC GCCCCTCAGT TTAACACGCG CGAACAACAG CGTATTTTTT ACCAGACCTG CGTACTGCCT AAACTTACTT TAATACTTGA ATCTTTAACG GAATTTTTAC TGCCTGATTT TGATTCCTCA CGCGAATTAT ATTTAACGCC TGATATAAGC GCGGTATCGG TATTAAAAGA CGACGAAGTG CAACGCGCCC AGGCGGCTAA ATTATATTTG GATATGGGTT TTGGGCGTGA TGAGGTTATT AACGCGTTAG GCTTGCCGTT TAGCGTAAGC ACTGTAAAAA AAGCAAAGAG GAAATTTTAA
|
Protein sequence | MNIIQKIVKQ LTRDKDEKSF TAPTVFELTG KYSRLPSPSG RNYAPYVEAY ADKPWIYSTV SVIAETVSST EFLLKNAKGE IITKHPVLEL MYKPNPLMTG RQLRQWITAS LELTGNAYIL KDSLRSDGSP VELFPLLSHL VEVVPGTMAA EPVQGYKYRV GSKTAYYRAK DIIHIKYFNP FDFFYGLSPL AAARGAADAI ESAENYNRAF FDNSATISGI LSTENKLDDA TRTRISKAWN DKYTSAAKAH KVALLEGGLK WQSIGMSQKD MDFISGVKIN RETILSVFHV PPALVGIFDH APQFNTREQQ RIFYQTCVLP KLTLILESLT EFLLPDFDSS RELYLTPDIS AVSVLKDDEV QRAQAAKLYL DMGFGRDEVI NALGLPFSVS TVKKAKRKF
|
| |