Gene Information |
Locus tag | Emin_1086 |
Symbol | |
ID | 6263441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | - |
Start bp | 1177206 |
End bp | 1178408 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 642611566 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001875975 |
Protein GI | 187251493 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
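The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1177206
end_bp = 1178408
reported_gene_length = 1203     # bp
reported_protein_length = 400   # aa

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon, which does not encode a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length % 3, protein_length)
```

Under these assumptions the computed gene length is a whole number of codons and agrees with the reported gene and protein lengths.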
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 99 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAAA TAATGCAATC CATAAAAGAA TTAAGAAAAA CTTTAGAAAC TAAAATCGAT TCATGCATTA CTAAAGAAAA GGCTGAAAAC ATAGTTTCTG ATTTAGTAAA AAAAGTTCAC CCCGAACAAA GAAAGGCTTT ACTTCCTACC TCGGCGGACG ATGTTCTGGA ACGTTTTGCC GAATTTTCAA AATCCTCAAA ACACGCGCCT GAAAAACCTT GGACAAGCGA TTACGGCAGA AAGTTTGGCA ACATGAAAAA CTTTCTTTTA GCGGCTAAAG ACAGGCACGC TTCTTTCGCG GACAGCAAAT CAGTTTTAGC CGAATCAGGC GCTGGCGGCG GCGGATATCT TGTTCCCACG GAATTTTCAA ACCATGTTGC CCGTATAATG GCCGATATTT CACCCATTAT GCAAATTGCC AACGTTGTGC CCATGGGCTC TTGGAAAAGG CAAATACCTA AGCAGATTTC TAATTTAAGC GTAGGCTGGG TTGCCGAGAA CGGTGTAAGG GGCATTAATA ACCCCGCGTT TGGGCAAATT GAACAGGTAG CAAAAGTAAT GGCAACAGTT ATCAAATGCA CGGACGAGCT TATAAGAGAC AGCGCCATTA ATTTAACGCA GTTCTTATCC GAGCTTGTGG CTGAAGCCAT GGCATTGGAA GTTGAAAGAG TTTCCTTAGT TGGCGACACG GCCGCCGGCG ATCCTTTTAA CGGCGTTTAT AACACGGCAG GATTACAAAA TGTAACAATG GGCGGGCAAA ACGTTTCTTT TGATGATATT GCTAACTTAA TTTTTCAACT AAACGATGCT AACGCCGCAG GCAGTGTTTT GGTCCTTTCC CGCACCGGTC TTAGCAAACT TTTAAAATTA AAAGACTCTA ACGGCAATTA TTTGTGGCAG CCCCCCGCGG GCGGCGCTCC GGCTACAATT TGGAACACTC CTTATGTCGT AAGCTCTAAA ATACCTAATG ATGTCGACGG TGATAAAACG ATAGCTCTTT TCGGACGTTT TAATAAACAC CTTCTTATCT CACCCAGACA GGAAATGGCG GTAAAAGTTT CACAGGACGC CTCTTCATGG AACGCGGCTT CCGAAACGGC CGACTCCGCA TTTATGTTAG ACCAAACCTG GCTGCGTTTT ACTCAGGCAC TTTCAATAGA CGTTACATTC GGAAGCGCCT TTAGCTGCCT TAAATTCAAA TAA
Protein sequence | MDEIMQSIKE LRKTLETKID SCITKEKAEN IVSDLVKKVH PEQRKALLPT SADDVLERFA EFSKSSKHAP EKPWTSDYGR KFGNMKNFLL AAKDRHASFA DSKSVLAESG AGGGGYLVPT EFSNHVARIM ADISPIMQIA NVVPMGSWKR QIPKQISNLS VGWVAENGVR GINNPAFGQI EQVAKVMATV IKCTDELIRD SAINLTQFLS ELVAEAMALE VERVSLVGDT AAGDPFNGVY NTAGLQNVTM GGQNVSFDDI ANLIFQLNDA NAAGSVLVLS RTGLSKLLKL KDSNGNYLWQ PPAGGAPATI WNTPYVVSSK IPNDVDGDKT IALFGRFNKH LLISPRQEMA VKVSQDASSW NAASETADSA FMLDQTWLRF TQALSIDVTF GSAFSCLKFK