Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0795 |
Symbol | |
ID | 4270558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 886105 |
End bp | 887244 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638125545 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_741639 |
Protein GI | 114319956 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCATC AAGTGAAGGA AATGGTCGAG GGCGTGGCCC GAGACTTCGA GGGGTTTAAG TCCCGCCAGG ATCAAGCCCT GTCCGACCTG GACGAGCGCG TGAAGCGTGC CGAGACCCTG GCCGCCCGCA AGGTGGGCAT GACCGTGGAC AGCGGCGCAG GCATGGGGCC GGAGACCAAG GCGGTACGCG AGTGGCTGCG TGGTGGCGAC TTCGACCGCA AGGCCCTGAG CATCGAGGAT GACGGCCAAG GCGTGACCGT GCGCAGCGAT TGGGCGGATC AAATCTTTAA GAAGATTCGC GAATCCAGCC CCGTCCGCCA GGTGGCTAAT AACCTGTCCA CCAATTCCAA CGAATTGGAG GTTCTGGTTG ACCGTGGTGA ACCGGACTCA GCGTGGGTGG CTGAAAAGGG CGACCGCGAT CCGACTGCCG CTAGTTTCAT GTCTCGGCAT AAGATCGCCG TGCATGAGCA CTACTCGTAT CCGAGCGTAA CCCAGCAATT CCTGGAGGAT AGCCGGCTCG ACCTGGAGCA GTGGTTGCAG GATAAAATCG GCACCCGTTT CGGTCGGCAG GAAGCCGAAT CCTTTATCAA GGGTGACGGC AACGGCAAGC CTCGCGGCAT TCTGGATTAC GACACCGTGC CGGATGGTGA TTTCGAGTGG GGCGCTGATC CTGCCGATTA CACCATCGGG GCGATCTATA CCGGCGAATC GGGTGACTTC CCGAGCAACA ATCCCGATAA CGTGCTCTAC GATGTTGTGG ACGCGCTCAA GTCGGATTAC CTGGGCAATG CACGGTTCAT GATGAGCCGC GCCACGATGA ACAAAATTCG GAAGCTGCGG GATGGTGACG ACCGTGCCTT GCTCCAGATG AGCCTGGCGG AAGGTCGGCC CAATACCCTG CTGGGGTTCC CGGTGGTGAT TGCCGAGGAT ATGCCGGACC CGGCGGCGGA TTCCGAGTCG ATCCTGTTCG GTGACTTCGG CCAGGCGTAC ACCATCGTTG ACCGGATCGG GGTAAGCGTG CTGCGTGATC CCTACACCCT GCCCGGCTGG GTCCGCTGGT ACGTCCGCAA GCGTATCGGC GGGGCGCTGA CCAACCCCGA AGCCCTGAAG GCCGTGGTGT TCGGCAGCGA GCCGAGCTGA
|
Protein sequence | MTHQVKEMVE GVARDFEGFK SRQDQALSDL DERVKRAETL AARKVGMTVD SGAGMGPETK AVREWLRGGD FDRKALSIED DGQGVTVRSD WADQIFKKIR ESSPVRQVAN NLSTNSNELE VLVDRGEPDS AWVAEKGDRD PTAASFMSRH KIAVHEHYSY PSVTQQFLED SRLDLEQWLQ DKIGTRFGRQ EAESFIKGDG NGKPRGILDY DTVPDGDFEW GADPADYTIG AIYTGESGDF PSNNPDNVLY DVVDALKSDY LGNARFMMSR ATMNKIRKLR DGDDRALLQM SLAEGRPNTL LGFPVVIAED MPDPAADSES ILFGDFGQAY TIVDRIGVSV LRDPYTLPGW VRWYVRKRIG GALTNPEALK AVVFGSEPS
|
| |