Gene Information |
Locus tag | Mlg_0644 |
Symbol | |
ID | 4270833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 694910 |
End bp | 696049 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638125392 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_741488 |
Protein GI | 114319805 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
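The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration that assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Mlg_0644.
length = gene_length_bp(694910, 696049)
print(length)                     # 1140
print(protein_length_aa(length))  # 379
```

Both results agree with the Gene Length (1140 bp) and Protein Length (379 aa) fields listed in the record.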
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.146587 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCATC AAGTGAAGGA AATGGTCGAG GGCGTGGCCC GAGACTTCGA GGGGTTTAAG TCCCGCCAGG ATCAAGCCCT GTCCGACCTG GGCGAGCGCG TGAAGCGTGC CGAGACCCTG GCCGCCCGCA AGGTGGGCAT GACCGTGGAC AGCGGCGCAG GCATGGGGCC GGAGACCAAG GCGGTCCGTG AGTGGCTGCG TGGTGGCGAC TTCGACCGCA AGGCCCTGAG CATCGAGGAT GACGGCCAAG GCGTGACCGT GCGCAGCGAT TGGGCGGATC AGATTTTCAA GCGCATTCGG GAATCCAGCC CGGTCCGGCA GGTGGCTAAT AACCTGTCCA CCAATTCCAA CGAATTGGAG GTTCTGGTTG ACCGTGGTGA ACCGGACTCA GCGTGGGTGG CTGAAAAGGG CGACCGCGAT CCGACTGCCG CTAGTTTCAT GTCTCGGCAT AAGATTGCGG TGCATGAACA TTATGCGTAT CCGAGCGTAA CCCAGCAATT CCTGGAGGAT AGCCGGCTCG ACCTGGAGCA GTGGTTGCAG GACAAGATCG GAACCCGTTT CGGTCGGCAG GAAGCCGAAT CCTTTATCAA GGGTGACGGC AACGGCAAGC CTCGCGGCAT TCTGGATTAC GACACCGTGC CGGATGGTGA TTTCGAGTGG GGCGCTGATC CTGCCGATTA CACCATCGGG GCGATCTATA CCGGTGAATC GGGTGACTTC CCGAGCAACA ATCCCGATAA CGTGCTCTAC GATGTTGTGG ACGCGCTCAA GTCGGATTAC CTGGGCAATG CACGGTTCAT GATGAGCCGC GCCACGATGA ACAAAATTCG GAAGCTGCGG GATGGTGACG ACCGTGCCCT GCTCCAGATG AGCCTGGCGG AAGGTCGGCC CAACACGCTT CTTGGGTTCC CGGTGGTGAT TGCCGAGGAT ATGCCGGACC CGGCTGCCGA TTCCGAGTCG ATCCTGTTCG GTGACTTCGG CCAGGCTTAC ACCATCGTTG ACCGGATCGG AGTAAGCGTG CTGCGTGATC CCTACTCCCT GCCCGGATGG GTCCGCTGGT ATGTGCGCAA GCGGATCGGT GGGGCGCTGA CCAACCCCGA GGCCGTGAAG GCCGTGGTGT TCGGCGCTGA GCCGAGCTGA
|
Protein sequence | MTHQVKEMVE GVARDFEGFK SRQDQALSDL GERVKRAETL AARKVGMTVD SGAGMGPETK AVREWLRGGD FDRKALSIED DGQGVTVRSD WADQIFKRIR ESSPVRQVAN NLSTNSNELE VLVDRGEPDS AWVAEKGDRD PTAASFMSRH KIAVHEHYAY PSVTQQFLED SRLDLEQWLQ DKIGTRFGRQ EAESFIKGDG NGKPRGILDY DTVPDGDFEW GADPADYTIG AIYTGESGDF PSNNPDNVLY DVVDALKSDY LGNARFMMSR ATMNKIRKLR DGDDRALLQM SLAEGRPNTL LGFPVVIAED MPDPAADSES ILFGDFGQAY TIVDRIGVSV LRDPYSLPGW VRWYVRKRIG GALTNPEAVK AVVFGAEPS
|
| |
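The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The sketch below is an illustration under stated assumptions, not the pipeline used to build this record: it uses the amino acid assignments of the standard genetic code (which translation table 11 shares), ignores the alternative start codons that table 11 permits (this gene starts with ATG, so they do not matter here), and strips the spaces that separate the 10-base blocks. The names `translate` and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(dna: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, in frame, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGACGCATC AAGTGAAGGA AATGGTCGAG"))  # MTHQVKEMVE
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the GC content field.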