Gene Information |
Locus tag | Mchl_1705 |
Symbol | |
ID | 7116850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 1754142 |
End bp | 1755407 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643524469 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_002420496 |
Protein GI | 218529680 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
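The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its terminal stop codon (the gene sequence in this record ends in TGA).

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends
    with one stop codon, which is not counted in the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(1754142, 1755407)
print(gene_len, protein_len)
```

Under those assumptions the result agrees with the Gene Length (1266 bp) and Protein Length (421 aa) fields.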
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.154021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.125329 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA TGCGTTTCGA GACCAAGGCC CCCACCGGCC TACCCGAGAA CAAGGCGGCG ACCTTCGGCA CCGAAGCGGT GCTCGACGAG TTCGCCCGCG CCTTCGAGGC GTTCAAGGAG GCCAACGACG TCCGCCTCTC CGAGATCGAG ACCCGGCTCA CCGCGGATGT GGTTACGGAG GAGAAGCTCA TCCGCATCGA CGCCGCCCTC GATCAGGCGA AGAACCGCCT CGACCGGATC AGCCTCGACC GTGCCCGGCC GCCGCTCGGC GGGACGGAGC CGGCCCGCGA CGCCTCCGCC ACCGAGCACA AGGCGGCCTT CGACCTTTAT GTTCGGGCCG GCGAGAGCGC CGGCCTCAAG CGGCTGGAAG AAAAGGCACT TTCCGCCGGC TCCGGGCCGG ATGGCGGCTA CCTCGTGCCG CCGACGATCG AGCGCGAGGT GCTGCGTCGG CTCGCCGAAA TCTCGCCGAT CCGCGCGATC GCCACGGTGC GGACCGTCTC CGGCGGCCAG TACAAGCGAG CCGTCTCGGT CAACGGTCCC GCCGCCGGCT GGGTCGCCGA GACCGCGCCC CGGCCGCAGA CCGACACGCC AAACCTGTCC GAGCTGAGCT TTCCGGCGAT GGAGCTCTAC GCCATGCCGG CCGCGACCCA GACGCTGCTC GACGACGCGG TGCTCGATAT CGATGCGTGG CTCGCCGAGG AGGTCGAGAC GGCCTTCGCC GAGCAGGAGA GCGTCGCCTT CGTCACCGGC AATGGCGTCG GTCGGCCGAA GGGCTTTCTC AGCTACGACA CCGTCGCCAA CGCGAACTGG GCTTCGGGCA GGCTCGGCTT CATCGCGACG GGGGCGGCCG GCGCCTTCCC CGCGAGCAAC CCGAGCGACG TGCTGTTCGA TCTCATCTAC GCACTGCGCG CCGGCTATCG CCAGGGTGCG AGCTTCGTGA TGAACCGGCG GGTGCAGAGC GCGATCCGCA AGTTCAAGGA CGCGGACGGC AACTACCTCT GGCAGCCGCC GCTTGCCGCC GACCGGGCCG CGACGCTGAT GGGCTTTCCG CTGGTCGAGG CCGAGGCGAT GCCCGACGTC GCCGCCAGCA GCCACGCCAT CGCCTTCGGC GACTTCAAGC GCGGCTACCT CGTCGTAGAC CGCGTCGGCC TACGGACCCT GCGCGATCCC TACTCCGCCA AGCCCTACGT GCTGTTCTAC ACCACCAAGC GCGTCGGCGG CGGGGTGCAG GACTTTGCCG CGATCAAGCT GCTCCGGTTC GCCTGA
|
Protein sequence | MTEMRFETKA PTGLPENKAA TFGTEAVLDE FARAFEAFKE ANDVRLSEIE TRLTADVVTE EKLIRIDAAL DQAKNRLDRI SLDRARPPLG GTEPARDASA TEHKAAFDLY VRAGESAGLK RLEEKALSAG SGPDGGYLVP PTIEREVLRR LAEISPIRAI ATVRTVSGGQ YKRAVSVNGP AAGWVAETAP RPQTDTPNLS ELSFPAMELY AMPAATQTLL DDAVLDIDAW LAEEVETAFA EQESVAFVTG NGVGRPKGFL SYDTVANANW ASGRLGFIAT GAAGAFPASN PSDVLFDLIY ALRAGYRQGA SFVMNRRVQS AIRKFKDADG NYLWQPPLAA DRAATLMGFP LVEAEAMPDV AASSHAIAFG DFKRGYLVVD RVGLRTLRDP YSAKPYVLFY TTKRVGGGVQ DFAAIKLLRF A
|
| |
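The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own method: the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), and it assumes the gene sequence is already given in coding orientation, as it is here (it begins with ATG and ends with TGA despite the gene lying on the minus strand).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base slowest, bases T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the display spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two display blocks of the gene sequence, plus one base to complete a codon.
prefix = "ATGACCGAGA TGCGTTTCGA G"
print(translate(prefix))  # MTEMRFE, matching the start of the protein sequence
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field is a quick way to confirm that the two entries correspond.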