Gene Information |
Locus tag | Mnod_1998 |
Symbol | |
ID | 7305187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011894 |
Strand | + |
Start bp | 2100063 |
End bp | 2101379 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643599733 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_002497288 |
Protein GI | 220921987 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
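The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the stated gene length) and that the final codon of the CDS is a stop codon. The function names `feature_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # assumption: the last codon is a stop codon


# Values taken from the Start bp and End bp fields of this record.
gene_len = feature_length(2100063, 2101379)
print(gene_len)                  # 1317
print(protein_length(gene_len))  # 438
```

Both results agree with the Gene Length (1317 bp) and Protein Length (438 aa) fields.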
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCCC TCGAGGCGCC GCTTCGCGCC CTCCAGGAAA AGCGGGCCGG GATCGTCGCG CGCATGCGCG AGATCCTTCA GGCCGCCGAG GCCGAGGACC GCGACCTCAC GGCCGAGGAG GCGGAGAGCT ACGATGGCCT GAAGGCCGAC AAGGATGCTC TTGATCGGCG CATCGCGCGC CTGGAGGAGC AGGCCGGGCA TGAGGCCGCG TTGGAGGAGA CGCGCCCGGC GGTGTCCCGC CGCGCCGGTC CTCAGCCGGT CCGGCGGCAC GGCGAGGCCT CGACGCAGTT CGAGAGCCTG GGCGAGTTCA TGCACGCCGT GCGCTTCCGG CCGAACGACC AGCGCCTCGA CTTCCACGAG GGCATCGGCG CCTCGGAAGC AGAAGGTGCC CTGAGCGCCG AGATGCGGAT GGACGACGGG CCCTCCGGCG GCTTCGCGAT TCCGCCGCAG TTCCGCACCG AGCTGATGTC GGTGCGCCCG CAGGACTCGA TCGTGCGGTC GCGCGCGAAC GTGCTGCCCG CCGGCTCGCC GCCGGATGCC CCGGTGGTGA TCCCGGCCCT CGATCAGACT GGCGACGCCC CGCAGGGGAT GTTCGGCGGC GTGAAGGTGA CCTGGATCGA GGAGGGCGGG GAGAAGCCCG AGACCGACCT CAAGCTGCGC GAGATCATGC TGACGCCGCA CGAGGTCGCC GGCACGATCA CGATCGGCGA CAAGCTGCTG CGCAACTGGC AGACCTCCGA TACCCTGCTG CGCACCCAGC TGCGCGGCGC GGTCTCGGCG GCGGAGGACT ACGCCTTCCT GCGCGGCAAT GGCGTCGGCC GGCCGCTCGG CGCGATCCAT GCGCCGGCGG CCTACAAGGT GCCGCGGGCG CAGGCGACCA AGGTCACCTA CGTCGACCTC GTCACCATGC TCTCGCGGCT GCTGATGCGC GGCAGCAACC CCGTGTGGAG CGCGCCGCAG GCCGTCTTGC CGCAGATCAT GCTGCTCAAG GACGACCAGG GCCGCCTCAT CTGGCAGCCG AACGCGCAGG ACGGCATTCC CGGCACCCTG CTCGGCTACC CGCTGATCTG GAACAACCGG GCGCCGCTGC TCGGCACGCT CGGCGACGTC GTGCTGGCGG ACTGGTCCTC GTACCTGATC AAGGACGGCT CCGGCCCCTA CGTCGCGGCG TCGGAGCACG TGCACTTCAC CCGCAACAAG ACCGTGATCA AGGTCTTCTG GAACGTCGAC GGCGCGCCCT GGCTCACCGA GCCGATCAAG GAGGAGAACG GCTACGCCGT CTCGCCCTTC GTCGTGCTCG ACGTGCCCGC CGCGTGA
|
Protein sequence | MAALEAPLRA LQEKRAGIVA RMREILQAAE AEDRDLTAEE AESYDGLKAD KDALDRRIAR LEEQAGHEAA LEETRPAVSR RAGPQPVRRH GEASTQFESL GEFMHAVRFR PNDQRLDFHE GIGASEAEGA LSAEMRMDDG PSGGFAIPPQ FRTELMSVRP QDSIVRSRAN VLPAGSPPDA PVVIPALDQT GDAPQGMFGG VKVTWIEEGG EKPETDLKLR EIMLTPHEVA GTITIGDKLL RNWQTSDTLL RTQLRGAVSA AEDYAFLRGN GVGRPLGAIH APAAYKVPRA QATKVTYVDL VTMLSRLLMR GSNPVWSAPQ AVLPQIMLLK DDQGRLIWQP NAQDGIPGTL LGYPLIWNNR APLLGTLGDV VLADWSSYLI KDGSGPYVAA SEHVHFTRNK TVIKVFWNVD GAPWLTEPIK EENGYAVSPF VVLDVPAA
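The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how the Gene sequence, Protein sequence, and GC content fields relate, the code below joins the blocks, computes the G+C fraction, and translates codons. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical. The codon table is the amino acid assignment shared by the standard code and translation table 11; the sketch ignores the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG. Only the first three blocks of the gene sequence are used as input.

```python
# Codon-to-amino-acid assignment in TCAG order (same for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(blocked: str) -> str:
    """Remove the display whitespace between ten-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = join_blocks("ATGGCCGCCC TCGAGGCGCC GCTTCGCGCC")
print(translate(prefix))    # MAALEAPLRA
print(gc_fraction(prefix))  # 0.8
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence would be the way to check the reported GC content field.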