Gene Information |
Locus tag | Mnod_5067 |
Symbol | |
ID | 7303760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011894 |
Strand | - |
Start bp | 5142798 |
End bp | 5144777 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643602697 |
Product | peptidase U35 phage prohead HK97 |
Protein accession | YP_002500216 |
Protein GI | 220924914 |
COG category | [R] General function prediction only |
COG ID | [COG3740] Phage head maturation protease |
TIGRFAM ID | [TIGR01543] phage prohead protease, HK97 family |
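
Note: the coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mnod_5067
length = gene_length(5142798, 5144777)
print(length)                  # 1980
print(protein_length(length))  # 659
```

Both results agree with the Gene Length (1980 bp) and Protein Length (659 aa) fields.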
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGAG CTTATTCCAT CCTGACCGTG AAAGCGGTCG AGGACGAGCA GCGCGTCATC CGGGGTGTGG CGACCACGCC GTCTCCGGAT CGGGTCGGCG ACATCGTCGA GCCGCTCGGG GTCAGGTTCA CCAATCCGAT GCCGCTGCTG CACCAGCACG ATGCGGATCG GCCCGTCGGC ACGGTGCGCT TCGACAAGCC GACCAAGGAC GGCATCACCT TCGAGGCGCG ACTTCCGAAG ATCGCGGAGC CCGGCCCGCT CAAGGACCGG GTCGAGACCG CCTGGGGCGA GGTCAAGGCC GGCCTGGTGC GTGCCGTCTC GATCGGCTTC CGCACCCTGC CGGACGGCTA CGAGATCATG CAGGACGGCG GCATCCGCTA TCTCAAGACC GAGGTGCTGG AGCTCAGTCT TGTGACCGTG CCGGCCAATG CGGACGCCAG GATCTCCCTG ATCAAGTCGA TCGATCGCCC GCTGCTCGCC GCGTCCGGCA AGGAGCCGCG GGCGGATGAT CGGCCCTCAC CCCCCGGCGC TTCGGGATCC CAACCACACC AGAGCTTGCC GAAAGGGGCA TGCATCATGC CGAAGACCAT TGCCGAACAG ATTGCCGCGT TCGAGGCGAC GCGCCAGGCC AAGTCGGCCC GGATGACCGA GCTGATGAAC CAGGCGGCCG AGAACGGGGT CACCCTCGAT GCGGCGGAGA CCGAGGAGTA CGACAGCCTC GCCGAGGAGG TGAAGGCGAT CGACGCGCAT CTCGTGCGCC TCGCGGATCT CGAGAAGGCC AACCGCGCCG CGGCCGTGCC AGTCGAGGGT GTGCGCAGCG CCGAGACCGG CTCGCAGCTG CGCGCGGGCG TGCGCATCGA GGTGAAGGGC CCCACCCTGC CGAAAGGCAC CGCCTTCACC CGCTACGCCA TGGCGCTGAT GCGGGCCAAG GGCAACCTGA TGCAGGCCGA GCAGATCGCC AAGGGCTGGC ACGACAGCAC GCCCGAGGTG GAGACCGTGC TCAAGGCGGC GGTGGCGGCC GGCACGACCA CGGACACCGC CTGGGCCAAG CCGCTGGTCG AGTACCAGAC CATGGCCTCC GAGTTCGCGG AGCTGCTCCG CCCGGCCACC ATCCTCGGCC GCATCCCGGG CCTGCGCCGG GTGCCGTTCA ACATCAAGGT GCCGCGCCAG ACCGGCGGCT CGACCATCGG CTGGGTCGGC CAGGGGGCGC CGAAGCCGGT CGGCAAGCTC TCGTTCGACC AGATCACGCT CGGCATGGCC AAGACGGCCG GCATCGTGGT GATGTCGGAC GAGCTGGTGC GCTCCTCCAA CCCCTCGGCC GAGGCGATCG TCCGCCAGGA CATGATCGAC CAGACCGCGC AGTTCCTCGA CCAGCAGTTC GTCGATCCGA GCGTGACGGC GGTGGCCGAC GTGTCGCCGG CCTCCGTGAC CCATGGCGTC ACGCCAGTGA CCGCCAGCGG CACCGACGCG GATGCGGTGC GGGCCGACGT GCGCGCCGTG ATGGGCAGGT TCATCAGCGC CAACATGTCG CTGGCCGGGG CCGTCTGGAT GATGACCGAG ATGCAGGCGC TCGGCCTGGC GCTGATGCTG AACCCGCTCG GCCAGCCGGA ATTCCCGGGC CTGGTGGTGA ACGGCAACAG CGGCGGCACC TTCTTCGGCC TGCCGGTGAT CCTGTCCGAG AACATCCCGG CCAACCCCGG TTCGGGCACC CCCGTGACCG GCGCGGGCTC GCGCCTGATC CTGGCCAAGG CGAGCGAGAT CCTGCTGGCC GACGACGGCG AGGTGGTGCT CGATGCCAGC 
CGCGAGGCCT CCCTGCAGAT GGACAGCGCG CCCGACAACC CGCCGAGCGC CAGCACCGTG CTGGTGTCGC TCTGGCAGAA CAACCTGGTG GGTCTCAAGG CGGAGCGGTT CATCAACTGG AGCAAGCGCC GGGATGGCGC CGTGCAGTAC ATCGACGCGG CCAACTACGG CTCCGCCTAA
|
Protein sequence | MNRAYSILTV KAVEDEQRVI RGVATTPSPD RVGDIVEPLG VRFTNPMPLL HQHDADRPVG TVRFDKPTKD GITFEARLPK IAEPGPLKDR VETAWGEVKA GLVRAVSIGF RTLPDGYEIM QDGGIRYLKT EVLELSLVTV PANADARISL IKSIDRPLLA ASGKEPRADD RPSPPGASGS QPHQSLPKGA CIMPKTIAEQ IAAFEATRQA KSARMTELMN QAAENGVTLD AAETEEYDSL AEEVKAIDAH LVRLADLEKA NRAAAVPVEG VRSAETGSQL RAGVRIEVKG PTLPKGTAFT RYAMALMRAK GNLMQAEQIA KGWHDSTPEV ETVLKAAVAA GTTTDTAWAK PLVEYQTMAS EFAELLRPAT ILGRIPGLRR VPFNIKVPRQ TGGSTIGWVG QGAPKPVGKL SFDQITLGMA KTAGIVVMSD ELVRSSNPSA EAIVRQDMID QTAQFLDQQF VDPSVTAVAD VSPASVTHGV TPVTASGTDA DAVRADVRAV MGRFISANMS LAGAVWMMTE MQALGLALML NPLGQPEFPG LVVNGNSGGT FFGLPVILSE NIPANPGSGT PVTGAGSRLI LAKASEILLA DDGEVVLDAS REASLQMDSA PDNPPSASTV LVSLWQNNLV GLKAERFINW SKRRDGAVQY IDAANYGSA
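
Note: the relationship between the gene sequence, the GC content field and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), it strips the spaces that separate the 10-base blocks, and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code. It does not handle the alternative start codons that table 11 allows, since this gene starts with ATG. All function names are illustrative.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... as in the NCBI genetic code listing.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above
print(translate("ATGAACCGAG CTTATTCCAT CCTGACCGTG"))  # MNRAYSILTV
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared with the Protein sequence and GC content fields of this record.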
|
| |