Gene Mnod_5067 details


Gene Information

Locus tag: Mnod_5067
Symbol:
ID: 7303760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 5142798
End bp: 5144777
Gene Length: 1980 bp
Protein Length: 659 aa
Translation table: 11
GC content: 69%
IMG OID: 643602697
Product: peptidase U35 phage prohead HK97
Protein accession: YP_002500216
Protein GI: 220924914
COG category: [R] General function prediction only
COG ID: [COG3740] Phage head maturation protease
TIGRFAM ID: [TIGR01543] phage prohead protease, HK97 family
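
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon. The record states neither explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5142798
end_bp = 5144777

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1980
print(protein_length_aa)  # 659
```

Both values agree with the Gene Length (1980 bp) and Protein Length (659 aa) fields.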


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGAG CTTATTCCAT CCTGACCGTG AAAGCGGTCG AGGACGAGCA GCGCGTCATC 
CGGGGTGTGG CGACCACGCC GTCTCCGGAT CGGGTCGGCG ACATCGTCGA GCCGCTCGGG
GTCAGGTTCA CCAATCCGAT GCCGCTGCTG CACCAGCACG ATGCGGATCG GCCCGTCGGC
ACGGTGCGCT TCGACAAGCC GACCAAGGAC GGCATCACCT TCGAGGCGCG ACTTCCGAAG
ATCGCGGAGC CCGGCCCGCT CAAGGACCGG GTCGAGACCG CCTGGGGCGA GGTCAAGGCC
GGCCTGGTGC GTGCCGTCTC GATCGGCTTC CGCACCCTGC CGGACGGCTA CGAGATCATG
CAGGACGGCG GCATCCGCTA TCTCAAGACC GAGGTGCTGG AGCTCAGTCT TGTGACCGTG
CCGGCCAATG CGGACGCCAG GATCTCCCTG ATCAAGTCGA TCGATCGCCC GCTGCTCGCC
GCGTCCGGCA AGGAGCCGCG GGCGGATGAT CGGCCCTCAC CCCCCGGCGC TTCGGGATCC
CAACCACACC AGAGCTTGCC GAAAGGGGCA TGCATCATGC CGAAGACCAT TGCCGAACAG
ATTGCCGCGT TCGAGGCGAC GCGCCAGGCC AAGTCGGCCC GGATGACCGA GCTGATGAAC
CAGGCGGCCG AGAACGGGGT CACCCTCGAT GCGGCGGAGA CCGAGGAGTA CGACAGCCTC
GCCGAGGAGG TGAAGGCGAT CGACGCGCAT CTCGTGCGCC TCGCGGATCT CGAGAAGGCC
AACCGCGCCG CGGCCGTGCC AGTCGAGGGT GTGCGCAGCG CCGAGACCGG CTCGCAGCTG
CGCGCGGGCG TGCGCATCGA GGTGAAGGGC CCCACCCTGC CGAAAGGCAC CGCCTTCACC
CGCTACGCCA TGGCGCTGAT GCGGGCCAAG GGCAACCTGA TGCAGGCCGA GCAGATCGCC
AAGGGCTGGC ACGACAGCAC GCCCGAGGTG GAGACCGTGC TCAAGGCGGC GGTGGCGGCC
GGCACGACCA CGGACACCGC CTGGGCCAAG CCGCTGGTCG AGTACCAGAC CATGGCCTCC
GAGTTCGCGG AGCTGCTCCG CCCGGCCACC ATCCTCGGCC GCATCCCGGG CCTGCGCCGG
GTGCCGTTCA ACATCAAGGT GCCGCGCCAG ACCGGCGGCT CGACCATCGG CTGGGTCGGC
CAGGGGGCGC CGAAGCCGGT CGGCAAGCTC TCGTTCGACC AGATCACGCT CGGCATGGCC
AAGACGGCCG GCATCGTGGT GATGTCGGAC GAGCTGGTGC GCTCCTCCAA CCCCTCGGCC
GAGGCGATCG TCCGCCAGGA CATGATCGAC CAGACCGCGC AGTTCCTCGA CCAGCAGTTC
GTCGATCCGA GCGTGACGGC GGTGGCCGAC GTGTCGCCGG CCTCCGTGAC CCATGGCGTC
ACGCCAGTGA CCGCCAGCGG CACCGACGCG GATGCGGTGC GGGCCGACGT GCGCGCCGTG
ATGGGCAGGT TCATCAGCGC CAACATGTCG CTGGCCGGGG CCGTCTGGAT GATGACCGAG
ATGCAGGCGC TCGGCCTGGC GCTGATGCTG AACCCGCTCG GCCAGCCGGA ATTCCCGGGC
CTGGTGGTGA ACGGCAACAG CGGCGGCACC TTCTTCGGCC TGCCGGTGAT CCTGTCCGAG
AACATCCCGG CCAACCCCGG TTCGGGCACC CCCGTGACCG GCGCGGGCTC GCGCCTGATC
CTGGCCAAGG CGAGCGAGAT CCTGCTGGCC GACGACGGCG AGGTGGTGCT CGATGCCAGC
CGCGAGGCCT CCCTGCAGAT GGACAGCGCG CCCGACAACC CGCCGAGCGC CAGCACCGTG
CTGGTGTCGC TCTGGCAGAA CAACCTGGTG GGTCTCAAGG CGGAGCGGTT CATCAACTGG
AGCAAGCGCC GGGATGGCGC CGTGCAGTAC ATCGACGCGG CCAACTACGG CTCCGCCTAA
 
Protein sequence
MNRAYSILTV KAVEDEQRVI RGVATTPSPD RVGDIVEPLG VRFTNPMPLL HQHDADRPVG 
TVRFDKPTKD GITFEARLPK IAEPGPLKDR VETAWGEVKA GLVRAVSIGF RTLPDGYEIM
QDGGIRYLKT EVLELSLVTV PANADARISL IKSIDRPLLA ASGKEPRADD RPSPPGASGS
QPHQSLPKGA CIMPKTIAEQ IAAFEATRQA KSARMTELMN QAAENGVTLD AAETEEYDSL
AEEVKAIDAH LVRLADLEKA NRAAAVPVEG VRSAETGSQL RAGVRIEVKG PTLPKGTAFT
RYAMALMRAK GNLMQAEQIA KGWHDSTPEV ETVLKAAVAA GTTTDTAWAK PLVEYQTMAS
EFAELLRPAT ILGRIPGLRR VPFNIKVPRQ TGGSTIGWVG QGAPKPVGKL SFDQITLGMA
KTAGIVVMSD ELVRSSNPSA EAIVRQDMID QTAQFLDQQF VDPSVTAVAD VSPASVTHGV
TPVTASGTDA DAVRADVRAV MGRFISANMS LAGAVWMMTE MQALGLALML NPLGQPEFPG
LVVNGNSGGT FFGLPVILSE NIPANPGSGT PVTGAGSRLI LAKASEILLA DDGEVVLDAS
REASLQMDSA PDNPPSASTV LVSLWQNNLV GLKAERFINW SKRRDGAVQY IDAANYGSA
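
The sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool. The codon table uses the standard amino-acid assignments, which translation table 11 shares. Table 11's alternative start codons are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments (it differs only in which codons may act as start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGAACCGAG CTTATTCCAT CCTGACCGTG AAAGCGGTCG AGGACGAGCA GCGCGTCATC"
print(translate(first_line))  # MNRAYSILTVKAVEDEQRVI
```

The printed residues match the first two blocks of the protein sequence (MNRAYSILTV KAVEDEQRVI). Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.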