Gene Mnod_4080 details


Gene Information

Locus tag: Mnod_4080
Symbol:
ID: 7303457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 4146726
End bp: 4148705
Gene Length: 1980 bp
Protein Length: 659 aa
Translation table: 11
GC content: 69%
IMG OID: 643601732
Product: peptidase U35 phage prohead HK97
Protein accession: YP_002499262
Protein GI: 220923960
COG category: [R] General function prediction only
COG ID: [COG3740] Phage head maturation protease
TIGRFAM ID: [TIGR01543] phage prohead protease, HK97 family
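
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4146726
end_bp = 4148705
gene_length_bp = 1980
protein_length_aa = 659


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(length_ok, protein_ok)  # prints: True True
```

Under these assumptions the reported gene length (1980 bp) and protein length (659 aa) are consistent with the listed coordinates.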


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGAG CTTATTCCAT CCTGACCGTG AAAGCGGTCG AGGACGAGCA GCGCGTCATC 
CGGGGTGTGG CGACCACGCC CTCGCCGGAT CGGGTCGGCG ACGTCGTCGA GCCGCTCGGG
GTCAGGTTCA CCAATCCGAT GCCGCTGCTG CACCAGCACG ATGCGGATCG GCCCGTCGGC
ACGGTGCGCT TCGACAAGCC GACCAAGGAC GGCATCACCT TCGAAGCCAC CCTGCCCAAG
ATCGCGGAAC CCGGCCCGCT CAAGGACCGA GTCGAGACCG CCTGGGGCGA GATCAAGGCC
GGCCTGGTGC GTGCCGTCTC GATCGGCTTT CGCACCCTGC CGGACGGCTA CGAGATCATG
CAGGACGGCG GCATCCGCTA TCTCAAGACC GAGGTGCTGG AGCTCAGTCT TGTGACCGTC
CCGGCCAATG CGGACGCCAA GATTTCCCTG ATCAAGTCGA TCGATCGCCC GCTGCTCGCC
GCGTCCGGCA AGGAGCCGCG GGCGGATGAT CGGCCCTCAC CCCCCGGCGC TTCGGGATCC
CAACCACACC AGAGCTTGCC GAAAGGGGCA CGCATCATGC CGAAGACCAT TGCCGAACAG
GTTGCCGCGT TCGAGGCGAC GCGCCAGGCC AAGTCCGCGC GGATGACCGA GCTGATGAAC
GACGCGGCCG AGCAGGGCGT CACCCTCGAT GCCGCGCAGA CCGAGGAGTA CGACACCCTC
GAGAGCGAGG TGAAGGCGAT CGACGCGCAC CTGAAGCGGC TCGAGGCGCT GGAGAAGACC
AACCGCGCCG CGGCGGCCCC GGTCGAGGGG GTCCGCGACA CCGAGACCGG GTCGAAGATC
CGCGGCGGCG CGCGCATCGA GGTGAAGGGC CCCACCCTGC CGAAAGGCAC CGCCTTCACC
CGCTACGCCA TGGCGCTGAT GCGGGCCAAG GGCAACCTGA TGCAGGCCGA GCAGATCGCC
AAGGGCTGGC ACGACAGCAC GCCCGAGGTG GAGACCGTGC TCAAGGCGGC GGTGGCGGCC
GGCACGACCA CGGACACCGC CTGGGCCAAG CCGCTGGTCG AGTACCAGAC CATGGCCTCC
GAGTTCGCGG AGCTGCTCCG CCCGGCCACC ATCCTCGGCC GCATCCCGGG CCTGCGCCGG
GTGCCGTTCA ACATCAAGGT GCCGCGCCAG ACCGGCGGCT CGACCATCGG CTGGGTCGGC
CAGGGGGCGC CGAAGCCGGT CGGCAAGCTC TCGTTCGACC AGATCACGCT CGGCATGGCC
AAGACGGCCG GCATCGTGGT GATGTCGGAC GAGCTGGTGC GCTCCTCCAA CCCCTCGGCC
GAGGCGATCG TCCGCCAGGA CATGATCGAC CAGACCGCGC AGTTCCTCGA CCAGCAGTTC
GTCGATCCGA GCGTGACGGC GGTGGCCGAC GTGTCGCCGG CCTCCGTGAC CCATGGCGTC
ACGCCAGTGA CCGCCAGCGG CACCGATGCG GATGCGGTGC GGGCCGACGT GCGCGCCGTG
ATGGGCAGGT TCATCAGCGC CAACATGTCG CTGGCCGGGG CCGTCTGGAT GATGACCGAG
ATGCAGGCGC TCGGCCTGGC GCTGATGCTG AACCCGCTCG GCCAGCCGGA ATTCCCGGGC
CTGGTGGTGA ACGGCAACAG CGGCGGCACC TTCTTCGGCC TGCCGGTGAT CCTGTCCGAG
AACATCCCGG CCAACCCCGG TTCGGGCACC CCCGTGACCG GCGCGGGCTC GCGCCTGATC
CTGGCCAAGG CGAGCGAGAT CCTGCTGGCC GACGACGGCG AGGTGGTGCT CGATGCCAGC
CGTGAGGCCT CCCTGCAGAT GGACAGCGCG CCCGACAACC CGCCGAGCGC CAGCACCGTG
CTGGTGTCGC TCTGGCAGAA CAACCTGGTG GGTCTCAAGG CCGAGCGGTT CATCAACTGG
AGCAAGCGCC GGGATGGCGC CGTGCAGTAC ATCGACGCGG CCAACTACGG CTCCGCCTAA
 
Protein sequence
MNRAYSILTV KAVEDEQRVI RGVATTPSPD RVGDVVEPLG VRFTNPMPLL HQHDADRPVG 
TVRFDKPTKD GITFEATLPK IAEPGPLKDR VETAWGEIKA GLVRAVSIGF RTLPDGYEIM
QDGGIRYLKT EVLELSLVTV PANADAKISL IKSIDRPLLA ASGKEPRADD RPSPPGASGS
QPHQSLPKGA RIMPKTIAEQ VAAFEATRQA KSARMTELMN DAAEQGVTLD AAQTEEYDTL
ESEVKAIDAH LKRLEALEKT NRAAAAPVEG VRDTETGSKI RGGARIEVKG PTLPKGTAFT
RYAMALMRAK GNLMQAEQIA KGWHDSTPEV ETVLKAAVAA GTTTDTAWAK PLVEYQTMAS
EFAELLRPAT ILGRIPGLRR VPFNIKVPRQ TGGSTIGWVG QGAPKPVGKL SFDQITLGMA
KTAGIVVMSD ELVRSSNPSA EAIVRQDMID QTAQFLDQQF VDPSVTAVAD VSPASVTHGV
TPVTASGTDA DAVRADVRAV MGRFISANMS LAGAVWMMTE MQALGLALML NPLGQPEFPG
LVVNGNSGGT FFGLPVILSE NIPANPGSGT PVTGAGSRLI LAKASEILLA DDGEVVLDAS
REASLQMDSA PDNPPSASTV LVSLWQNNLV GLKAERFINW SKRRDGAVQY IDAANYGSA
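
The sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, checked for GC content, and translated. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits); the helper names are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Amino acid assignments are those of the standard code, shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed above.
first_line = "ATGAACCGAG CTTATTCCAT CCTGACCGTG AAAGCGGTCG AGGACGAGCA GCGCGTCATC"
dna = clean(first_line)
print(translate(dna))  # prints: MNRAYSILTVKAVEDEQRVI
print(round(gc_fraction(dna) * 100))  # GC percentage of this fragment only
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Running the same functions on the full gene sequence would allow the reported GC content and protein length to be compared against the record.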