Gene Mnod_4081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_4081 
Symbol 
ID7303458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp4148725 
End bp4150425 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content72% 
IMG OID643601733 
ProductTerminase 
Protein accessionYP_002499263 
Protein GI220923961 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGGAAT GGAGCACCGC CTGCCCGGAT TGGGAGGAGC GCATCCGGGC CGGGCGCTCG 
CTGCTGCCCT GCGGGCCGCT CTTCCCGGCC GAGGCGGCCG CGGCCATGGA GGTGTTTCGC
GCGCTGCGGA TCGTGGATGC GCTCGGCAGC CCCACCCTCG GCGAGGTCTG CCGGCCCTGG
GTCACCGAGT TCGCCGAGGT GATCTTCGGC GCCTACGATC ACGCCAGCGG GCGGCGGCTG
ATCCGCGAGT TCTTCCTCTG CATCGCCAAG AAGAACGGCA AGTCGACGCT GGCGGCCGGC
CTCATGCTCA CCGCCCTGAT CCGCAACTGG CGGCGCTCGG CCGAGTTCCT GATCCTGGCT
CCGACCATCG AGGTGGCCAA CAACGCCTTC CAGCCGGCGC GCGACATGGT CAAGGCCGAT
GACGAGCTGC GCGCCCTGCT GCACGTGCAG GACCACTACC GCACCATCAC CCACCGCCAC
ACCGGCGCGG CGCTGAAGGT GGTCGCGGCC GACACCGAGA GCGTGTCGGG CAAGAAGGCG
ACCAGCGTGC TGGTCGACGA GCTGTGGCTG TTTGGCAAGC GGCCGCAGGC CGAGAACATG
CTGCGCGAGG CGACCGGCGG CCTGGTCTCG CGGCCCGAGG GCTTCGTGAT CTACGTCTCG
ACGCATGCGG ACGAGCCGCC CGCGGGCGTG TTCAAGCAGA AGCTGGCCTA CTTCCGCGCC
GTGCGGGACG GGCGGATCAC GGATCCGCGC AGCCTGGGCG TGCTCTACGA GCATCCCCGG
GCGATGGTGG AGCGCGGCGA GCACCTCGCT CCGGCGAGCT TTCGCCTCAC CAACCCGAAC
CTGGGCCTGT CGGTCGACCC GGAGTGGCTC AGCGAGAAGC TGGAGGAGGC GCGGACCGCC
GGGCCGGCCT CGCTGGCGGG GTTTGCGGCC AAGCACCTCA ACGTGGAGAT CGGCCTGGGG
CTGCGGGCGG ACCGCTGGCC GGGCGCCGAG TTCTGGGCCC GCCGCGCCGA TCCTGCTCTG
GCCTCCCCTG TCGAGGATCC GCGGGCGGGT CTGCAGGCCC TGCTCGAGCG CTGCGAGGTG
GTGGTGGTCG GCATCGACGG GGGCGGCCTG GACGACCTGT TCGGCTTGTG CGTGCTCGGC
CGCGAGCGGG CGAGCCGCGA CTGGCTGGCC TGGAGCCACG GCTGGTGCCA CGCGGGCGTG
CTGGAGCGCC GGCCGGCGAT CGCCTCGCGG CTGCGGGATT TCCAGGCGGC GGGCGAGCTC
ACCATCGTGG GCGACGAGCT GGCCGACATC TCGGCGATCG TCGGCCTGGT CGCGGCGGTG
AAGGAGCGGG GCCTGCTGGG CGGGGTCGGG GTCGATCCGG CCGGGCTCGG CGAGCTGATC
GAGGCCTTCG CGGAGATCGG GGTGACGCAG GAGGCCGGCC TGCTGATCGG GGTGCCGCAG
GGCTACGGGC TGATGACCGG CATCAAGACC GCCGAGCGCA AGCTCGCCAA CGGCACGCTC
CGGCATGCCG GCTCTGCCCT GGCGGCGTGG TGCGTGGCCA ATCTCAAGAT CGAGCCGACC
GCGACCGCGA TCCGGGCCAC CAAGCAGAAT GCCGGCGACG CCAAGATCGA CCTGGCCATG
GCGCTGTTCA ACGCGGTGGT GCTGATGGCG CGCACCCCCG AGGCTCACCG CGAGCCGGAA
TACGCCATGT ATTTCGCCTA G
 
Protein sequence
MREWSTACPD WEERIRAGRS LLPCGPLFPA EAAAAMEVFR ALRIVDALGS PTLGEVCRPW 
VTEFAEVIFG AYDHASGRRL IREFFLCIAK KNGKSTLAAG LMLTALIRNW RRSAEFLILA
PTIEVANNAF QPARDMVKAD DELRALLHVQ DHYRTITHRH TGAALKVVAA DTESVSGKKA
TSVLVDELWL FGKRPQAENM LREATGGLVS RPEGFVIYVS THADEPPAGV FKQKLAYFRA
VRDGRITDPR SLGVLYEHPR AMVERGEHLA PASFRLTNPN LGLSVDPEWL SEKLEEARTA
GPASLAGFAA KHLNVEIGLG LRADRWPGAE FWARRADPAL ASPVEDPRAG LQALLERCEV
VVVGIDGGGL DDLFGLCVLG RERASRDWLA WSHGWCHAGV LERRPAIASR LRDFQAAGEL
TIVGDELADI SAIVGLVAAV KERGLLGGVG VDPAGLGELI EAFAEIGVTQ EAGLLIGVPQ
GYGLMTGIKT AERKLANGTL RHAGSALAAW CVANLKIEPT ATAIRATKQN AGDAKIDLAM
ALFNAVVLMA RTPEAHREPE YAMYFA