Gene Mnod_5068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_5068 
Symbol 
ID7303761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp5144797 
End bp5146497 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content71% 
IMG OID643602698 
ProductTerminase 
Protein accessionYP_002500217 
Protein GI220924915 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGGAGT GGAGCACCGC CTGCCCGGAT TGGGAGGAGC GCATCCTGGC CGGGCGCTCG 
CTGCTGCCCT GCGGGCCGCT CTTCCCGGCC GAGGCGGCCG CAGCCATGGA GGTCTTCCGC
GCGCTGCGGA TCGTGGACGC GCTCGGCAGC CCCACCCTCG GCGAGGTCTG CCGGCCCTGG
GTCACCGAGT TCGCCGAGGT GATCTTCGGC GCCTACGATC ACGCCAGCGG GCGGCGGCTG
ATCCGCGAGT TCTTCCTCTG CATCGCCAAG AAGAACGGCA AGTCGACGCT GGCGGCCGGC
CTCATGCTCA CCGCCCTGAT CCGCAACTGG CGGCGCTCGG CCGAGTTCCT GATCCTGGCT
CCGACCATCG AGGTGGCCAA CAACGCCTTC CAGCCGGCGC GCGACATGGT CAAGGCCGAT
GACGAGCTGC GCGCCCTGCT GCACGTGCAG GACCACTATC GCACCATCAC CCACCGCCAC
ACCGGCGCGG CGCTGAAGGT GGTCGCGGCC GACACCGAGA GCGTGTCGGG CAAGAAGGCG
ACCAGCGTGC TGGTCGACGA GCTGTGGCTG TTTGGCAAGC GGCCGCAGGC CGAGAACATG
CTGCGCGAGG CGACCGGCGG CCTGGTCTCG CGGCCCGAGG GCTTCGTGAT CTACGTCTCG
ACGCATGCCG ACGAGCCGCC CGCGGGCGTG TTCAAGCAGA AGCTGGCCTA CTTCCGCGCC
GTGCGGGACG GGCGGATCAC GGATCCGCGC AGTCTCGGCG TGCTCTACGA GCATCCCCGG
GCGATGGTGG AGCGCGGCGA GCATCTCGCT CCGGCGAGCT TTCGCCTCAC CAACCCGAAC
CTGGGCCTGT CGGTCGACCC GGAGTGGCTC AGCGAGAAGT TGGAGGAGGC GCGGAATGCC
GGGCCGGCCT CGCTGGCGGG GTTTGCGGCC AAGCACCTCA ATGTCGAGAT CGGCCTGGGG
CTGCGGGCGG ACCGCTGGCC GGGCGCCGAG TTCTGGGCCC GCCGCGCCGA TCCTGCTCTG
GCCTCACCTG TCGAGGATCC GCGGGCGGGT CTGCAGGCCC TGCTCGAGCG CTGCGAGGTG
GTGGTGGTCG GCATCGACGG GGGCGGCCTG GACGACCTGT TCGGCTTGTG CGTGCTCGGC
CGCGAGCGGG CGAGCCGCGA CTGGCTCGCC TGGAGCCACG GCTGGTGCCA TGCGGGCGTG
CTGGAGCGCC GGCCGGCGAT CGCCTCGCGG CTGCGGGATT TCCAGGCGGC GGGCGAGCTC
ACCATCGTGG GCGACGAGCT GGCCGACATC TCGGCGATCG TCGGCCTGGT CGCGGCGGTG
AAGGAGCGGG GCCTGCTGGG CGGGGTCGGG GTCGATCCGG CCGGGCTCGG CGAGCTGATC
GAGGCCTTTG CGGAGATCGG GGTGACGCAG GAGGCCGGCC TGCTGGTCGG CGTGCCGCAG
GGCTACGGGC TGATGACCGG CATCAAGACC GCCGAGCGCA AGCTCGCCAA CGGCACGCTC
CGGCATGCCG GCTCTGCCCT GGCGGCGTGG TGCGTGGCCA ATCTCAAGAT CGAGCCGACC
GCGACCGCGA TCCGGGCCAC CAAGCAGAAT GCCGGCGACG CCAAGATCGA CCTGGCCATG
GCGCTGTTCA ACGCGGTGGT GCTGATGGCG CGCACCCCCG AGGCTCACCG CGAGCCGGAA
TACGCCATGT ATTTCGCCTA G
 
Protein sequence
MREWSTACPD WEERILAGRS LLPCGPLFPA EAAAAMEVFR ALRIVDALGS PTLGEVCRPW 
VTEFAEVIFG AYDHASGRRL IREFFLCIAK KNGKSTLAAG LMLTALIRNW RRSAEFLILA
PTIEVANNAF QPARDMVKAD DELRALLHVQ DHYRTITHRH TGAALKVVAA DTESVSGKKA
TSVLVDELWL FGKRPQAENM LREATGGLVS RPEGFVIYVS THADEPPAGV FKQKLAYFRA
VRDGRITDPR SLGVLYEHPR AMVERGEHLA PASFRLTNPN LGLSVDPEWL SEKLEEARNA
GPASLAGFAA KHLNVEIGLG LRADRWPGAE FWARRADPAL ASPVEDPRAG LQALLERCEV
VVVGIDGGGL DDLFGLCVLG RERASRDWLA WSHGWCHAGV LERRPAIASR LRDFQAAGEL
TIVGDELADI SAIVGLVAAV KERGLLGGVG VDPAGLGELI EAFAEIGVTQ EAGLLVGVPQ
GYGLMTGIKT AERKLANGTL RHAGSALAAW CVANLKIEPT ATAIRATKQN AGDAKIDLAM
ALFNAVVLMA RTPEAHREPE YAMYFA