Gene Mnod_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_1994 
Symbol 
ID7305183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp2095478 
End bp2097313 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content70% 
IMG OID643599729 
ProductTerminase 
Protein accessionYP_002497284 
Protein GI220921983 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.447036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTATCT CAGGCCCGTC GCGAAGTGAT CCCACGACCG CCTGGGCGGA AGACGTCGTC 
GCCGGGCGGA TCGTGGCCGG CGAGCTGGTG CGCCATGCCG CCGAGCGGCA CCTGCGCGAC
CGGCGCGATG GAGCCCGGCG TGGGTTGCAC TGGCGGCCGG AGATCGCAGC CCGGGCGCTC
GGCTTCCTGC CGGCCGTCCT GACCATCACG GCGGGCGCCA AGGCCGGCGA GCCGTTCGTG
CCGCTGCCCT GGCACACCTT CGTGATCGGC TCGCTGTTCG GCTGGCGCAA GGACAGCGGT
CGGATGCGCT TCCGTGCCGG CTGGCTGGAG ACCGGCAAGG GCCAGGCGAA GTCGCCGCTG
ATGGCCGCGG TTGGGCTCTA CCTGATGGGC TGGGCCGGCA TTCCGCGGTC CGAGGTCTAC
GCGATCGGGC AGGACCGGGC CACCGCCAAC GTGCTGTTCG GGGACGCGGT GGCGATGTGC
CGGGCGCCGA TCCCCGGAGC CGAGGACGAC AGTGACACGC TGGAGCAGCG CGGCGAGGTC
GTGATCCGCG GCGAGGGCGA CAATGCCTGG AAGATCGAGC ACATCGAGAC CGGCTCGAAG
TTCCGGGCGC TGGCCAACGG CGAGGCGGTG TCTGGCCCGC GGCCCACCGC CGTGCTGGCC
GACGAGATCC ACGAGTTCAA GGCCAACGCG GCCATCGAGA CGTGGCGGCG GGCGGTGGCG
AAGATGCCGG GCGACGCGCT GATGCTGCTC GGCACCAACA CGCCGGCCAC GACGCAGATC
GTTGGCACGG ATTACTCGGA GTTCTACCAG AAGGTAGCGC GGGGCGAGAT CCAGGATGAC
GAGGCGTTCG CCTTCATCGC CCGGGTCGAC AAGGCCGACC GCGAGAGCGT GTTCGAGGAC
GAATCCTGCT GGCCGAAGGC GCTGCCGGCG CTGGGCATCA CCTTCCCGAT CGAGAACATC
CGGGGCGAGG TGAACACGGC CAAGCAGTTG CTCTCGACGG CGCTGTCGGT GAAGCGGCTC
TACTTCGGCA TCCCGATCGG CGCCACCGCG TTCTGGATCG CTGAGGAGGC CTGGGTCGCG
GTTCAGGGCA AGGTCGATGC GCAGGCGCTG CGTGGGCAGC CGTGCTGGCT GGCGCTGGAC
CTGTCCAAGA AGAACGACCT CACGGCCCTC ACGGCGGTGT GGGTCGGGGG AGACGGGCAC
CACTTTGCCA AGACCTGGTA CTGGACCACG CGGGAAGGGA TTGCGGACCG GGCCCGGGCC
GATCAGGCGC CCTATGACCA GTGGGCGGAG AGGCCTGAGG AGACGGGCTT GGTCGCCGTT
CCGGGCGCGG TGATCGACAA GACCTTCGTG GCCGCCGAGG TGGCCCGCCT CGTCGCCGAG
CACGACGTGC AGTTCCTGGC CTTCGACCCG GCCGGGATGG CGGATTTCGA GGCCGCCTGC
GAGGAGATCG GGCTTCCGGT GTGGCGCTAC CAGGGGCCGG GCGAGCCCGA AGGCGAGGGG
TTGAAGCTCG TCGCGCACGG CCAGGGCAAG CGCATCGTGT TCGAGGACCG GGCGCTGTGC
ATGCCGCGCT CGATCGAACG CCTGGAGGAC CTGATCCTGA CCGGCGGCAT CGCGATCGAC
GCCTCCCCCG TCACCTACGC CTGCGCCGCC AACGCCCACG TCGATGCGGA CGGCCAGGGC
AACCGGGCCT TCGACAAGAA GCGGAGCCGG GGCCGCATCG ACGGCCTCGT GACGATCGCG
ATGGCGGTCG GGGCAGCGTC GGCCGACCTG CCGGACAGCG GCCCGTCCGT CTACGAGACC
CGCGGCATCC TGGAAATCGA GATCGACGCG ATCTGA
 
Protein sequence
MPISGPSRSD PTTAWAEDVV AGRIVAGELV RHAAERHLRD RRDGARRGLH WRPEIAARAL 
GFLPAVLTIT AGAKAGEPFV PLPWHTFVIG SLFGWRKDSG RMRFRAGWLE TGKGQAKSPL
MAAVGLYLMG WAGIPRSEVY AIGQDRATAN VLFGDAVAMC RAPIPGAEDD SDTLEQRGEV
VIRGEGDNAW KIEHIETGSK FRALANGEAV SGPRPTAVLA DEIHEFKANA AIETWRRAVA
KMPGDALMLL GTNTPATTQI VGTDYSEFYQ KVARGEIQDD EAFAFIARVD KADRESVFED
ESCWPKALPA LGITFPIENI RGEVNTAKQL LSTALSVKRL YFGIPIGATA FWIAEEAWVA
VQGKVDAQAL RGQPCWLALD LSKKNDLTAL TAVWVGGDGH HFAKTWYWTT REGIADRARA
DQAPYDQWAE RPEETGLVAV PGAVIDKTFV AAEVARLVAE HDVQFLAFDP AGMADFEAAC
EEIGLPVWRY QGPGEPEGEG LKLVAHGQGK RIVFEDRALC MPRSIERLED LILTGGIAID
ASPVTYACAA NAHVDADGQG NRAFDKKRSR GRIDGLVTIA MAVGAASADL PDSGPSVYET
RGILEIEIDA I