Gene Mnod_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_1998 
Symbol 
ID7305187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp2100063 
End bp2101379 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content71% 
IMG OID643599733 
Productphage major capsid protein, HK97 family 
Protein accessionYP_002497288 
Protein GI220921987 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCCC TCGAGGCGCC GCTTCGCGCC CTCCAGGAAA AGCGGGCCGG GATCGTCGCG 
CGCATGCGCG AGATCCTTCA GGCCGCCGAG GCCGAGGACC GCGACCTCAC GGCCGAGGAG
GCGGAGAGCT ACGATGGCCT GAAGGCCGAC AAGGATGCTC TTGATCGGCG CATCGCGCGC
CTGGAGGAGC AGGCCGGGCA TGAGGCCGCG TTGGAGGAGA CGCGCCCGGC GGTGTCCCGC
CGCGCCGGTC CTCAGCCGGT CCGGCGGCAC GGCGAGGCCT CGACGCAGTT CGAGAGCCTG
GGCGAGTTCA TGCACGCCGT GCGCTTCCGG CCGAACGACC AGCGCCTCGA CTTCCACGAG
GGCATCGGCG CCTCGGAAGC AGAAGGTGCC CTGAGCGCCG AGATGCGGAT GGACGACGGG
CCCTCCGGCG GCTTCGCGAT TCCGCCGCAG TTCCGCACCG AGCTGATGTC GGTGCGCCCG
CAGGACTCGA TCGTGCGGTC GCGCGCGAAC GTGCTGCCCG CCGGCTCGCC GCCGGATGCC
CCGGTGGTGA TCCCGGCCCT CGATCAGACT GGCGACGCCC CGCAGGGGAT GTTCGGCGGC
GTGAAGGTGA CCTGGATCGA GGAGGGCGGG GAGAAGCCCG AGACCGACCT CAAGCTGCGC
GAGATCATGC TGACGCCGCA CGAGGTCGCC GGCACGATCA CGATCGGCGA CAAGCTGCTG
CGCAACTGGC AGACCTCCGA TACCCTGCTG CGCACCCAGC TGCGCGGCGC GGTCTCGGCG
GCGGAGGACT ACGCCTTCCT GCGCGGCAAT GGCGTCGGCC GGCCGCTCGG CGCGATCCAT
GCGCCGGCGG CCTACAAGGT GCCGCGGGCG CAGGCGACCA AGGTCACCTA CGTCGACCTC
GTCACCATGC TCTCGCGGCT GCTGATGCGC GGCAGCAACC CCGTGTGGAG CGCGCCGCAG
GCCGTCTTGC CGCAGATCAT GCTGCTCAAG GACGACCAGG GCCGCCTCAT CTGGCAGCCG
AACGCGCAGG ACGGCATTCC CGGCACCCTG CTCGGCTACC CGCTGATCTG GAACAACCGG
GCGCCGCTGC TCGGCACGCT CGGCGACGTC GTGCTGGCGG ACTGGTCCTC GTACCTGATC
AAGGACGGCT CCGGCCCCTA CGTCGCGGCG TCGGAGCACG TGCACTTCAC CCGCAACAAG
ACCGTGATCA AGGTCTTCTG GAACGTCGAC GGCGCGCCCT GGCTCACCGA GCCGATCAAG
GAGGAGAACG GCTACGCCGT CTCGCCCTTC GTCGTGCTCG ACGTGCCCGC CGCGTGA
 
Protein sequence
MAALEAPLRA LQEKRAGIVA RMREILQAAE AEDRDLTAEE AESYDGLKAD KDALDRRIAR 
LEEQAGHEAA LEETRPAVSR RAGPQPVRRH GEASTQFESL GEFMHAVRFR PNDQRLDFHE
GIGASEAEGA LSAEMRMDDG PSGGFAIPPQ FRTELMSVRP QDSIVRSRAN VLPAGSPPDA
PVVIPALDQT GDAPQGMFGG VKVTWIEEGG EKPETDLKLR EIMLTPHEVA GTITIGDKLL
RNWQTSDTLL RTQLRGAVSA AEDYAFLRGN GVGRPLGAIH APAAYKVPRA QATKVTYVDL
VTMLSRLLMR GSNPVWSAPQ AVLPQIMLLK DDQGRLIWQP NAQDGIPGTL LGYPLIWNNR
APLLGTLGDV VLADWSSYLI KDGSGPYVAA SEHVHFTRNK TVIKVFWNVD GAPWLTEPIK
EENGYAVSPF VVLDVPAA