Gene Mchl_3045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_3045 
Symbol 
ID7118323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp3216026 
End bp3217054 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content73% 
IMG OID643525796 
ProductTIM-barrel protein, nifR3 family 
Protein accessionYP_002421811 
Protein GI218530995 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGTG ACCATTATTT GAGCGCGCTA CGCGGGGCGG ACGATCGGGG GGCGGATAGC 
CCGCCGGTCC TGCTGGCACC GCTATCGGGC GTCACCGATC TCCATATGCG GCGCCTCGCC
CAGCGGCTTG GTGCCAGTGC CGTGGTCTCC GAGATGGTCG CGGCGGCCGA GCTGGCGAAG
GGTGACGCCG AATCGCAGGT TCGGGCCGAG GGTGCGGGGG TGGACCCGCA TGTGGTGCAG
CTCGCCGGCT GCGCGCCCGA ACCGATGGCC GAGGGGGCCC GTATCGCCGT GGCGAACGGC
GCCGATGCCA TCGACATCAA CATGGGCTGC CCGGCCAAGA CCGTGACCGG CGGCGAATCG
GGCTCGGCGC TGATGCGCGA CCTCGATCAC GCCGCGCGGC TCCTCGCGGC GGTGCGCGGC
GCGGTCGATG TGCCCGTCAC CGTGAAGATG CGGCTCGGCT GGGATCACGC CAGCCTCAAC
GCCGCCGAAC TGGCGCGGCG CGCCGAAGCC CTCGGGCTCG ACGGCGTGAC GGTCCACGGC
CGCACCCGGC AGCAATTCTA CAAGGGTCAG GCCGATTGGT CGGCGATCCG TCCCGTGGTC
GAAGCGGTGC GCATCCCCGT CATCGCCAAC GGCGACATCA CCGGCCTGGA GGAGGCCCGC
GCCTGCCTCA GCCAATCCGG TGCGGCGGGC GTGATGGTCG GGCGCGCGGC GGTCGGGCGT
CCCTGGCTCG TCGGCGAGAT CGCCGCCGGG CTCGCCGGTC GTGTGGCTTC GCCGCTCTCG
GCCGAGCAGC GGGCGGCAGT CGCGGCCGAG CATTACCAGG GGCTGATCGC GCTCTTCGGC
GCGGCGATGG GCGTGCGCCA CGCCCGCAAG CATCTCGCGG CCTATGCCGA CGCGGCCGGC
GGACTGACCC CGGACGACCG GCGCCGCCTC GTCACCACAC ATGATCCGGC GGAGGCGCTC
CGCCTCCTGC AACGCGCGTT CCTCGAACCG GCCGGGACGG CGACCACGCC CCTCGGCGAG
GCAGCCTGA
 
Protein sequence
MTRDHYLSAL RGADDRGADS PPVLLAPLSG VTDLHMRRLA QRLGASAVVS EMVAAAELAK 
GDAESQVRAE GAGVDPHVVQ LAGCAPEPMA EGARIAVANG ADAIDINMGC PAKTVTGGES
GSALMRDLDH AARLLAAVRG AVDVPVTVKM RLGWDHASLN AAELARRAEA LGLDGVTVHG
RTRQQFYKGQ ADWSAIRPVV EAVRIPVIAN GDITGLEEAR ACLSQSGAAG VMVGRAAVGR
PWLVGEIAAG LAGRVASPLS AEQRAAVAAE HYQGLIALFG AAMGVRHARK HLAAYADAAG
GLTPDDRRRL VTTHDPAEAL RLLQRAFLEP AGTATTPLGE AA