Gene Mchl_2789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_2789 
Symbol 
ID7114856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp2941296 
End bp2942894 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content70% 
IMG OID643525537 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002421556 
Protein GI218530740 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCG ACCAGATCCG GGTCACCCGC GCCCTCCTTT CCGTTTCGGA CAAGACCGGG 
CTCACGGACT TCGCTGCGGC CCTGAGCCAG CGCGGCGTCG AACTGGTCTC GACGGGCGGC
ACCCACCGCG CGTTGACCGA AGCGGGTCTC GCCGTCCGGG AAGTCTCCGA GCTGACGCGC
TTCCCCGAGA TGATGGACGG CCGGGTGAAG ACCTTGCATC CGGCGGTCCA TGGCGGCCTG
CTCGCGGTGC GCGACAACCC CGAGCATCAG GCGGCTTTGG CCGCCCACGG CATCGGCGCG
ATCGACCTGC TCGTGGTCAA CCTCTACCCG TTCGAGGAAA CGCTGAAGGC CGGCAAGGCC
TATGACGATT GCGTCGAGAA CATCGATGTC GGCGGCCCGG CGATGATCCG CGCGGCGGCC
AAGAACCATG CCGACGTCGC CGTGGTGGTG GATGTTTCGG ACTACGGCGT CATCCTCGCC
GAACTCGCGG AGCATGACGG CAACCTCACC GCCACGACCC GCCGCAGGCT GGCGCAGAAG
GCGTTCTCGC GCACCGCCTC CTACGACGCG GCAATCGCCA ACTGGCTCGC CGAAGTCGAG
GGACGCGACA AGGCCCCGAA CTTCAAGGCG CTCGGCGGAA CGCTCGCCCA GAGCCTGCGC
TACGGCGAGA ACCCGCACCA ATCGGCTGCC TTCTACCGCC TGCCCGGCAC CCTGCGCCCC
GGCATCGCCA CCGCCCGGCA GGTCCAGGGC AAGGAACTGT CCTACAACAA CCTCAACGAC
ACCGACGCCG CCTACGAATG CGTCGCCGAG TTCGACCCCG CCCGCACGGC GGCGGTCGCG
ATCATCAAGC ACGCCAATCC CTGCGGCGTG GCCGAAGGGC CGGATCTGCT GGCGGCCTAC
GAGCAGGCGC TGGCCTGCGA TCCGACCTCG GCCTTCGGTG GCATCGTCGC CCTCAACCGG
CCTCTCGACG CCGAGGCCGC GAGAAAGATC GTCGAGATCT TCACCGAGGT CATCATCGCC
CCCGACGCCT CCGAGGAAGC GCTCGCTATC GTCGGCGCCA AGAAGAACCT GCGGCTTCTG
CTCGCCGGCG GCCTCGCCGA TCCGCGGGCG AAGGGTGAGG TCATCCGCAC GGTGGCGGGC
GGCTTCCTGG TCCAGGGCCG GGATGCGCTC AGCGTGGACG ACATGGACCT GAAGGTCGTG
ACCAAGCGCG CCCCGAGCGA GGCGGAACTC GCCGACATGC GCTTTGCCTA TCGGGTGGCC
AAGCACGTGA AGTCGAACGC CATCGTCTAC GCCAAGGGCG GCGCCACGGT CGGCATCGGC
GCCGGCCAGA TGTCGCGGGT GGATTCCTCG ATCACCGCCG CGCGCAAGGC GGCGGAAGCG
GCGCAGCGCC TCGGCCTGTC CGAGAGCCTC GCCAAGGGTT CGGCGGTGGC CTCCGACGCC
TTCTTCCCCT TCGCCGACGG CCTGCTCGCC GCCGCCGAGG CCGGTGCCAC CGCCGTGATC
CAGCCCGGCG GCTCGATGCG CGACGATGAG GTGATCCGGG CCGCCGACGA GGCCGGGCTC
GCCATGGTGT TCACCGGCGT GCGCCACTTC CGGCACTAG
 
Protein sequence
MPRDQIRVTR ALLSVSDKTG LTDFAAALSQ RGVELVSTGG THRALTEAGL AVREVSELTR 
FPEMMDGRVK TLHPAVHGGL LAVRDNPEHQ AALAAHGIGA IDLLVVNLYP FEETLKAGKA
YDDCVENIDV GGPAMIRAAA KNHADVAVVV DVSDYGVILA ELAEHDGNLT ATTRRRLAQK
AFSRTASYDA AIANWLAEVE GRDKAPNFKA LGGTLAQSLR YGENPHQSAA FYRLPGTLRP
GIATARQVQG KELSYNNLND TDAAYECVAE FDPARTAAVA IIKHANPCGV AEGPDLLAAY
EQALACDPTS AFGGIVALNR PLDAEAARKI VEIFTEVIIA PDASEEALAI VGAKKNLRLL
LAGGLADPRA KGEVIRTVAG GFLVQGRDAL SVDDMDLKVV TKRAPSEAEL ADMRFAYRVA
KHVKSNAIVY AKGGATVGIG AGQMSRVDSS ITAARKAAEA AQRLGLSESL AKGSAVASDA
FFPFADGLLA AAEAGATAVI QPGGSMRDDE VIRAADEAGL AMVFTGVRHF RH