Gene Mext_2566 details

Gene Information

Locus tag: Mext_2566
Symbol:
ID: 5832209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2881522
End bp: 2883120
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 70%
IMG OID: 641368367
Product: phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001640031
Protein GI: 163851988
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
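
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2881522, 2883120)
print(length)                     # 1599
print(protein_length_aa(length))  # 532
```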


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCGCG ACCAGATCCG GGTCACCCGC GCCCTCCTTT CCGTTTCGGA CAAGACCGGG 
CTCACGGACT TCGCTGCGGC GCTGAGCCAG CGCGGCGTCG AACTCGTCTC GACCGGCGGC
ACCCACCGCG CGTTGACCGA AGCGGGTCTC GCCGTCCGGG AAGTCTCAGA GCTGACGCGC
TTCCCCGAGA TGATGGACGG CCGGGTGAAG ACGCTGCATC CGGCGGTTCA TGGCGGCCTG
CTCGCGGTGC GCGACAACCC CGAGCATCAG GCGGCTTTGG CCGCCCACGG CATCGGCGCG
ATCGACCTGC TCGTGGTCAA CCTCTACCCG TTCGAGGAAA CACTGAAGGC CGGCAAGGCC
TATGACGACT GCGTCGAGAA CATCGATGTC GGCGGCCCGG CGATGATCCG CGCGGCGGCC
AAGAACCATG CCGACGTCGC CGTGGTGGTG GATGTCTCGG ACTACGGCGC CATCCTCGCC
GAACTCGCGG AGCATGACGG CAACCTCACC GCCACCACCC GCCGCAGGCT GGCGCAGAAG
GCGTTTTCGC GTACCGCCTC CTACGACGCG GCGATCGCCA ACTGGCTCGC CGAGGTCGAG
GGACGCGACA AGGCCCCGAC CTTCAAGGCG CTCGGTGGAA CGCTCGCCCA GAGCCTGCGC
TACGGCGAGA ACCCGCACCA GTCGGCCGCC TTCTATCGCC TGCCCGGCAC CCTGCGCCCC
GGCATCGCCA CCGCCCGGCA GGTCCAGGGC AAGGAACTGT CCTACAACAA CCTCAACGAC
ACCGATGCGG CCTACGAATG CGTCGCCGAG TTCGACCCGG CACGCACGGC GGCGGTCGCG
ATCATCAAGC ACGCCAACCC CTGCGGCGTC GCGGAAGGGC CGGATTTGCT AGCGGCTTAC
GAGCAGGCGC TGGCCTGCGA TCCGACCTCG GCCTTCGGCG GTATCGTCGC CCTCAACCGG
CCTCTCGACG CCGAGGCCGC GAGAAAGATC GTCGAGATCT TCACCGAGGT CATCATCGCC
CCCGACGCCT CCGAGGAGGC GCTCGCTATC GTCGGCGCCA AGAAGAACCT GCGGCTTCTG
CTCGCCGGCG GCCTCGCCGA TCCGCGGGCG AAGGGCGAGG TCATCCGCAC CGTGGCGGGC
GGCTTCCTGG TCCAGGGCCG GGATGCGCTC AGCGTGGACG ACATGGACCT GAAGGTCGTA
ACCAAGCGCG CCCCGAGCGA GGCGGAACTC GCCGACATGC GCTTTGCCTA TCGGGTGGCC
AAGCACGTGA AGTCGAACGC CATCGTCTAC GCCAAGGGCG GCGCCACGGT CGGCATCGGC
GCCGGGCAGA TGTCGCGGGT GGATTCCTCG ATCACCGCCG CGCGCAAGGC GGCGGAAGCG
GCGCAGCGCC TCGGCCTGTC CGAGAGCCTC GCCAAGGGTT CGGCGGTGGC CTCCGACGCC
TTCTTCCCTT TCGCCGACGG CCTGCTCGCC GCCGCCGAGG CCGGTGCCAC CGCCGTGATC
CAGCCCGGCG GCTCGATGCG CGACGACGAG GTGATCCGGG CCGCCGACGA GGCCGGGCTC
GCCATGGTGT TCACCGGCGT GCGCCACTTC CGGCACTAG
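
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before it can be compared with the length and GC content fields. The sketch below shows one way this might be done in Python. The helper names are hypothetical, and the coding-sequence check is a simplification: it accepts only ATG as a start codon, although translation table 11 also permits alternative starts.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon.

    Simplification: alternative start codons are not accepted here.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# First row of the gene sequence above, used as a small example.
first_row = "ATGCCGCGCG ACCAGATCCG GGTCACCCGC GCCCTCCTTT CCGTTTCGGA CAAGACCGGG"
row = join_blocks(first_row)
print(len(row))          # 60
print(gc_fraction(row))  # GC fraction of this row only, not of the whole gene
```

Passing the full block of sequence lines to `join_blocks` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` of that string can be compared with the GC content field.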
 
Protein sequence
MPRDQIRVTR ALLSVSDKTG LTDFAAALSQ RGVELVSTGG THRALTEAGL AVREVSELTR 
FPEMMDGRVK TLHPAVHGGL LAVRDNPEHQ AALAAHGIGA IDLLVVNLYP FEETLKAGKA
YDDCVENIDV GGPAMIRAAA KNHADVAVVV DVSDYGAILA ELAEHDGNLT ATTRRRLAQK
AFSRTASYDA AIANWLAEVE GRDKAPTFKA LGGTLAQSLR YGENPHQSAA FYRLPGTLRP
GIATARQVQG KELSYNNLND TDAAYECVAE FDPARTAAVA IIKHANPCGV AEGPDLLAAY
EQALACDPTS AFGGIVALNR PLDAEAARKI VEIFTEVIIA PDASEEALAI VGAKKNLRLL
LAGGLADPRA KGEVIRTVAG GFLVQGRDAL SVDDMDLKVV TKRAPSEAEL ADMRFAYRVA
KHVKSNAIVY AKGGATVGIG AGQMSRVDSS ITAARKAAEA AQRLGLSESL AKGSAVASDA
FFPFADGLLA AAEAGATAVI QPGGSMRDDE VIRAADEAGL AMVFTGVRHF RH