Gene Mfla_0349 details

Gene Information

Locus tag: Mfla_0349
Symbol: purH
ID: 3999316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 355117
End bp: 356703
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 59%
IMG OID: 637937245
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_544461
Protein GI: 91774705
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
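
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 355117
end_bp = 356703
gene_length_bp = 1587
protein_length_aa = 528

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is assumed to be
# the stop codon, which contributes no residue to the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length?
print(remainder == 0)                            # whole number of codons?
print(expected_protein_aa == protein_length_aa)  # matches Protein Length?
```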


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.432083
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTGA TCAAACGTGC GCTTATCAGT GTCTCTGATA AAACCGGTAT TCTCGAATTT 
GCCAAGGCCC TTGCCGAATT CGGTGTGGAG ATACTCTCTA CCGGCGGTAC AGCAAAGCTG
TTCCGCGACA ACGGCATTCC CGTCACGGAG GTCAGTGACT ACACCGGCTT CCCGGAAATG
CTGGACGGAC GCGTCAAGAC GCTGCACCCG AAAATCCACG GTGGCCTGCT TGGTCGCCGC
GACCTGCCGG AACATGTCAC CGCCATGCAA GCTGCCGGCA TCCCGGATAT CGACATGATC
GTGGTCAACC TCTACCCGTT CGAAGCGACT GTCGCCCGTC CTGACGCCAC ACTGGAAGAT
GCGATCGAGA ATATCGACAT CGGCGGGCCC GCCATGGTAC GTTCCGCTGC CAAGAACTGG
CAGGATGTTG CGGTATTGAC CGATGCCTCC CAATACGAGG AAGTACTGGC CGAGATGCGC
AGCACTGGTG GCGCTACCAG CAAGGCGACG CGCTTTGCCT TGTCTGTTGC CGCGTTCAAC
CGCATCAGCA ATTATGACGG CGCCATCAGC GACTACCTTT CCTCCTTTAA TGCAGACGGC
ACACGCAACG AGTTCCCCGG CCAGATCAAT GGCCGCCTGG TCAAGGTGCA GGATCTGCGC
TATGGCGAGA ACCCGCATCA GCAGGCAGCG TTCTACCGCG ACCTGTATCC TGCGCCCGGC
TCGCTCGTGA CTGCCCAACA ATTGCAGGGC AAGGAGCTTT CCTATAACAA TATTGCCGAT
GCCGACGCGG CATGGGAATG CGTCAAGAGC TTCGACAGCA CGGCCTGCGT CATCGTCAAG
CACGCCAATC CTTGTGGCGT GGCACTGGGC GCCACACCGC TCGAGGCCTA CCAGAAAGCG
TTCCAGACCG ATCCGACCTC CGCGTTCGGC GGCATCATTG CCTTCAACCA CACCCTGGAT
GGCGCAGCAG CAGAGGCCGT TTCCAAGCAG TTCGTCGAAG TGTTGATTGC ACCGGACTAC
ACCGAGGAAG CCCTGGCAGT ATTCAAGGCC AAGGCCAATG TACGCGTGCT CAAGATCGCC
TTGCCGGTAG GCGGCGACAG CCCATGGAGC CGAGGCCGCA ACTCCCATGA CACCAAGCGC
GTCGGTTCCG GCGTACTGAT TCAGACCGCA GATAACCATG AAATCAGCGC GGCCGACATC
AAGGTCGTCA CCAAGAAGCA ACCGACGCCG GAACAGCTGG AAGATCTGCT GTTTGCCTGG
CGTGTCGCCA AATACGTAAA ATCCAACGCC ATCGTCTTCT GCGGCAACGG CATGACATTG
GGTGTGGGCG CTGGCCAGAT GAGCCGCGTC GATAGCACCA GAATTGCCGC GATCAAGGCG
CAGAACGCCG GCCTGAGCTT GCAAGGCTCC GCTGTGGCGT CCGATGCGTT CTTTCCGTTC
CGCGACGGCG TGGATGTCCT GGCGGAAGCT GGTGCCAGCT GCGTGATCCA GCCAGGCGGC
AGCATCCGCG ACGACGAAGT GATTGCGGCG GCGGATGAAC ATGGGTTAGT CATGATATTC
ACCAATATCC GCCACTTCCG CCATTGA
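
The gene sequence above is laid out in blocks of 10 bases, 60 per line, so the spaces and line breaks need to be removed before it can be measured. The sketch below is one way to do that and to compute a GC fraction in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as shown (including the block spacing).
first_line = "ATGGCTGTGA TCAAACGTGC GCTTATCAGT GTCTCTGATA AAACCGGTAT TCTCGAATTT "

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on this line
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Run on the full sequence, the cleaned length should match the Gene Length field and the GC fraction should be close to the GC content field, provided the record is internally consistent.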
 
Protein sequence
MAVIKRALIS VSDKTGILEF AKALAEFGVE ILSTGGTAKL FRDNGIPVTE VSDYTGFPEM 
LDGRVKTLHP KIHGGLLGRR DLPEHVTAMQ AAGIPDIDMI VVNLYPFEAT VARPDATLED
AIENIDIGGP AMVRSAAKNW QDVAVLTDAS QYEEVLAEMR STGGATSKAT RFALSVAAFN
RISNYDGAIS DYLSSFNADG TRNEFPGQIN GRLVKVQDLR YGENPHQQAA FYRDLYPAPG
SLVTAQQLQG KELSYNNIAD ADAAWECVKS FDSTACVIVK HANPCGVALG ATPLEAYQKA
FQTDPTSAFG GIIAFNHTLD GAAAEAVSKQ FVEVLIAPDY TEEALAVFKA KANVRVLKIA
LPVGGDSPWS RGRNSHDTKR VGSGVLIQTA DNHEISAADI KVVTKKQPTP EQLEDLLFAW
RVAKYVKSNA IVFCGNGMTL GVGAGQMSRV DSTRIAAIKA QNAGLSLQGS AVASDAFFPF
RDGVDVLAEA GASCVIQPGG SIRDDEVIAA ADEHGLVMIF TNIRHFRH
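
The protein sequence can be compared against a translation of the gene sequence. The sketch below builds the codon table from the NCBI-style amino acid string, which has the same codon assignments in translation table 11 as in the standard code. It does not handle the alternative start codons that table 11 allows, an assumption that is harmless here because the gene starts with ATG. The `translate` helper is hypothetical and is not the database's own method.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G),
# paired with the amino acid string for the standard code / table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence, copied from the record.
print(translate("ATGGCTGTGA TCAAACGT"))
print(translate("ACCAATATCC GCCACTTCCG CCATTGA"))
```

The two printed fragments can be compared with the first and last residues of the protein sequence listed above.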