Gene Mkms_4405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4405 
SymbolpurH 
ID4612348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4630563 
End bp4632146 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content70% 
IMG OID639794091 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_940386 
Protein GI119870434 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGTG ACCAAGGGCA GGCCGGGGCG AAGAGGCCGA TCCGGCGCGC ACTGATCAGC 
GTCTACGACA AGACCGGGCT GATCGACCTG GCGCGCGGAC TGCACGAGGC CGGCGTCGAC
ATCGTGTCGA CCGGCTCCAC CGCGAAAACC ATTGCCGACA AAGGCATTCC GGTCACACCG
GTCGAATTCG TGACCGGGTT CCCCGAGGTG CTCGACGGCC GCGTCAAGAC GCTGCATCCG
CACATCCACG CCGGCCTGCT CGCCGACACC CGTAAACCCG AGCACGTCGA GGCGCTGGCG
AAACTCGGCA TCGCGCCGTT CGACCTCGTG GTGGTCAACC TCTACCCGTT CAGCGAGACC
GTCGAATCCG GTGCGTCGGT CGACGAGTGC GTGGAGCAGA TCGACATCGG CGGCCCGTCG
ATGGTGCGCG CGGCCGCCAA GAACCACCCG AGCGTGGCGG TCGTCGTCGA ACCGAACGGT
TACGACGGTG TGCTGGCCGC GGTCCGGACC GGCGGCTTCA CGCTTGCCGA ACGAAAGATC
CTGGCGTCGT TGGCATTCCG GCACACCGCC GAATACGACG TGGCGGTGGC GTCGTGGATG
GGTTCGACGC TGGCGCCCGA GGAGCCCGCG CAGAAGCTGC CCGCCTGGGT GGGCGGCACC
TGGCGGCGTG CAGCGGTGCT GCGCTACGGC GAGAACCCCC ATCAGCAGGC GGCGCTCTAC
CGCGACGCCA CCGCGTGGCC GGGGCTGGCG CAGGCCGAGC AGTTGCACGG CAAGGAGATG
TCGTACAACA ACTACACCGA CGCCGATGCG GCGTGGCGGG CGGCGTTCGA CCACGAGGAG
ATCTGCGTCG CGATCATCAA GCACGCCAAC CCGTGCGGTA TCGCGATCTC GTCGGTGTCG
GTCGCCGACG CGCACCGCAA GGCCCACGAA TGTGACCCGC TGTCGGCGTT CGGCGGGGTG
ATCGCGACGA ACAGCTCCGT GAGCGTCGAG ATGGCCGAGA CCGTCGCCGA CATCTTCACC
GAGGTCATCG TCGCCCCGGC CTACGAGCCC GGCGCCGTCG AGATCCTGTC CCGCAAGAAG
AACATCCGCA TCCTGGTGGC GGCGCAACCG CCGACCACCG GCACCGAACT CCGGCCGATC
AGCGGCGGTC TGCTGCTGCA GCAGCGCGAC GCGCTCGACG CCGACGGCGA CGACCCGGTC
AACTGGACCC TCGCGACGGG TGAGCCCGCC GATCCGGCGA CGCTGGCCAA CTTGAAGTTC
GCGTGGCGCA GCTGCCGCGC GGTGAAGTCC AACGCCATCG TCGTGGTCGC CGACGGCGCC
ACCGTGGGCG TCGGGATGGG GCAGGTCAAC CGCGTCGACG CGGCGCGGCT GGCGGTGCAG
CGGGCCGGTG ACCGGGTGCG CGGCGCGGTC GCGGCGTCGG ATGCGTTCTT CCCGTTCCCC
GACGGGCTGG AGACGCTCAC CGAGGCGGGG GTGAAGGCGA TCGTGCACCC CGGCGGATCC
ATGCGCGACG ACGTGGTGAC CGAGGCGGCG GCCAAGGCCG GTATCTCGCT CTACCTGACC
GGCGCGCGGC ACTTCGCGCA CTGA
 
Protein sequence
MSGDQGQAGA KRPIRRALIS VYDKTGLIDL ARGLHEAGVD IVSTGSTAKT IADKGIPVTP 
VEFVTGFPEV LDGRVKTLHP HIHAGLLADT RKPEHVEALA KLGIAPFDLV VVNLYPFSET
VESGASVDEC VEQIDIGGPS MVRAAAKNHP SVAVVVEPNG YDGVLAAVRT GGFTLAERKI
LASLAFRHTA EYDVAVASWM GSTLAPEEPA QKLPAWVGGT WRRAAVLRYG ENPHQQAALY
RDATAWPGLA QAEQLHGKEM SYNNYTDADA AWRAAFDHEE ICVAIIKHAN PCGIAISSVS
VADAHRKAHE CDPLSAFGGV IATNSSVSVE MAETVADIFT EVIVAPAYEP GAVEILSRKK
NIRILVAAQP PTTGTELRPI SGGLLLQQRD ALDADGDDPV NWTLATGEPA DPATLANLKF
AWRSCRAVKS NAIVVVADGA TVGVGMGQVN RVDAARLAVQ RAGDRVRGAV AASDAFFPFP
DGLETLTEAG VKAIVHPGGS MRDDVVTEAA AKAGISLYLT GARHFAH