Gene Mmcs_4319 details

Gene Information

Locus tag: Mmcs_4319
Symbol: purH
ID: 4113149
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 4591358
End bp: 4592941
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 70%
IMG OID: 638033465
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_641480
Protein GI: 108801283
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
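
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon. Neither convention is stated in the record itself, and the function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the Gene Information entries above.
    start_bp, end_bp = 4591358, 4592941
    print(span_length(start_bp, end_bp))
    print(expected_protein_length(span_length(start_bp, end_bp)))
```

Under these assumptions the span matches the listed gene length of 1584 bp, and 1584 / 3 - 1 gives the listed protein length of 527 aa.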


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAGCGGTG ACCAAGGGCA GGCCGGGGCG AAGAGGCCGA TCCGGCGCGC ACTGATCAGC 
GTCTACGACA AGACCGGGCT GATCGACCTG GCGCGCGGAC TGCACGAGGC CGGCGTCGAC
ATCGTGTCGA CCGGCTCCAC CGCGAAAACC ATTGCCGACA AAGGCATTCC GGTCACACCG
GTCGAATTCG TGACCGGGTT CCCCGAGGTG CTCGACGGCC GCGTCAAGAC GCTGCATCCG
CACATCCACG CCGGCCTGCT CGCCGACACC CGTAAACCCG AGCACGTCGA GGCGCTGGCG
AAACTCGGCA TCGCGCCGTT CGACCTCGTG GTGGTCAACC TCTACCCGTT CAGCGAGACC
GTCGAATCCG GTGCGTCGGT CGACGAGTGC GTGGAGCAGA TCGACATCGG CGGCCCGTCG
ATGGTGCGCG CGGCCGCCAA GAACCACCCG AGCGTGGCGG TCGTCGTCGA ACCGAACGGT
TACGACGGTG TGCTGGCCGC GGTCCGGACC GGCGGCTTCA CGCTTGCCGA ACGAAAGATC
CTGGCGTCGT TGGCATTCCG GCACACCGCC GAATACGACG TGGCGGTGGC GTCGTGGATG
GGTTCGACGC TGGCGCCCGA GGAGCCCGCG CAGAAGCTGC CCGCCTGGGT GGGCGGCACC
TGGCGGCGTG CAGCGGTGCT GCGCTACGGC GAGAACCCCC ATCAGCAGGC GGCGCTCTAC
CGCGACGCCA CCGCGTGGCC GGGGCTGGCG CAGGCCGAGC AGTTGCACGG CAAGGAGATG
TCGTACAACA ACTACACCGA CGCCGATGCG GCGTGGCGGG CGGCGTTCGA CCACGAGGAG
ATCTGCGTCG CGATCATCAA GCACGCCAAC CCGTGCGGTA TCGCGATCTC GTCGGTGTCG
GTCGCCGACG CGCACCGCAA GGCCCACGAA TGTGACCCGC TGTCGGCGTT CGGCGGGGTG
ATCGCGACGA ACAGCTCCGT GAGCGTCGAG ATGGCCGAGA CCGTCGCCGA CATCTTCACC
GAGGTCATCG TCGCCCCGGC CTACGAGCCC GGCGCCGTCG AGATCCTGTC CCGCAAGAAG
AACATCCGCA TCCTGGTGGC GGCGCAACCG CCGACCACCG GCACCGAACT CCGGCCGATC
AGCGGCGGTC TGCTGCTGCA GCAGCGCGAC GCGCTCGACG CCGACGGCGA CGACCCGGTC
AACTGGACCC TCGCGACGGG TGAGCCCGCC GATCCGGCGA CGCTGGCCAA CTTGAAGTTC
GCGTGGCGCA GCTGCCGCGC GGTGAAGTCC AACGCCATCG TCGTGGTCGC CGACGGCGCC
ACCGTGGGCG TCGGGATGGG GCAGGTCAAC CGCGTCGACG CGGCGCGGCT GGCGGTGCAG
CGGGCCGGTG ACCGGGTGCG CGGCGCGGTC GCGGCGTCGG ATGCGTTCTT CCCGTTCCCC
GACGGGCTGG AGACGCTCAC CGAGGCGGGG GTGAAGGCGA TCGTGCACCC CGGCGGATCC
ATGCGCGACG ACGTGGTGAC CGAGGCGGCG GCCAAGGCCG GTATCTCGCT CTACCTGACC
GGCGCGCGGC ACTTCGCGCA CTGA
 
Protein sequence
MSGDQGQAGA KRPIRRALIS VYDKTGLIDL ARGLHEAGVD IVSTGSTAKT IADKGIPVTP 
VEFVTGFPEV LDGRVKTLHP HIHAGLLADT RKPEHVEALA KLGIAPFDLV VVNLYPFSET
VESGASVDEC VEQIDIGGPS MVRAAAKNHP SVAVVVEPNG YDGVLAAVRT GGFTLAERKI
LASLAFRHTA EYDVAVASWM GSTLAPEEPA QKLPAWVGGT WRRAAVLRYG ENPHQQAALY
RDATAWPGLA QAEQLHGKEM SYNNYTDADA AWRAAFDHEE ICVAIIKHAN PCGIAISSVS
VADAHRKAHE CDPLSAFGGV IATNSSVSVE MAETVADIFT EVIVAPAYEP GAVEILSRKK
NIRILVAAQP PTTGTELRPI SGGLLLQQRD ALDADGDDPV NWTLATGEPA DPATLANLKF
AWRSCRAVKS NAIVVVADGA TVGVGMGQVN RVDAARLAVQ RAGDRVRGAV AASDAFFPFP
DGLETLTEAG VKAIVHPGGS MRDDVVTEAA AKAGISLYLT GARHFAH
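
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be joined and the GC fraction of the gene sequence computed for comparison with the GC content listed in the record. The helper names are hypothetical, and only the first printed line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First printed line of the gene sequence, used only as sample input.
    first_line = (
        "ATGAGCGGTG ACCAAGGGCA GGCCGGGGCG "
        "AAGAGGCCGA TCCGGCGCGC ACTGATCAGC"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene sequence through the same two functions would give a value to compare against the listed GC content.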