Gene Mjls_4699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4699 
SymbolpurH 
ID4880398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4939733 
End bp4941316 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content70% 
IMG OID640142004 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001072955 
Protein GI126437264 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGTG ACCAAGGGCA GGCCGGGGCG AAGAGGCCGA TCCGGCGCGC ACTGATCAGC 
GTCTACGACA AGAGCGGGCT GATCGACCTG GCGCGCGGAC TGCACGAGGC CGGCGTCGAC
ATCGTGTCGA CCGGCTCCAC CGCGAAAACC ATTGCCGACA AAGGCATTCC GGTCACACCT
GTCGAATTCG TGACCGGGTT CCCCGAAGTG CTCGACGGCC GCGTCAAGAC GCTGCATCCG
CACATCCACG CCGGCCTGCT CGCCGACACC CGTAAACCCG AGCACGTCGA GGCGCTGGCG
AAACTCGGCA TCGCGCCGTT CGACCTCGTG GTGGTCAACC TCTACCCGTT CAGCGAGACC
GTCGAATCCG GCGCGTCGGT CGACGAGTGC GTGGAGCAGA TCGACATCGG CGGCCCGTCG
ATGGTGCGCG CGGCCGCCAA GAACCACCCG AGCGTGGCGG TGGTCGTCGA ACCGAACGGG
TACGACGGTG TGCTGGCCGC GGTCCGGACC GGCGGCTTCA CGCTTGCCGA ACGAAAGATC
CTGGCGTCGT TGGCATTCCG GCACACCGCC GAATACGACG TGGCGGTGGC GTCGTGGATG
GGTTCGACGC TGGCGCCCGA GGAGCCCGCG CAGAAGCTGC CCGCCTGGGT GGGCGGCACC
TGGCGGCGTG CCGCGGTACT GCGCTACGGC GAGAACCCCC ATCAGCAGGC GGCGCTCTAC
CGCGACGCCA CCGCGTGGCC GGGGCTGGCG CAGGCCGAGC AGTTGCACGG CAAGGAGATG
TCGTACAACA ACTACACCGA CGCCGATGCG GCGTGGCGGG CGGCGTTCGA CCACGAGGAG
ATCTGCGTCG CGATCATCAA GCACGCCAAC CCGTGCGGTA TCGCGATCTC GTCGGTGTCG
GTCGCCGACG CGCACCGCAA GGCCCACGAA TGTGACCCGC TGTCGGCGTT CGGCGGGGTG
ATCGCGACGA ACAGCTCCGT GAGCGTCGAG ATGGCCGAGA CCGTCGCCGA CATCTTCACC
GAGGTCATCG TCGCCCCGGC CTACGAGCCC GGCGCCGTCG AGATCCTGTC CCGCAAGAAG
AACATCCGCA TCCTGTTGGC GGCGCAACCG CCGACCACCG GCACCGAACT CCGGCCGATC
AGCGGCGGTC TGCTGCTGCA GCAGCGCGAT GCGCTCGACG CCGACGGCGA CGACCCGGTC
AACTGGACCC TCGCGACGGG TGAGCCCGCC GATCCGGCGA CGCTGGCCAA CTTGAAGTTC
GCCTGGCGCA GCTGCCGCGC CGTGAAGTCC AACGCCATCG TCGTGGTCGC CGACGGCGCC
ACCGTGGGCG TCGGGATGGG GCAGGTCAAC CGCGTCGACG CGGCGCGGCT GGCGGTGCAG
CGGGCCGGTG ACCGGGTGCG CGGCGCGATC GCGGCGTCGG ATGCGTTCTT CCCGTTCCCC
GACGGGCTGG AGACGCTCAC CGAGGCGGGG GTGAAGGCGA TCGTGCACCC CGGCGGATCC
ATGCGCGACG ACGTGGTGAC CGAGGCGGCG GCCAAGGCCG GTATCTCGCT CTACCTGACC
GGCGCGCGGC ACTTCGCGCA CTGA
 
Protein sequence
MSGDQGQAGA KRPIRRALIS VYDKSGLIDL ARGLHEAGVD IVSTGSTAKT IADKGIPVTP 
VEFVTGFPEV LDGRVKTLHP HIHAGLLADT RKPEHVEALA KLGIAPFDLV VVNLYPFSET
VESGASVDEC VEQIDIGGPS MVRAAAKNHP SVAVVVEPNG YDGVLAAVRT GGFTLAERKI
LASLAFRHTA EYDVAVASWM GSTLAPEEPA QKLPAWVGGT WRRAAVLRYG ENPHQQAALY
RDATAWPGLA QAEQLHGKEM SYNNYTDADA AWRAAFDHEE ICVAIIKHAN PCGIAISSVS
VADAHRKAHE CDPLSAFGGV IATNSSVSVE MAETVADIFT EVIVAPAYEP GAVEILSRKK
NIRILLAAQP PTTGTELRPI SGGLLLQQRD ALDADGDDPV NWTLATGEPA DPATLANLKF
AWRSCRAVKS NAIVVVADGA TVGVGMGQVN RVDAARLAVQ RAGDRVRGAI AASDAFFPFP
DGLETLTEAG VKAIVHPGGS MRDDVVTEAA AKAGISLYLT GARHFAH