Gene Mvan_4856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4856 
SymbolpurH 
ID4643834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5195923 
End bp5197506 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content70% 
IMG OID639808327 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_955635 
Protein GI120405806 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.228504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ACGACGACCT GTTCCGGAGG CCGATCAGGC GTGCCCTGAT CAGCGTCTAC 
GACAAGACCG GGCTTGTGCC CCTGGCGCAG GGTCTGCACG CTGCCGGCGT CGACATCGTG
TCCACCGGTT CGACGGCCAA AACGATTGCC GGCGCCGGGA TTCCGGTCAC ACCCGTGGAG
GACGTCACGG GCTTCCCCGA GGTGCTCGAC GGCCGTGTCA AGACGTTGCA CCCGCACGTG
CACGCCGGGT TGCTCGCCGA TCAGCGCAAG GCCGAACACG TCGCGGCACT GGCCGAGCTC
GGCGTCACGG CGTTCGAGCT GGTGGTGGTG AACCTGTACC CGTTCACCCA GACGGTGAAC
TCCGGCGCAG ACGAAGACGA ATGCGTGGAG CAGATCGACA TCGGCGGGCC GTCGATGGTG
CGCGCCGCCG CCAAGAACCA TCCCAGCGTC GCGGTCGTGG TCGATCCGCT GGGCTACGAC
GGGGTGCTGG CCGCGGTGCG TGCCGGCGGC TTCACCTACT CGGAGCGAAA GAAGCTGGCG
GCGTTGGCAT TCCGGCACAC CGCCGAGTAC GACGTGGCGG TGGCCTCGTG GATGGAGTCG
GTGCTGGCCC CCGAGGCCGA GGCGACGAGC GGCGACCTGC CGCCCTGGCT GGGCGCGACG
TTCCGGCGCG CGGCCGTGCT GCGCTACGGC GAGAACCCGC ACCAGCAGGC CGCGCTGTAC
CGCGACGACG GCGGATGGCC CGGCCTCGCA CAGGCCGAAC AGCTGCACGG CAAGGAGATG
TCCTACAACA ACTACACCGA CGCCGACGCG GCGTGGCGTG CGGCGTTCGA CCACGAGGAC
ATCTGCGTGG CGATCATCAA GCACGCCAAC CCGTGCGGCA TCGCGATCTC GCCGGTCTCG
GTCGCCGACG CCCACCGCAA GGCACACGAG TGTGACCCGC TGTCGGCGTT CGGCGGGGTG
ATCGCGGCGA ACACCGAGGT CACCGTAGAG ATGGCCGAGA CCGTCGCCGG AATCTTCACC
GAGGTGATCA TCGCGCCGGC CTACGAACCG GGTGCCGTCG AAGTGCTCTC GGGCAAGAAG
AACATCCGCG TTCTGGTCGC CTCCGAACCC CAGCGGGGCG GCACCGAGTT CCGTCAGGTC
AGCGGCGGGC TGCTGCTGCA GCAGCGCGAC GCCCTCGACG CCGCCGGCGA CAACCCGAAC
ACGTGGACGC TGGCCGCAGG CCCCGCCGCC GACCCCGACA CGCTGGCCGA CCTGGCGTTC
GCATGGCGGA CCTGCCGGGC GGTCAAATCT AACGCCATCG TGCTCGCCAG GGACGGCGCC
ACGGTCGGCG TGGGCATGGG TCAGGTCAAC CGCGTCGACG CGGCCCGCCT GGCCGTCGAG
CGCGCCGGCG GGCGCAGCAG CGGCGCGGTC GGCGCCTCCG ACGCGTTCTT CCCGTTCCCG
GACGGGCTGG AGACCCTCAT CAGGGCGGGC GTCAAGGCCG TCGTCCACCC CGGCGGATCG
GTGCGCGACG ACGAGGTGAC GGCCGCCGCC GAAGCGGACG GGATCACGCT CTACCTCACC
GGCGCAAGGC ATTTCGCGCA CTAG
 
Protein sequence
MSDNDDLFRR PIRRALISVY DKTGLVPLAQ GLHAAGVDIV STGSTAKTIA GAGIPVTPVE 
DVTGFPEVLD GRVKTLHPHV HAGLLADQRK AEHVAALAEL GVTAFELVVV NLYPFTQTVN
SGADEDECVE QIDIGGPSMV RAAAKNHPSV AVVVDPLGYD GVLAAVRAGG FTYSERKKLA
ALAFRHTAEY DVAVASWMES VLAPEAEATS GDLPPWLGAT FRRAAVLRYG ENPHQQAALY
RDDGGWPGLA QAEQLHGKEM SYNNYTDADA AWRAAFDHED ICVAIIKHAN PCGIAISPVS
VADAHRKAHE CDPLSAFGGV IAANTEVTVE MAETVAGIFT EVIIAPAYEP GAVEVLSGKK
NIRVLVASEP QRGGTEFRQV SGGLLLQQRD ALDAAGDNPN TWTLAAGPAA DPDTLADLAF
AWRTCRAVKS NAIVLARDGA TVGVGMGQVN RVDAARLAVE RAGGRSSGAV GASDAFFPFP
DGLETLIRAG VKAVVHPGGS VRDDEVTAAA EADGITLYLT GARHFAH