Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4856 |
Symbol | purH |
ID | 4643834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 5195923 |
End bp | 5197506 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639808327 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_955635 |
Protein GI | 120405806 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.228504 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA ACGACGACCT GTTCCGGAGG CCGATCAGGC GTGCCCTGAT CAGCGTCTAC GACAAGACCG GGCTTGTGCC CCTGGCGCAG GGTCTGCACG CTGCCGGCGT CGACATCGTG TCCACCGGTT CGACGGCCAA AACGATTGCC GGCGCCGGGA TTCCGGTCAC ACCCGTGGAG GACGTCACGG GCTTCCCCGA GGTGCTCGAC GGCCGTGTCA AGACGTTGCA CCCGCACGTG CACGCCGGGT TGCTCGCCGA TCAGCGCAAG GCCGAACACG TCGCGGCACT GGCCGAGCTC GGCGTCACGG CGTTCGAGCT GGTGGTGGTG AACCTGTACC CGTTCACCCA GACGGTGAAC TCCGGCGCAG ACGAAGACGA ATGCGTGGAG CAGATCGACA TCGGCGGGCC GTCGATGGTG CGCGCCGCCG CCAAGAACCA TCCCAGCGTC GCGGTCGTGG TCGATCCGCT GGGCTACGAC GGGGTGCTGG CCGCGGTGCG TGCCGGCGGC TTCACCTACT CGGAGCGAAA GAAGCTGGCG GCGTTGGCAT TCCGGCACAC CGCCGAGTAC GACGTGGCGG TGGCCTCGTG GATGGAGTCG GTGCTGGCCC CCGAGGCCGA GGCGACGAGC GGCGACCTGC CGCCCTGGCT GGGCGCGACG TTCCGGCGCG CGGCCGTGCT GCGCTACGGC GAGAACCCGC ACCAGCAGGC CGCGCTGTAC CGCGACGACG GCGGATGGCC CGGCCTCGCA CAGGCCGAAC AGCTGCACGG CAAGGAGATG TCCTACAACA ACTACACCGA CGCCGACGCG GCGTGGCGTG CGGCGTTCGA CCACGAGGAC ATCTGCGTGG CGATCATCAA GCACGCCAAC CCGTGCGGCA TCGCGATCTC GCCGGTCTCG GTCGCCGACG CCCACCGCAA GGCACACGAG TGTGACCCGC TGTCGGCGTT CGGCGGGGTG ATCGCGGCGA ACACCGAGGT CACCGTAGAG ATGGCCGAGA CCGTCGCCGG AATCTTCACC GAGGTGATCA TCGCGCCGGC CTACGAACCG GGTGCCGTCG AAGTGCTCTC GGGCAAGAAG AACATCCGCG TTCTGGTCGC CTCCGAACCC CAGCGGGGCG GCACCGAGTT CCGTCAGGTC AGCGGCGGGC TGCTGCTGCA GCAGCGCGAC GCCCTCGACG CCGCCGGCGA CAACCCGAAC ACGTGGACGC TGGCCGCAGG CCCCGCCGCC GACCCCGACA CGCTGGCCGA CCTGGCGTTC GCATGGCGGA CCTGCCGGGC GGTCAAATCT AACGCCATCG TGCTCGCCAG GGACGGCGCC ACGGTCGGCG TGGGCATGGG TCAGGTCAAC CGCGTCGACG CGGCCCGCCT GGCCGTCGAG CGCGCCGGCG GGCGCAGCAG CGGCGCGGTC GGCGCCTCCG ACGCGTTCTT CCCGTTCCCG GACGGGCTGG AGACCCTCAT CAGGGCGGGC GTCAAGGCCG TCGTCCACCC CGGCGGATCG GTGCGCGACG ACGAGGTGAC GGCCGCCGCC GAAGCGGACG GGATCACGCT CTACCTCACC GGCGCAAGGC ATTTCGCGCA CTAG
|
Protein sequence | MSDNDDLFRR PIRRALISVY DKTGLVPLAQ GLHAAGVDIV STGSTAKTIA GAGIPVTPVE DVTGFPEVLD GRVKTLHPHV HAGLLADQRK AEHVAALAEL GVTAFELVVV NLYPFTQTVN SGADEDECVE QIDIGGPSMV RAAAKNHPSV AVVVDPLGYD GVLAAVRAGG FTYSERKKLA ALAFRHTAEY DVAVASWMES VLAPEAEATS GDLPPWLGAT FRRAAVLRYG ENPHQQAALY RDDGGWPGLA QAEQLHGKEM SYNNYTDADA AWRAAFDHED ICVAIIKHAN PCGIAISPVS VADAHRKAHE CDPLSAFGGV IAANTEVTVE MAETVAGIFT EVIIAPAYEP GAVEVLSGKK NIRVLVASEP QRGGTEFRQV SGGLLLQQRD ALDAAGDNPN TWTLAAGPAA DPDTLADLAF AWRTCRAVKS NAIVLARDGA TVGVGMGQVN RVDAARLAVE RAGGRSSGAV GASDAFFPFP DGLETLIRAG VKAVVHPGGS VRDDEVTAAA EADGITLYLT GARHFAH
|
| |