Gene Mflv_1875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_1875 
SymbolpurH 
ID4973199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp1951525 
End bp1953120 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content68% 
IMG OID640456083 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001133143 
Protein GI145222465 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0363498 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ACGACTTTCG GCGGCCGATC CGGCGTGCCC TGATCAGCGT CTACGACAAG 
TCGGGTCTGG AGGCGCTGGC CCAGGGCCTG CACGCCGCAG GCGTCGACAT CGTCTCCACC
GGCTCTACGG CGAAAAACAT TGCTGCAGCG GGCATTCCCG TCACCCCGGT CGAGGACGTC
ACCGGATTCC CCGAGGTCCT CGACGGCCGG GTGAAGACGT TGCACCCGCA CGTGCACGCG
GGCCTGCTCG CCGATCAGCG CAAGGCTGAG CACGTCACCG CACTCGAACA GCTCGGTGTC
GCGGCGTTCG AGCTCGTCGT GGTCAACCTC TATCCGTTCA CCCAGACGGT GAATTCCGGC
GCAGGTGTCG ACGAGTGCGT GGAGCAGATC GACATCGGTG GCCCGTCGAT GGTCCGTGCC
GCGGCAAAGA ACCACCCCAG CGTCGCCGTC GTCGTCGATC CACTGGGCTA CGACGGGGTC
CTCGCCGCGG TCCGGGCGGG TGGATTCACC TACACGGAGC GGAAGAAATT GGCGGCGTTG
GCGTTCCGGC ACACCGCCGA GTACGACGTC GCGGTCGCCT CGTGGATGGG GTCGGTGCTG
GCCCCTGAAG ACGACGCCCC TGAAGGCGCC GCGGATGATG TCGCGCAGCT GCCGCCGTGG
CTGGGCTCGA CCTTCCGCCG CTCGGCGGTG CTGCGCTACG GGGAGAACCC CCACCAGAAG
GCGGCGCTCT ACGGCGACGA GGGTGGATGG CCGGGCCTTG CGCAGGCCGA GCAGCTGCAC
GGCAAGGAGA TGTCCTACAA CAACTACACC GACGCCGACG CCGCGTGGCG CGCCGCGTTC
GACCAGGAGG CCATCTGCGT CGCGATCATC AAGCACGCCA ACCCGTGTGG CATCGCGATC
TCGTCGGTCT CGGTCGCCGA CGCTCATCGC AAGGCCCACG AGTGTGATCC GCTCTCGGCG
TTCGGCGGTG TGATCGCGGC CAACACCGAG GTCACCGTCG AGATGGCCGA GACCGTGGCG
GGCATCTTCA CCGAGGTGAT CATCGCGCCC GCCTACGAAC CCGGTGCCGT CGATGTGCTG
AAGGGCAAGA AGAACATTCG GGTGCTCGTC GCTTCAGAGC CGCAGCCCGG GGGTACCGAG
TTCCGGCAGA TCAGCGGCGG GTTGTTGCTC CAGCAGCGCG ACGCGCTCGA CGCGTCAGGC
GACGACCCCA ACAACTGGAC ACTGGCGACC GGCGCTCCCG CCGATCCGGA TACCCTCGCT
GACCTGGTGT TCGCCTGGCG AACCTGCCGC GCGGTCAAGT CCAACGCGAT CGTGCTCGCC
AAGGACGGCG CGACGGTCGG CGTCGGTATG GGACAGGTCA ACCGGGTCGA CGCGGCGCGA
CTGGCCGTGG AACGCGCCGG GGAGCGGACC CGGGACGCGG TCGGCGCCTC CGACGCGTTC
TTCCCGTTCC CGGACGGTCT GCAGACGCTC ATCGATGCCG GCGTCAAGGC CGTCGTGCAC
CCCGGTGGGT CTGTCCGCGA CGACGAGGTG ACGGCGGCCG CCGAAGCGGC GGGGATCACG
TTGTATCTCA CCGGCGCACG ACATTTCGCA CACTAG
 
Protein sequence
MSDNDFRRPI RRALISVYDK SGLEALAQGL HAAGVDIVST GSTAKNIAAA GIPVTPVEDV 
TGFPEVLDGR VKTLHPHVHA GLLADQRKAE HVTALEQLGV AAFELVVVNL YPFTQTVNSG
AGVDECVEQI DIGGPSMVRA AAKNHPSVAV VVDPLGYDGV LAAVRAGGFT YTERKKLAAL
AFRHTAEYDV AVASWMGSVL APEDDAPEGA ADDVAQLPPW LGSTFRRSAV LRYGENPHQK
AALYGDEGGW PGLAQAEQLH GKEMSYNNYT DADAAWRAAF DQEAICVAII KHANPCGIAI
SSVSVADAHR KAHECDPLSA FGGVIAANTE VTVEMAETVA GIFTEVIIAP AYEPGAVDVL
KGKKNIRVLV ASEPQPGGTE FRQISGGLLL QQRDALDASG DDPNNWTLAT GAPADPDTLA
DLVFAWRTCR AVKSNAIVLA KDGATVGVGM GQVNRVDAAR LAVERAGERT RDAVGASDAF
FPFPDGLQTL IDAGVKAVVH PGGSVRDDEV TAAAEAAGIT LYLTGARHFA H