Gene Hmuk_3155 details


Gene Information

Locus tag: Hmuk_3155
Symbol:
ID: 8412708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 3045433
End bp: 3047043
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 68%
IMG OID: 645021502
Product: phosphoribosylglycinamide formyltransferase
Protein accession: YP_003178967
Protein GI: 257389194
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
            [TIGR00639] phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent
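
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3045433, 3047043)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.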


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.262908
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCG CCGGTCTGGC CAGCAACCGT GGCCGCAACC TGATGAACGT CGCCGACCGC 
GCGCCCGGTG GAGCGGAGCT CGCGGTCGTG CTCACGAACG ACGCCGACGC GCCCGTCATA
GAGGCCGCCG CCGAGCGCGA CATTCCCACC GAGGTCGTCG AGCGCCCCGA CGACCAGGAG
CGGGAAGCCC ACGAACTGCG CGTGCTGGAC GCCATCGAGG AGTACGACTT CGATCTGGTC
TGTCTGGACG GCTACATGCG AGTCCTCACG GAGACGTTCC TCGACGAGGT GCCCACGACG
CTGAACGTCC ACCCGTCGCT GCTGCCCGCC TTCCCGGGCA TGGACGCCCA CGAGCAGGTG
CTCGACGCCG GAGTCAAGAC CACCGGCTGT ACCGTCCACG TCGTCGACGA GGAGGTCGAC
GACGGCCCGA TCGTCACGCA GGAACCGATC CCGGTGTACG ACGGCGACGA CGTGGCCGAC
CTCAAAGAGC GCGTCCTCTA CGAGGGCGAG TTCACTGCGT ATCCGCGCGC GATCGAGTGG
TTCGCAGAGG ACCGCGTCAC CGTCGACTGG GACGCCCACA GCGTCACCGT CGAGGGCGAC
GACGGCGGTC CGTTCCCCGC GCGCCGGCTC GTCAGCAACG ACCGCACCGC CGACCTGCGC
TACGGGGAGA ACCCGCATCA GGACGCCGCG GTGTACGCCG ACCGGACGAC CGAAGAGGCC
AGCGTCGTCC ACGCCGACCA GCTCAACGAG GGCGCGAAGG CACTCAGCTA CAACAACTAC
AACGACGCCG ACGGAGCCCT GAATCTGATC AAGGAGTTCG ACGAGCCGGC CGCCGCCGTC
ATCAAGCACA CCAATCCCGC CGGCTGTGCG ACCGCCGACT CCGTCGCCGA GGCCTACGAG
CGGGCCCTCT CGACGGACCC CCAGAGCGCC TTCGGCGGCA TCGTCGCGCT GAACCGCGAG
TGCGACGTTG CCACGGCCGA GCAGATCGTC GACTCCTTCA AGGAGATCGT CGTCGCGCCG
GGCTACACCG ACGACGCGCT CGACGTGCTC TTCCAGAAGG AGAACCTGCG CGTGCTGGAC
GTTGGAGACG GCCGGACGGG CGAGTCCGGC CGGCCGGAAA ACTACGACGT GACCGAGCCG
ATCACGGAGA AACCACTCGT CGGCGGCCGC CTCGTCCAGG AGCGGGACAC CCAGCACCTC
ACGGCCGACG ACCTCGAAGT CGTCACCGAC CGCGAGCCCA CCGACGAGCA GATCGAGTCG
ATGCTGTTCG CCTGGCACAC GCTCAAGCAC GTGAAATCGA ACGGCATCCT CTTTGCCAAG
GGCACGGAGA CGGTCGGCAT CGGGATGGGC CAGGTCTCTC GGGTCGACGC CGTCCGCCTC
GCCGCGATGA AGGCCGACGA GCACGCCCAG GGCAAGGACG CCGACGGCGC GGTCATGGCG
AGCGACGCCT TCTTCCCGTT CCCGGACGGC CTCGAAGCCG CCGCCGAGGC GGGCATCGAG
GCGGTCATCC AGCCGGGCGG CTCGAAGAAC GACGACATGG TCATCGAGGC CGCGAACGAA
CACGACGTGG CGATGGTCCT TACCGGCCAG CGGTCGTTCC GACACGACTG A
 
Protein sequence
MKIAGLASNR GRNLMNVADR APGGAELAVV LTNDADAPVI EAAAERDIPT EVVERPDDQE 
REAHELRVLD AIEEYDFDLV CLDGYMRVLT ETFLDEVPTT LNVHPSLLPA FPGMDAHEQV
LDAGVKTTGC TVHVVDEEVD DGPIVTQEPI PVYDGDDVAD LKERVLYEGE FTAYPRAIEW
FAEDRVTVDW DAHSVTVEGD DGGPFPARRL VSNDRTADLR YGENPHQDAA VYADRTTEEA
SVVHADQLNE GAKALSYNNY NDADGALNLI KEFDEPAAAV IKHTNPAGCA TADSVAEAYE
RALSTDPQSA FGGIVALNRE CDVATAEQIV DSFKEIVVAP GYTDDALDVL FQKENLRVLD
VGDGRTGESG RPENYDVTEP ITEKPLVGGR LVQERDTQHL TADDLEVVTD REPTDEQIES
MLFAWHTLKH VKSNGILFAK GTETVGIGMG QVSRVDAVRL AAMKADEHAQ GKDADGAVMA
SDAFFPFPDG LEAAAEAGIE AVIQPGGSKN DDMVIEAANE HDVAMVLTGQ RSFRHD
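
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following minimal sketch strips the whitespace and computes a GC percentage. It assumes that the GC content field of the record is the share of G and C bases over the full gene sequence; the record does not define how that figure was derived. The helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here only as sample input.
sample = "ATGAAAATCG CCGGTCTGGC"
print(len(clean_sequence(sample)))
print(gc_percent(sample))
```

Passing the complete gene sequence to `gc_percent` gives a value that can be compared against the GC content field.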