Gene Hmuk_3147 details

Gene Information

Locus tag: Hmuk_3147
Symbol:
ID: 8412700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 3033296
End bp: 3035749
Gene Length: 2454 bp
Protein Length: 817 aa
Translation table: 11
GC content: 70%
IMG OID: 645021494
Product: FolC bifunctional protein
Protein accession: YP_003178959
Protein GI: 257389186
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01496] dihydropteroate synthase
            [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
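
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS that includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(3033296, 3035749)
print(length)                  # 2454
print(protein_length(length))  # 817
```

Both values agree with the Gene Length and Protein Length fields listed in the record.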


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0120997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTCC ACGAGGCGGC GAACTTCCTC TTCGATCTCC GCCGATTCGC GCTCCGACCG 
GGCACGGAGG CCACTCGGTC GCTGCTGGCC CAGCTGGGCG ACCCGCACGA CTCGCTCGAC
TGCGTGCAGA TCGCCGGCTC GAACGGCAAG GGAAGCACGG CTCGCATGGT CGAACGGACG
CTCAGGGAGG CGGGGCTCGA CGTGGGCCTG TACACCTCGC CACACCTAGA CGACGTTCGG
GAACGCATCC GGGTCAACGG CCGGAAGATC ACCGAGTCGG CGATCGTCGA GTTCGTCGAA
TCGGTCGAGC CGTACGTCAC CGACCGCGCG GCCGACGGCA CTTCGCCGAC GCTCTTCGAG
ACGCTGACGG CACTGGCCTA CTGGGAGTTC GAGCGCCAGT CGGTCGACGT GGCGGTCATC
GAGGTCGGCA TCGGTGGCAA GCTCGACGCC ACCAGCGTCG TCGACCCCGT CGCCAGCGCG
GTGACGACGG TGACGCTCGA ACACACGGAC ATCCTGGGCG ACACGGTCGG CGAGATCGCC
CGAGACAAGG CACACGTCGC GCCCGCGGAC CGCCCGCTCG TGACTGCCAC AGAGGGCGAT
GCGCTCGACG CCGTCCGGGA CGTGGCCGGC GACGTGGTCA CCGTCGCAGA CACCGCTGAC
GGCGAGGCCG GCGACGTGTC GGTCACCGAC CACGGCCGCG AGGGACTGGA GGGATCGGTC
TCGATCGCGG GCGACGAGTG GGGCGTCGAG ACGAAGCTCC CGCTGCTCGG TGCCCACCAG
GCGATCAACG CCGGGATCGC CGCGACGCTG TGTCGCCAGG TTTCGGATGT CGACGAGGCG
ACGATCGCAC GCGGCCTGCG AAACGCCCAC TGGCCCGGTC GCTTCGAGAT CATGGGACGG
GATCCCCTGG TGGTACTGGA CGGCGCGCAC AACCCCGGCG GCGTCGAACG CGTCGTCGAG
ACGCTGGCCG CGTTCGACTA CGACGAGCTC CACGTCGTCG CGGGGTCGAT GGTCGACAAG
GACCTGCGTG CCATCGCGGG CGCACTCGAC GGGGCCGACC ACGTCGTCGC CTGCGAGCCC
GACCGAGACC GGGCCGAAGA CGAGCAGGTG GTCGCGAAGG CGTTCCGAGA CGAGACTGCG
GCCAGCGTCG AGACGCGCAG CGACGTGGCC GGCGCGTTCG ATGTCGCCCT CGATGGGGCC
GGCGAGGACG ACTGTGTGCT GGTCACGGGC TCGCTCTACG CCGTCGCAGA GGCTCGCCAG
CGCTGGATCC GGCCGACGAT TCCCAAGCGG GTCGACGGCG TGGCGTCGGC ACGCGAGACG
ATCGACGCCG CCCACGTCAC CGACGCCGGT GCGTGGCGCA TGCGCGGCAA GGCGGTCCAC
CGCCTCCTCA AGACGCGGGT CCAGCCCCGA CAGGCACAGT ACCTCAAAGA AGAGCTGCTC
TCGCTGGGTG GGGACTGTGC GACCTCGGGG CTGAACGACC AGGACGAGGA GATGCTGGAC
GTACTCCTGA TGGGGACGAT GGCCCAGTTC CGCCGCCTGA CGGACAAGCT CGACGGCCAA
CCGTACGGCC TCGCGTCGCT GGCCGACGAA CTCCGGACCG CGCTGGACAT CCAGCAGCCC
GATCGAGACC GGGGCTATCC CTGGGACGAC GGCACCGCAG TCATGGGGAT CTGTAACATC
ACGCCGGACT CCTTCCACGA CGGCGGGGAG TACGACGCCG TCGAGGACGC CGTTTCCCGC
GCCGAGTCGA TGGTCGCGGC CGGCGTCGAC GTCCTCGACG TCGGCGGCGA GTCCACCCGC
CCCGGCGCGG ACGAGATCCC CGTCGAGGAG GAGATCGAGC GCATCGTCCC GGTGATCGAG
CGCCTCGCCG ACCTCGACGC CGCCGTCGCC GTCGACACAC GCAAGGCACC GGTCGCCCGC
GCCGCGCTGG ACGCCGGTGC GGACATCCTC AACGACGTGT CCGGGCTCGA AGACCCCGAG
ATGCGCCTCG TCGCCGCAGA CTACGACGTG CCCGTCGTCG TGATGCACAG CATCGAGGTA
CCGGTCGATC CCGACAGCGC CGTCGACTAC GACGACGTGG TCGAGGACTG CATCGACCAG
CTGACCGAAC GCGTCCTGCT GGCCGAGAAA GCGGGCCTCG ACCGCGAACA GATCGTGGTC
GATCCGGGGC TGGGATTCGG CAAATCGGCG GCCGAGAGCT TCGAACTGCT GGGCCGACTC
GAAGAGTTCC AGTCGCTGGG CTGCCCGATC CTGGTCGGTC ACTCCCACAA GTCGATGTTC
GAACTCGTCG GAGCCGAGGC CGGGGACTGC CTGGACGCCA CCATCGCCGG GACGACCCTG
GCCGCCGAGC GCGGGGCCGA CATCGTTCGC GTCCACGATG TGCCCGAGGC CGTGACCGCC
GTGAACGTGG TCGAAGCCGC CGACGAGCCT GGGTCGTTCG TCGACGAAGG CTAA
 
Protein sequence
MKFHEAANFL FDLRRFALRP GTEATRSLLA QLGDPHDSLD CVQIAGSNGK GSTARMVERT 
LREAGLDVGL YTSPHLDDVR ERIRVNGRKI TESAIVEFVE SVEPYVTDRA ADGTSPTLFE
TLTALAYWEF ERQSVDVAVI EVGIGGKLDA TSVVDPVASA VTTVTLEHTD ILGDTVGEIA
RDKAHVAPAD RPLVTATEGD ALDAVRDVAG DVVTVADTAD GEAGDVSVTD HGREGLEGSV
SIAGDEWGVE TKLPLLGAHQ AINAGIAATL CRQVSDVDEA TIARGLRNAH WPGRFEIMGR
DPLVVLDGAH NPGGVERVVE TLAAFDYDEL HVVAGSMVDK DLRAIAGALD GADHVVACEP
DRDRAEDEQV VAKAFRDETA ASVETRSDVA GAFDVALDGA GEDDCVLVTG SLYAVAEARQ
RWIRPTIPKR VDGVASARET IDAAHVTDAG AWRMRGKAVH RLLKTRVQPR QAQYLKEELL
SLGGDCATSG LNDQDEEMLD VLLMGTMAQF RRLTDKLDGQ PYGLASLADE LRTALDIQQP
DRDRGYPWDD GTAVMGICNI TPDSFHDGGE YDAVEDAVSR AESMVAAGVD VLDVGGESTR
PGADEIPVEE EIERIVPVIE RLADLDAAVA VDTRKAPVAR AALDAGADIL NDVSGLEDPE
MRLVAADYDV PVVVMHSIEV PVDPDSAVDY DDVVEDCIDQ LTERVLLAEK AGLDREQIVV
DPGLGFGKSA AESFELLGRL EEFQSLGCPI LVGHSHKSMF ELVGAEAGDC LDATIAGTTL
AAERGADIVR VHDVPEAVTA VNVVEAADEP GSFVDEG
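
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, using only the Python standard library, of how the blocks could be joined, the GC percentage computed, and the gene translated. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code for elongation; table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGAAGTTCC ACGAGGCGGC GAACTTCCTC"))  # MKFHEAANFL
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to check the two blocks against each other, and `gc_content` can be compared with the GC content field in the same manner.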