Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2297 |
Symbol | |
ID | 8411838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 2217530 |
End bp | 2218957 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645020640 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003178116 |
Protein GI | 257388343 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCC AGCTCCAGCG TGCACGAGAC GGAGAGATCA CGTCGGCGAT GGAACGGATC GCAGAGCGCG AGACCGTCGA CGCCGAGTTC GTCCGCGAGC AGGTCGCGGA CGGACAGGCA GTGATCCCGG CGAACGTCGG CCACGAGACG CTCGACCCGA TGATCATCGG CCGGGAGTTC TCGACGAAGG TCAACGCCAA CATCGGCAAC AGCGAGGAGA CGAGCGACCT CGACGGCGAA CTGGAGAAGC TCCACACCGC GGTCCACTAC GGCGCGGACA CGGTGATGGA CCTCTCGACG GGCGCGAACT TAGACGAGAT CCGCGAGGCA AACGTCGAGC ATTCGCCGGT TCCCGTCGGG ACGGTCCCGA TCTACGAGGC CGTCAAGCGA GCGGGCAGCC CCGAGGAGAT CACCCACGAA CTCCTGCTGG ACGTGATCGA GAAACAGGCC GAGCAGGGCG TCGACTACAT GACGATCCAC GCGGGCGTGC TGATGGAACA CCTCCCGCTG ACCGACGGCC GCAAGACCGG GATCGTCTCT CGCGGCGGAT CGATCATGGC CAAGTGGATG GAGGAAAACG GGATGCAGAA CCCCCTCTAC ACGAAATACG AGGAGATCTG CGAGATCTTC CGTGAACACG ACGTGACCTT CAGTCTCGGC GACGGCCTGC GGCCCGGCTG TATCGCAGAC GCGAGCGACG AGGCGCAGTT CGCAGAGCTG GACACCCTGG GTGAACTGAC CCGAAAAGCG TGGGACGAGG GCGTCCAGGT GATGGTCGAG GGACCGGGCC ACGTGCCCAT GGACGAGGTC GCGGACAACG TCGAGCGCCA GCAGGAGGTC TGTGACGGCG CGCCGTTCTA CGTCCTCGGC CCGCTGGTGA CCGACATCGC GCCCGGCTAC GACCACATCA CCAGCGCGAT CGGCGCGACC GAGGCCGGAC GCGCGGGCGC GGCGATGCTG TGTTACGTCA CGCCCAAAGA GCACCTCGGC CTGCCAGAAC GAGAGGACGT GCGCGAAGGG CTCGCGGCCT ACCGGATCGC CGCACACGCC GCCGACGTTG CCAACGGGCG CGAGGGGGCC AGCGACTGGG ACGACGCCCT CTCGGAGGCC CGCTACGCCT TCGACTGGTC GGAGCAATTC GAGCTCGCGC TCGACCCCGA GCGCGCGAAG GCCTACCACG ACAAGACGCT GCCGGGTGAC AACTACAAGG ACGCCCGCTT CTGTTCGATG TGTGGCGTCG AGTTCTGCTC GATGCGGATC GATCAGGACG CGCGGGAGGG CGACGAGATG GCGTCGATCG CCGACGAGAC CGACCTCGAA GGATCGGCCG CCGCGTCGGT GAACCGACCG CCCGTCGGCA CCCACGACAG CGACGCCGAG TTGCACCACC ACGAGGGGCG ACCGACAGTC GTCGGCGACG ACGACTGA
|
Protein sequence | MTTQLQRARD GEITSAMERI AERETVDAEF VREQVADGQA VIPANVGHET LDPMIIGREF STKVNANIGN SEETSDLDGE LEKLHTAVHY GADTVMDLST GANLDEIREA NVEHSPVPVG TVPIYEAVKR AGSPEEITHE LLLDVIEKQA EQGVDYMTIH AGVLMEHLPL TDGRKTGIVS RGGSIMAKWM EENGMQNPLY TKYEEICEIF REHDVTFSLG DGLRPGCIAD ASDEAQFAEL DTLGELTRKA WDEGVQVMVE GPGHVPMDEV ADNVERQQEV CDGAPFYVLG PLVTDIAPGY DHITSAIGAT EAGRAGAAML CYVTPKEHLG LPEREDVREG LAAYRIAAHA ADVANGREGA SDWDDALSEA RYAFDWSEQF ELALDPERAK AYHDKTLPGD NYKDARFCSM CGVEFCSMRI DQDAREGDEM ASIADETDLE GSAAASVNRP PVGTHDSDAE LHHHEGRPTV VGDDD
|
| |