Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0837 |
Symbol | |
ID | 8410352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 808642 |
End bp | 809778 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645019173 |
Product | 2-methylcitrate synthase/citrate synthase II |
Protein accession | YP_003176675 |
Protein GI | 257386902 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01800] 2-methylcitrate synthase/citrate synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.531649 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACG ACCTCAAGCA AGGTCTGGAG GGTGTCCTCG TTACCGAGTC GGAGCTCAGC AAGATCGACG GTGACGCGGG CAAACTCGTC TATCGTGGCT ACACGATCGA GGACCTGGCG ACCGGCGCAA GCTTCGAGGA GGTCCTGTAT CTCCTCTGGC ACGGTCACCT CCCGAACGCA GCGGAGCTCG ACGAGTTCAC CGACGCGATG GTCGAGGAGC GACACGTCGA CGACGACGTC ATGCAGACCG TCGAGCAGCT CGCCGACGCC GACGAGAATC CGATGGCCGC GCTCCGAACG GCGGTCTCGA TGCTGTCCTC GCACGATCCC GACGCCGAAA CCGACCCGAC CGACCTCGAC GCCAACCTCC GGAAAGGTCG CCGGATCACC GCCAAGATCC CGACCGTGCT GGCTGCCTTC GCCCGGTTCC GTGACGGACA GGACGCCGTC GAACCGCGGG AAGACCTCTC GCACGCCGCG AACTTCCTCT ACATGCTCAA CGGCGAGGCA CCGGACGAGG TGCTCGCCGA GACGTTCGAC ATGGCGCTCG TGCTCCACGC CGACCACGGC ATCAACGCCT CGACGTTCTC GGCCATGGTC ACGGCCTCGA CGCTGTCGGA TCTCCACAGC GCGATCACCT CCGCGATCGG CACCCTGAAG GGATCGCTCC ACGGCGGCGC GAACCAGGAC GTCATGGAGA TGCTCAAGGA GGTCGACGAC GCCCAGCAGG ACCCGATCGA CTGGGTGAAG ACGGCACTCG ACGAGGGACG GCGCGTCTCC GGCTTCGGCC ACCGCGTCTA CAACGTCAAG GACCCCCGTG CGAAGATCCT CAGCCAGCGC TCGAAGGAAC TGGGGGAGGC CGCCGGCTCG CTCAAGTGGT ACGAGATGTC CACCGCCATC GAGGACTACC TCAAAGCGGA GAAGGGGCTG GCCCCGAACG TCGACTTCTA CTCGGCCTCG ACGTACTACC AGATGGGGAT CCCCATCGAC ATCTACACTC CCATCTTCGC GATGTCCCGC GTCGGCGGCT GGACGGCCCA CGTCCTCGAA CAGTACGAGA ACAACCGTCT GATCCGGCCC CGCGCTCGCT ACGTCGGCCC GACGGATCAG ACGTTCGTCC CCCTCGACGA GCGATAG
|
Protein sequence | MSDDLKQGLE GVLVTESELS KIDGDAGKLV YRGYTIEDLA TGASFEEVLY LLWHGHLPNA AELDEFTDAM VEERHVDDDV MQTVEQLADA DENPMAALRT AVSMLSSHDP DAETDPTDLD ANLRKGRRIT AKIPTVLAAF ARFRDGQDAV EPREDLSHAA NFLYMLNGEA PDEVLAETFD MALVLHADHG INASTFSAMV TASTLSDLHS AITSAIGTLK GSLHGGANQD VMEMLKEVDD AQQDPIDWVK TALDEGRRVS GFGHRVYNVK DPRAKILSQR SKELGEAAGS LKWYEMSTAI EDYLKAEKGL APNVDFYSAS TYYQMGIPID IYTPIFAMSR VGGWTAHVLE QYENNRLIRP RARYVGPTDQ TFVPLDER
|
| |