Gene Information |
Locus tag | Hmuk_3412 |
Symbol | |
ID | 8409490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013201 |
Strand | + |
Start bp | 216129 |
End bp | 216980 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645018333 |
Product | 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase |
Protein accession | YP_003175854 |
Protein GI | 257373080 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.243982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.910651 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTCG TTCGATACAC CGACGGAACG ACCCCAGCGT GGGGTCTGGA ACGCGAACAG ACGATTCACG CACTGTCCGA TCTCCCGTGG GGAGAGCCGT CACTGAACGA CCTCGCGAAC CCCAGCTACC GATCACACGT CGCAGCCCGC ATCGAGAACG GGGCGTCGAC GCAGATCGAT CCCACCGCTG TGTCGCTGCT CGCACCGGTG CCACAGCCGG GCAAGATCGT CTGCTGTGGC CTCAACTACC ACGATCACGC CCAGGAGCAA GACGAGACGG TGCCGGACAG CCCGATGCTG TTCGGGAAGG CCCCGACCGC CGTCACCAAC CCGGCAGATC CGATCGTCCA CCCCGATCCC GAGGGGCCGC CCCAGGTCGA CTACGAGGTC GAACTGGCCG TCGTCGTCGG CGACACGATC TCCTCGGTCG ACGAGGCCGA CGCCTACGAC CACATCGCCG GCTACACCGT GCTCAACGAC GTGAGCGAGC GAACGGCCCA GAACGAAGAC GGGCAGTTCT TCCGGGGCAA GAGCTACGAC ACGTTCGCCC CGATGGGGCC GCGACTGGTC ACCGGCGACG ACATCGATCC GAACGCCCTC GACGTGGAGT TGCGCGTCGA CGGCGAGACG AAGCAGTCCT CGAACACCGA GCAGTTCATC TTCGACGTGG GCGAGCTCGT CGCCTACATC AGCGATGCCA TGACGCTGCG GCCCGGCGAC GTGATCTCGA CGGGGACGCC CGGCGGCGTC GGCATCTTCC GCGATCCGGT CGAGGTGCTG GAACCGGGCC AGACGGTCGA GGCAGAGATC GAAGGTATCG GAACGCTTCG GAATCCGGTC GTCGGCCGAT AG
|
Protein sequence | MRFVRYTDGT TPAWGLEREQ TIHALSDLPW GEPSLNDLAN PSYRSHVAAR IENGASTQID PTAVSLLAPV PQPGKIVCCG LNYHDHAQEQ DETVPDSPML FGKAPTAVTN PADPIVHPDP EGPPQVDYEV ELAVVVGDTI SSVDEADAYD HIAGYTVLND VSERTAQNED GQFFRGKSYD TFAPMGPRLV TGDDIDPNAL DVELRVDGET KQSSNTEQFI FDVGELVAYI SDAMTLRPGD VISTGTPGGV GIFRDPVEVL EPGQTVEAEI EGIGTLRNPV VGR
|
| |
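The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a complete CDS that ends in a stop codon (the gene sequence above ends in TAG). The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces (10-base display blocks) are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Start bp, End bp, and Gene Length fields of this record.
print(gene_length(216129, 216980))  # 852
print(protein_length(852))          # 283
```

Passing the gene sequence string from the Sequence table to gc_fraction gives a value to compare against the reported GC content field.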