Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_2075 |
Symbol | |
ID | 5409784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 2150344 |
End bp | 2151594 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640869320 |
Product | 3-isopropylmalate dehydratase |
Protein accession | YP_001405232 |
Protein GI | 154151614 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR01343] homoaconitate hydratase family protein [TIGR02086] 3-isopropylmalate dehydratase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATCAA CAATCGTAGA GAAGATCTTC TCACGGAAAT GCGGCAGCGA TATTAAAGCA GGCGAAGTCG TCATGGCACC CATTGACGGG GCCATGATCC ACGACATTAC CGGCCCGCTT GCCATCCGGA AGTTCTACGA GATGGGCGGA AAACAGGTCT TTGACCCAAA AAGCGTCATC ATGCTCTTTG ACCACCAGAT ACCGGCCGAC TCGCTTGAGG CTGCGGAGAA CCATGTCTTC ATGCGGAAGT TTGCCGCAGA GCAGGGTATC CACAACTATG ATATCAACGA GGGCGTCTGC CACCAGGTGG TCCTGGAGAA GGGCAGGGCA GCGCCCGGCG AGATCGTGGT CGGGGCAGAT TCCCACACCT GCATGTACGG GGCGGTGGGG GCATTTGCCA CCGGTATCGG GTCTACAGAC ATGGGCTTTG CGCTCAAGTT CGGGGCCCTG TACTTCAAGG TGCCGGAGAC TGTTCGGGCG GAGATTTCGG GAAAGTTCCA GAAACGTGTT GGCGCAAAGG ATCTGATCCT CTCAATTGCG GCAGATATCG GAGCTGACGG CGCGACCTAC CAGGCAATCG AGTTTACCGG TAAAACTATC TCTAAAATGG ACATGGCCGG ACGGATGACC TGCTGCAACA TGGCCATCGA GATGGGGGCA AAGGCAGGAA TAGTCCCTCC CGACAAGGTG ACCTGGGAGT ACATGAAGGG CCGGCGGAAG ATAACGCCGT TTGCGCTTGC AAGTGACGAC GATGCAAAGT TCGCAGAGAA GCGGACATAC AATGTCGCGG ATCTCGAACC ACAGGTTGCG GTCCCCCACA ACGTTGATCA GGGAATCCCG GTAGGTAAGG TTGAAGGAAC CCACATCGAC CAGGTCTTCA TCGGCTCGTG TACGAACGGC CGGTACGAGG ATCTCGTGGA GGCAGCAGAG GTGCTGGGCA AAAAGAAATT CGACCCGAAG GTCCGGGTGC TTGTCATCCC CGCGTCCCGG GACGAGTATC TCAAGGCGCT TGAAGCGGGG CTCGTTGAAC GGTTCGTCAG AGCCGGGGCC CTTGTCGAGG CCCCCTGCTG CGGACCCTGC ATGGGCGGAT CGTTTGGCCT GATAGCAGCT GGCGAGGTTT CGCTTGCCAC CTCAAATCGG AACTTCCGGG GCCGGCAGGG AAGCACGGAG GGGAAAGTAT ACCTCTGCTC GCCGGCTACG GCTGCTGCAA GTGCAATAAA GGGTGAAATC ACCGATCCAA GGGAGGTGTA A
|
Protein sequence | MGSTIVEKIF SRKCGSDIKA GEVVMAPIDG AMIHDITGPL AIRKFYEMGG KQVFDPKSVI MLFDHQIPAD SLEAAENHVF MRKFAAEQGI HNYDINEGVC HQVVLEKGRA APGEIVVGAD SHTCMYGAVG AFATGIGSTD MGFALKFGAL YFKVPETVRA EISGKFQKRV GAKDLILSIA ADIGADGATY QAIEFTGKTI SKMDMAGRMT CCNMAIEMGA KAGIVPPDKV TWEYMKGRRK ITPFALASDD DAKFAEKRTY NVADLEPQVA VPHNVDQGIP VGKVEGTHID QVFIGSCTNG RYEDLVEAAE VLGKKKFDPK VRVLVIPASR DEYLKALEAG LVERFVRAGA LVEAPCCGPC MGGSFGLIAA GEVSLATSNR NFRGRQGSTE GKVYLCSPAT AAASAIKGEI TDPREV
|
| |