Gene Mboo_2075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2075 
Symbol 
ID5409784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2150344 
End bp2151594 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content58% 
IMG OID640869320 
Product3-isopropylmalate dehydratase 
Protein accessionYP_001405232 
Protein GI154151614 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCAA CAATCGTAGA GAAGATCTTC TCACGGAAAT GCGGCAGCGA TATTAAAGCA 
GGCGAAGTCG TCATGGCACC CATTGACGGG GCCATGATCC ACGACATTAC CGGCCCGCTT
GCCATCCGGA AGTTCTACGA GATGGGCGGA AAACAGGTCT TTGACCCAAA AAGCGTCATC
ATGCTCTTTG ACCACCAGAT ACCGGCCGAC TCGCTTGAGG CTGCGGAGAA CCATGTCTTC
ATGCGGAAGT TTGCCGCAGA GCAGGGTATC CACAACTATG ATATCAACGA GGGCGTCTGC
CACCAGGTGG TCCTGGAGAA GGGCAGGGCA GCGCCCGGCG AGATCGTGGT CGGGGCAGAT
TCCCACACCT GCATGTACGG GGCGGTGGGG GCATTTGCCA CCGGTATCGG GTCTACAGAC
ATGGGCTTTG CGCTCAAGTT CGGGGCCCTG TACTTCAAGG TGCCGGAGAC TGTTCGGGCG
GAGATTTCGG GAAAGTTCCA GAAACGTGTT GGCGCAAAGG ATCTGATCCT CTCAATTGCG
GCAGATATCG GAGCTGACGG CGCGACCTAC CAGGCAATCG AGTTTACCGG TAAAACTATC
TCTAAAATGG ACATGGCCGG ACGGATGACC TGCTGCAACA TGGCCATCGA GATGGGGGCA
AAGGCAGGAA TAGTCCCTCC CGACAAGGTG ACCTGGGAGT ACATGAAGGG CCGGCGGAAG
ATAACGCCGT TTGCGCTTGC AAGTGACGAC GATGCAAAGT TCGCAGAGAA GCGGACATAC
AATGTCGCGG ATCTCGAACC ACAGGTTGCG GTCCCCCACA ACGTTGATCA GGGAATCCCG
GTAGGTAAGG TTGAAGGAAC CCACATCGAC CAGGTCTTCA TCGGCTCGTG TACGAACGGC
CGGTACGAGG ATCTCGTGGA GGCAGCAGAG GTGCTGGGCA AAAAGAAATT CGACCCGAAG
GTCCGGGTGC TTGTCATCCC CGCGTCCCGG GACGAGTATC TCAAGGCGCT TGAAGCGGGG
CTCGTTGAAC GGTTCGTCAG AGCCGGGGCC CTTGTCGAGG CCCCCTGCTG CGGACCCTGC
ATGGGCGGAT CGTTTGGCCT GATAGCAGCT GGCGAGGTTT CGCTTGCCAC CTCAAATCGG
AACTTCCGGG GCCGGCAGGG AAGCACGGAG GGGAAAGTAT ACCTCTGCTC GCCGGCTACG
GCTGCTGCAA GTGCAATAAA GGGTGAAATC ACCGATCCAA GGGAGGTGTA A
 
Protein sequence
MGSTIVEKIF SRKCGSDIKA GEVVMAPIDG AMIHDITGPL AIRKFYEMGG KQVFDPKSVI 
MLFDHQIPAD SLEAAENHVF MRKFAAEQGI HNYDINEGVC HQVVLEKGRA APGEIVVGAD
SHTCMYGAVG AFATGIGSTD MGFALKFGAL YFKVPETVRA EISGKFQKRV GAKDLILSIA
ADIGADGATY QAIEFTGKTI SKMDMAGRMT CCNMAIEMGA KAGIVPPDKV TWEYMKGRRK
ITPFALASDD DAKFAEKRTY NVADLEPQVA VPHNVDQGIP VGKVEGTHID QVFIGSCTNG
RYEDLVEAAE VLGKKKFDPK VRVLVIPASR DEYLKALEAG LVERFVRAGA LVEAPCCGPC
MGGSFGLIAA GEVSLATSNR NFRGRQGSTE GKVYLCSPAT AAASAIKGEI TDPREV