Gene Mpal_0185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0185 
Symbol 
ID7270956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp215437 
End bp216696 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content60% 
IMG OID643568840 
Producthomoaconitate hydratase family protein 
Protein accessionYP_002465297 
Protein GI219850865 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.576372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0771341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCAA TGGGAGCAAC AATAGCGGAG AAGATATTCT CAACCCGATG CGGAAGGCCG 
GTCCACGCAG GTGACGTGGT GATGGCTCCG ATAGATGCAG CGATGATCCA TGATATCACA
GGCCCGCTTG CCATTCAGAC TTTTTACCAG ATGGGTGGCA CCAGGGTTTT TGACCCAAAG
AAGGTGATCA TGCTCTTTGA TCATCAGATC CCTGCAGACT CCATTGCAGC AGCAGGAAAC
CATCAACTGA TGCGAAAGTT TGCAGCAGAA CAGGGGATCC ACAACTATGA CCTCCACGAG
GGGGTCTGCC ATCAGGTGGT TCTTGAAAAG GGAAGGGCTG GGCCTGGTGA GATCGTGGTC
GGGTCCGACT CGCACACCTG CATGTATGGT GCCGCAGGGG CGTTTGCAAC CGGAATAGGG
TCGACCGATA TGGGTTTTGT CCTGAAGTTC GGGGCCCTCT ACTTCCGGGT GCCCGAGACG
ATCAGGATGA CTATCGACGG TGCCTTCCAG CGCCGGGTCG GTCCCAAGGA TCTGATCCTC
TCGATCATCG GGGATATCGG TGCTGACGGG GCCACCTATA AGGCGGTGGA GTTTGCAGGG
TCGACGATCC GGGGGATGGA GATGCCTGGG CGGATGACTC TTTCGAATAT GGCCATCGAG
ATGGGGGGCA AGGCCGGGAT CGTCCCGCCT GATCAGGTGA CCTGGGATTA CCTGAAGTCA
AGGCGGCAGG TCACGCCGTT TGAACTGGAC AGTGACGAGG ACGCGACCTT TGCCGATCAG
CGACGGTATG ATGTGACGAA CCTTGTACCC AAAGTCGCCG TCCCGCACAA CGTGGACCAC
GTGGTCGACG TGACTGAAGT GGCAGGGACG CACCTCGACC AGGTTTTCAT CGGATCGTGC
ACCAACGGGC GGTTCGAGGA TCTTGCAGAG GCCGCGGCCG TCCTCGGCGA TCGGAATTTC
TCCGAGGATC TCCGTGTGCT CGTCATCCCG GCATCAAGGG ATGAATACTT GAAGACGCTG
CGGGCCGGGC TGATCGAGCG GTTCGTCGAG GCCGGGGCGA TGGTCGAGGC GCCGTGCTGC
GGGCCGTGTA TGGGCGGATC GTTCGGGCTG ATCGGCCCGG GCGAGGCTTC TCTCTCCACC
TCGAACCGGA ACTTTAGGGG CCGGCAGGGG TCGGCCGAGG GATCGGTGTA CCTGGCCTCG
GCGGCGACGG CTGCAGCGAG TGCGATCACC GGTGTGATCA CCGATCCGAG GGAGGTGTGA
 
Protein sequence
MNAMGATIAE KIFSTRCGRP VHAGDVVMAP IDAAMIHDIT GPLAIQTFYQ MGGTRVFDPK 
KVIMLFDHQI PADSIAAAGN HQLMRKFAAE QGIHNYDLHE GVCHQVVLEK GRAGPGEIVV
GSDSHTCMYG AAGAFATGIG STDMGFVLKF GALYFRVPET IRMTIDGAFQ RRVGPKDLIL
SIIGDIGADG ATYKAVEFAG STIRGMEMPG RMTLSNMAIE MGGKAGIVPP DQVTWDYLKS
RRQVTPFELD SDEDATFADQ RRYDVTNLVP KVAVPHNVDH VVDVTEVAGT HLDQVFIGSC
TNGRFEDLAE AAAVLGDRNF SEDLRVLVIP ASRDEYLKTL RAGLIERFVE AGAMVEAPCC
GPCMGGSFGL IGPGEASLST SNRNFRGRQG SAEGSVYLAS AATAAASAIT GVITDPREV