Gene GM21_1504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1504 
Symbol 
ID8136833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1758571 
End bp1759854 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content64% 
IMG OID644869116 
Product3-isopropylmalate dehydratase large subunit 
Protein accessionYP_003021318 
Protein GI253700129 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit
[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value0.691126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATGA CTACGGCTCA AAAAATATTT GCGGCCCATC TTGTTGATGA GCCTTTTGCT 
GGCACCAAGG TGCTCAGCAT CGACGTCGTG ATGTGCCACG AGATCACGAC CCCGATCGCC
ATCGCCGACC TGATGGCGCG CGGCAAGGAC CGGGTTTTCG ACCCGAGCAA GATCAAGGCG
GTCATCGACC ACGTAACGCC GAGCAAGGAC AGCAAGACCG CCACCCAGGC GAAGATGCTG
CGCGACTGGG CCAGGCGGCA CGACATCAAG GACTTCTTCG ACATCGGGGC CAACGGCGTC
TGCCACGCGC TCTTCCCGGA GAAGGGTTTC ATCCGTCCGG GGAACACGGT GATCATGGGC
GACTCCCATA CCTGTACCCA CGGCGCCTTC GGGGCCTTCG CCGCCGGCGT CGGCACCACC
GACCTGGAAG TGGGGATCCT CAAGGGGGTC TGCGCCTTCC GCGAGCCCAA GACCATCCGC
GTCAACCTGA ACGGCACCCT CCCCAAAGGG GTTTTCGCGA AGGACGCCAT CCTGCGCGTG
ATCGGGCACC TGGGTGTTAA CGGCGCCACC GATCGTGTCA TCGAGTTCGG CGGACCGGTC
GTGGCCCGGA TGACCATGGA ATCGAGGATG ACGCTTTGCA ACATGGCGAT CGAGGCGGGG
GGCACCTCCG GCATCTGCAT GCCGGACCAG GTCACCGTCG ATTACCTCTG GCCCTTCATC
TCCGGATCCT TCGGCTCGAA GGAAGAGGCG CTTGCCGCTT ACAGCGTCTG GTGCTCGGAC
GCCGACGCCG CCTACGAGCA GGTGATCGAT CTCGATCTTT CCGACCTCGC CCCGCTTTGC
ACCTTCGGCT ACAAGCCGGA CCAGGTGAAG AGTGTGACCG AGATGGCCGG CACCCAGGTG
GACCAGGTTT ATCTCGGATC CTGCACCAAC GGCCGGTTGG AAGACCTCAG GGTCGCGGCC
CAGATCCTCA AGGGGAAGAA GATCGCCTCC CACGTGCGCG CCATCCTTTC TCCGGCGACG
CCGCAGATCT ACAAGGACGC GGTCGCCGAA GGGCTGATCC AGATCTTCAT GGACGCAGGC
TTCTGCGTCA CCAACCCGAC CTGCGGCGCC TGCCTCGGCA TGAGCAACGG CGTCCTCGCC
GAAGGCGAGG TCTGCGCCTC CACCACCAAC CGCAACTTCA TGGGGCGGAT GGGCAAGGGG
GGAATGGTGC ACCTGCTGTC GCCGGCGACC GCGGCTGCCT CCGCCATCGA GGGTAAGATC
GCTGACCCGC GCAACTACCT GTAA
 
Protein sequence
MGMTTAQKIF AAHLVDEPFA GTKVLSIDVV MCHEITTPIA IADLMARGKD RVFDPSKIKA 
VIDHVTPSKD SKTATQAKML RDWARRHDIK DFFDIGANGV CHALFPEKGF IRPGNTVIMG
DSHTCTHGAF GAFAAGVGTT DLEVGILKGV CAFREPKTIR VNLNGTLPKG VFAKDAILRV
IGHLGVNGAT DRVIEFGGPV VARMTMESRM TLCNMAIEAG GTSGICMPDQ VTVDYLWPFI
SGSFGSKEEA LAAYSVWCSD ADAAYEQVID LDLSDLAPLC TFGYKPDQVK SVTEMAGTQV
DQVYLGSCTN GRLEDLRVAA QILKGKKIAS HVRAILSPAT PQIYKDAVAE GLIQIFMDAG
FCVTNPTCGA CLGMSNGVLA EGEVCASTTN RNFMGRMGKG GMVHLLSPAT AAASAIEGKI
ADPRNYL