Gene Information |
Locus tag | GM21_1504 |
Symbol | |
ID | 8136833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1758571 |
End bp | 1759854 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644869116 |
Product | 3-isopropylmalate dehydratase large subunit |
Protein accession | YP_003021318 |
Protein GI | 253700129 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR00170] 3-isopropylmalate dehydratase, large subunit; [TIGR01343] homoaconitate hydratase family protein; [TIGR02086] 3-isopropylmalate dehydratase, large subunit |
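The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in exactly one stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record for GM21_1504.
length = gene_length_bp(1758571, 1759854)
print(length)                     # 1284
print(protein_length_aa(length))  # 427
```

Both results agree with the Gene Length (1284 bp) and Protein Length (427 aa) fields of the record.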
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 0.691126 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTATGA CTACGGCTCA AAAAATATTT GCGGCCCATC TTGTTGATGA GCCTTTTGCT GGCACCAAGG TGCTCAGCAT CGACGTCGTG ATGTGCCACG AGATCACGAC CCCGATCGCC ATCGCCGACC TGATGGCGCG CGGCAAGGAC CGGGTTTTCG ACCCGAGCAA GATCAAGGCG GTCATCGACC ACGTAACGCC GAGCAAGGAC AGCAAGACCG CCACCCAGGC GAAGATGCTG CGCGACTGGG CCAGGCGGCA CGACATCAAG GACTTCTTCG ACATCGGGGC CAACGGCGTC TGCCACGCGC TCTTCCCGGA GAAGGGTTTC ATCCGTCCGG GGAACACGGT GATCATGGGC GACTCCCATA CCTGTACCCA CGGCGCCTTC GGGGCCTTCG CCGCCGGCGT CGGCACCACC GACCTGGAAG TGGGGATCCT CAAGGGGGTC TGCGCCTTCC GCGAGCCCAA GACCATCCGC GTCAACCTGA ACGGCACCCT CCCCAAAGGG GTTTTCGCGA AGGACGCCAT CCTGCGCGTG ATCGGGCACC TGGGTGTTAA CGGCGCCACC GATCGTGTCA TCGAGTTCGG CGGACCGGTC GTGGCCCGGA TGACCATGGA ATCGAGGATG ACGCTTTGCA ACATGGCGAT CGAGGCGGGG GGCACCTCCG GCATCTGCAT GCCGGACCAG GTCACCGTCG ATTACCTCTG GCCCTTCATC TCCGGATCCT TCGGCTCGAA GGAAGAGGCG CTTGCCGCTT ACAGCGTCTG GTGCTCGGAC GCCGACGCCG CCTACGAGCA GGTGATCGAT CTCGATCTTT CCGACCTCGC CCCGCTTTGC ACCTTCGGCT ACAAGCCGGA CCAGGTGAAG AGTGTGACCG AGATGGCCGG CACCCAGGTG GACCAGGTTT ATCTCGGATC CTGCACCAAC GGCCGGTTGG AAGACCTCAG GGTCGCGGCC CAGATCCTCA AGGGGAAGAA GATCGCCTCC CACGTGCGCG CCATCCTTTC TCCGGCGACG CCGCAGATCT ACAAGGACGC GGTCGCCGAA GGGCTGATCC AGATCTTCAT GGACGCAGGC TTCTGCGTCA CCAACCCGAC CTGCGGCGCC TGCCTCGGCA TGAGCAACGG CGTCCTCGCC GAAGGCGAGG TCTGCGCCTC CACCACCAAC CGCAACTTCA TGGGGCGGAT GGGCAAGGGG GGAATGGTGC ACCTGCTGTC GCCGGCGACC GCGGCTGCCT CCGCCATCGA GGGTAAGATC GCTGACCCGC GCAACTACCT GTAA
|
Protein sequence | MGMTTAQKIF AAHLVDEPFA GTKVLSIDVV MCHEITTPIA IADLMARGKD RVFDPSKIKA VIDHVTPSKD SKTATQAKML RDWARRHDIK DFFDIGANGV CHALFPEKGF IRPGNTVIMG DSHTCTHGAF GAFAAGVGTT DLEVGILKGV CAFREPKTIR VNLNGTLPKG VFAKDAILRV IGHLGVNGAT DRVIEFGGPV VARMTMESRM TLCNMAIEAG GTSGICMPDQ VTVDYLWPFI SGSFGSKEEA LAAYSVWCSD ADAAYEQVID LDLSDLAPLC TFGYKPDQVK SVTEMAGTQV DQVYLGSCTN GRLEDLRVAA QILKGKKIAS HVRAILSPAT PQIYKDAVAE GLIQIFMDAG FCVTNPTCGA CLGMSNGVLA EGEVCASTTN RNFMGRMGKG GMVHLLSPAT AAASAIEGKI ADPRNYL
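The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the Python below strips that spacing, computes a GC fraction, and translates a nucleotide sequence up to the first stop codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons); the helper names are hypothetical, and only the first blocks of the gene sequence are used in the example.

```python
from itertools import product

# Codon order follows the NCBI genetic-code layout (bases cycled as T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (allowed by table 11) are not treated specially
    here; this gene starts with ATG, so that case does not arise.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGGGTATGA CTACGGCTCA AAAAATATTT"
print(translate(prefix))                             # MGMTTAQKIF
print(gc_fraction("ATGGGTATGA CTACGGCTCA"))          # 0.5
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` is one way to compare against the GC content and protein sequence fields.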