Gene GM21_3353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3353 
Symbol 
ID8138720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3880134 
End bp3881222 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content62% 
IMG OID644870971 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_003023136 
Protein GI253701947 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value0.133211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAGC TTTTTAAAGT GGCGGTATTG CCAGGAGACG GCATAGGTCC CGAGGTTATG 
GCGGAAGCAC TGAGGGTGCT CGATGCGGTT GAGAAACGTT ACGAAGTCAC TTTCGAGCGG
ACCCACGCCA ACGTAGGCGG AGCGGGCATC GACCTGGAAG GTCGTGCGCT TCCCGAGACC
ACGGTAAATA TATGCAAGGC TTCGGACGCC ATCCTTTTCG GCTCCGTAGG CGGACCCAAG
TGGGAAACCC TTCCCCCGGA CGAGCAGCCC GAGCGCGGCG CCCTGCTGCC GCTTCGCAAG
ATCTTCGGCC TCTACGCCAA CCTGCGTCCG GCCATCATCT TCCCGTCGCT CACCAGCGCC
TCCTCGCTGA AGGAAGAGGT GATCGCAGGG GGCTTCGACA TCCTGGTGAT CCGCGAATTG
ACCGGCGGCA TCTACTTCTC CCAGCCCAAA GGGATCGAAG GCGAGGGGCG CAACCGCGTC
GGCGTCGACA CCATGCGCTA CAGCGTCCCC GAGATCGAGC GCATCGCGCA CGTGGCCTTC
CAGGCGGCGA GAAAGCGCGG CAAGAAGGTC TGCTCCATCG ACAAGGCCAA CGTTCTTTCC
AGCTCCGTCC TTTGGCGCGA GATAGTGATC AACATCGCCA ACGAATACCC GGACGTCGAG
CTCTCCCACA TGTACGTGGA CAACGCCGCG ATGCAGCTCG TTAAGTGGCC CAAGCAGTTC
GACGTGATCC TTTGCGAGAA CATGTTCGGC GACATTCTCT CGGACGAGGC GGCCATGCTG
ACCGGCTCTT TGGGGATGCT TCCCTCCGCC TCGCTGGCCG AGGGGACCTT CGGCATGTAC
GAGCCCTCCG GCGGGAGCGC CCCGGACATC GCAGGGCAGG GGATCGCCAA CCCGATCGCC
CAGATCCTCT CCGCGGGGAT GATGCTCCGT TACTCCTTCG GCATGATCGA GGCGGCCGAC
GCCATCGACA ACGCCGTCGC CAAGGTACTC GACGGCGGTT TCCGCACCAG GGACATCTAT
CAGGAGAAGG CAGGCGAGAA GCTGGTGAAC ACCAAGGAGA TCGGCGACGC CATCATCGCC
AATCTCTGA
 
Protein sequence
MGKLFKVAVL PGDGIGPEVM AEALRVLDAV EKRYEVTFER THANVGGAGI DLEGRALPET 
TVNICKASDA ILFGSVGGPK WETLPPDEQP ERGALLPLRK IFGLYANLRP AIIFPSLTSA
SSLKEEVIAG GFDILVIREL TGGIYFSQPK GIEGEGRNRV GVDTMRYSVP EIERIAHVAF
QAARKRGKKV CSIDKANVLS SSVLWREIVI NIANEYPDVE LSHMYVDNAA MQLVKWPKQF
DVILCENMFG DILSDEAAML TGSLGMLPSA SLAEGTFGMY EPSGGSAPDI AGQGIANPIA
QILSAGMMLR YSFGMIEAAD AIDNAVAKVL DGGFRTRDIY QEKAGEKLVN TKEIGDAIIA
NL