Gene Mext_4355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4355 
Symbol 
ID5831574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4843765 
End bp4845207 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content69% 
IMG OID641370147 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_001641795 
Protein GI163853752 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCCTT CAACCCGTCA GAATAGCTGC ATCATGACCG CGCCGCGCAC CCTCTACGAC 
AAGATCTGGG ACGACCACGT CGTCGATGTC GAGCCGGATG GCTCCGCCCT CCTCTACATC
GACCGCCACC TCGTGCACGA GGTGACGAGC CCCCAGGCGT TCGAGGGGCT TCGCGTCGCC
GGCCGCACGG TGCGCGCGCC GCACAAGACG CTGGCCGTCG TCGATCACAA CGTTCAGACC
TCGGACCGCT CCAAGGGCAT CGAGGATCCC GAGAGCCGCA CCCAGCTCGA GGCGCTCGCC
GAGAACGTGC GCGACTTCGG CATCGAGTTC TACGATGCCC TCGACCAGCG CCAGGGCATC
GTCCACATCA TCGGGCCGGA GCAGGGCTTC ACCCTGCCGG GCCAGACGAT CGTGTGCGGC
GATTCCCACA CCTCGACCCA CGGCGCCTTC GGCGCGCTGG CCCACGGCAT CGGCACGTCG
GAGGTCGAGC ACGTGCTCGC CACGCAGACG CTGATCCAGC GCAAGGCCAA GAACATGCGG
GTGACGGTCG ACGGCACCCT GCCGCGGGGC GTCAGCGCCA AGGACATCGT GCTCGCCATC
ATCGGCGAGA TCGGCACCGC CGGCGGCACC GGCCACGTCA TCGAGTATGC CGGCGAGGCA
ATCCGCGCCC TCTCGATGGA AGGCCGGATG ACGATCTGCA ACATGTCGAT CGAGGGCGGC
GCCCGCGCCG GCATGGTCGC GCCCGACGAG ACCACCTACG CCTACGTCAA CGGCCGGCCG
AAGGCGCCGA AGGGCGCGGC GTTCGACGCG GCCCGCCGCT ACTGGGAGAG CCTGGCCACG
GACGAAGGCG CGCATTTCGA CCGCGAGATC CGTCTCGACG CCGCCAACCT CCCCCCGCTG
GTCTCCTGGG GCACGAGCCC TGAGGACATC GTCTCGATCC TCGGCACGGT GCCCGATCCG
GCCCAGATCG CGGATGAGAA CAAGCGCCAG TCCAAGGAGA AGGCGCTGGC CTATATGGGC
CTGACGCCGG GAACCCGGAT GACCGACGTC ACCCTCGACC GGGTGTTCAT CGGCTCCTGC
ACCAATGGCC GCATCGAGGA TCTGCGGATC GTCGCCAAGA TGGTCGAGGG CCGCAAGGTG
CATGACAGCG TCTCGGCGAT GGTGGTGCCG GGCTCCGGAC TGGTGAAGGC GCAGGCCGAA
GCCGAGGGGA TCGACCGCAT CCTGAAGGAT GCCGGCTTCG ATTGGCGCGA GCCTGGCTGC
TCGATGTGCC TCGGCATGAA CCCGGACAAG CTGCGGCCGG GCGAGCGCTG CGCCTCGACC
TCCAACCGCA ACTTCGAGGG CCGCCAGGGC CCGCGCGGCC GCACCCACCT CGTCTCCCCG
GCAATGGCGG CCGCCGCAGC GGTGGCTGGC CGCTTCGTCG ACATCCGCGA GTGGCGCGGC
TGA
 
Protein sequence
MEPSTRQNSC IMTAPRTLYD KIWDDHVVDV EPDGSALLYI DRHLVHEVTS PQAFEGLRVA 
GRTVRAPHKT LAVVDHNVQT SDRSKGIEDP ESRTQLEALA ENVRDFGIEF YDALDQRQGI
VHIIGPEQGF TLPGQTIVCG DSHTSTHGAF GALAHGIGTS EVEHVLATQT LIQRKAKNMR
VTVDGTLPRG VSAKDIVLAI IGEIGTAGGT GHVIEYAGEA IRALSMEGRM TICNMSIEGG
ARAGMVAPDE TTYAYVNGRP KAPKGAAFDA ARRYWESLAT DEGAHFDREI RLDAANLPPL
VSWGTSPEDI VSILGTVPDP AQIADENKRQ SKEKALAYMG LTPGTRMTDV TLDRVFIGSC
TNGRIEDLRI VAKMVEGRKV HDSVSAMVVP GSGLVKAQAE AEGIDRILKD AGFDWREPGC
SMCLGMNPDK LRPGERCAST SNRNFEGRQG PRGRTHLVSP AMAAAAAVAG RFVDIREWRG