Gene Mpe_A2164 details

Gene Information

Locus tag: Mpe_A2164
Symbol:
ID: 4784853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2321520
End bp: 2322959
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 68%
IMG OID: 640090732
Product: isopropylmalate isomerase large subunit
Protein accession: YP_001021355
Protein GI: 124267351
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
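
The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is illustrative only and is not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 2321520
end_bp = 2322959
protein_length_aa = 479

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1440
print(remainder)                # 0
print(expected_protein_length)  # 479
```

Under these assumptions the computed values agree with the reported Gene Length (1440 bp) and Protein Length (479 aa).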


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.170701
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTGTTGA CGCCTAAAAT CGAGGCCATG GGACGCACTC TGTACGACAA GCTGTGGGAC 
GAACACGTCG TCCACTCCGA GGACGACGGC ACCGCCGTGC TCTACATCGA CCGACACCTG
GTGCACGAGG TGACGAGCCC GCAGGCCTTC GAAGGCTTGG ACCTCGCCGG CCGCAAGATC
TGGCGACTTT CGGCCAATCT GGCGGTGAGC GACCACAACG TGCCGACCAC CGACCGATCC
CGGGGCATCG CCGACCCGGT GTCGCGCCTG CAGGTCGACA CGCTGGACGC CAACTGCGAC
CGCTTCGGCA TCACGCAGTT CAAGATGAAC GATCGTCGCC AGGGCATCGT GCACGTGATC
GGGCCGGAGC AGGGCGCCAC GCTGCCGGGC ATGACGGTGG TGTGCGGTGA TTCGCACACC
AGCACCCACG GCGCCTTCGG TGCGCTGGCG CATGGCATCG GCACCAGCGA GGTCGAGCAC
GTGCTTGCGA CTCAGACGCT GCTCGCCAAG AAGGCGAAGA ACCTGCTGGT GCGGGTGGAC
GGCGTGCTAC CGGCCGGCTG CAGCGCCAAG GACATCGTGC TGGCGATCAT CGGTCGCATC
GGCACGGCCG GCGGCAACGG CCATACCATC GAGTTCGGTG GCTCGGCGAT TCGCGCGCTG
AGCATGGAAG GCCGCATGAC GGTGTGCAAC ATGGCCATCG AGGCCGGCGC GAGGGCCGGC
CTGGTCGCGG TGGACGACAC GACGATCCAG TACGTGAAGG GGCGGCCGTT CTCGCCGTCA
GGTGTGGAGT GGGAGCACGC GGTCGCCTAC TGGCGCACGC TGCATTCCGA CGAGGATGCC
GTCTTTGATC GCGTGGTCGA ACTCGATGCG GGTCAGATCG CACCGCAGGT CACCTGGGGC
ACCTCGCCCG AGATGGTGCT TTCGATCAAC GACCGGGTGC CCGATCCGGA CCGCGAGAAG
GATGCTGGCA AGCGCGGCGC CATCGAGCGC GCGCTGACCT ACATGTCGCT CGAGCCGAAC
AAGCCGATCG GTGACATCCG CATCGACAAG GTGTTCATCG GCTCGTGTAC CAACTCCCGC
ATCGAGGACC TGCGCGAGGC CGCCGCGGTG GTGCGGCGCG TCGGCGGGCG CATCGCCGGC
AACGTGAAGC TGGCGCTGGT CGTGCCGGGT TCCGGGCTGG TCAAGGCGCA GGCCGAGCGC
GAAGGGCTCG ATGCGGTGTT CAAGGCGGCC GGCTTCGAAT GGCGGGAGCC GGGCTGCTCG
ATGTGCCTGG CGATGAACGC CGACCGCCTC GAGCCCGGAG AACGGTGCGC GTCTACCAGC
AACCGCAACT TCGAGGGTCG CCAGGGCGCC GGCGGCCGCA CGCACCTCGT GAGCCCCGCG
ATGGCCGCCG CCGCCGCCAT GGAAGGGCAT TTCGTCGACG TCCGGCGCAT TGCCGCCTGA
 
Protein sequence
MLLTPKIEAM GRTLYDKLWD EHVVHSEDDG TAVLYIDRHL VHEVTSPQAF EGLDLAGRKI 
WRLSANLAVS DHNVPTTDRS RGIADPVSRL QVDTLDANCD RFGITQFKMN DRRQGIVHVI
GPEQGATLPG MTVVCGDSHT STHGAFGALA HGIGTSEVEH VLATQTLLAK KAKNLLVRVD
GVLPAGCSAK DIVLAIIGRI GTAGGNGHTI EFGGSAIRAL SMEGRMTVCN MAIEAGARAG
LVAVDDTTIQ YVKGRPFSPS GVEWEHAVAY WRTLHSDEDA VFDRVVELDA GQIAPQVTWG
TSPEMVLSIN DRVPDPDREK DAGKRGAIER ALTYMSLEPN KPIGDIRIDK VFIGSCTNSR
IEDLREAAAV VRRVGGRIAG NVKLALVVPG SGLVKAQAER EGLDAVFKAA GFEWREPGCS
MCLAMNADRL EPGERCASTS NRNFEGRQGA GGRTHLVSPA MAAAAAMEGH FVDVRRIAA
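
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG can act as an alternative initiation codon that is read as methionine. The following minimal sketch shows how the first and last blocks of the gene sequence map onto the protein sequence. It is an illustration, not the pipeline used to produce this record: `translate_cds` is a hypothetical helper, the amino-acid string follows the NCBI codon ordering (bases in the order T, C, A, G), and `START_CODONS` is assumed to be only a subset of the initiation codons that table 11 allows.

```python
from itertools import product

# Codon table in NCBI order: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons accepted under table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
first_30 = "TTGTTGTTGA CGCCTAAAAT CGAGGCCATG"
print(translate_cds(first_30))  # MLLTPKIEAM

# Last 60 bases of the gene sequence (in frame, since 1380 is divisible by 3).
last_60 = "ATGGCCGCCG CCGCCGCCAT GGAAGGGCAT TTCGTCGACG TCCGGCGCAT TGCCGCCTGA"
print(translate_cds(last_60))   # MAAAAAMEGHFVDVRRIAA
```

Both outputs match the corresponding ends of the protein sequence listed above; the final TGA codon is a stop and contributes no residue.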