Gene Mpe_A2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2101 
Symbol 
ID4784320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2248269 
End bp2249810 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content65% 
IMG OID640090669 
Product2-isopropylmalate synthase 
Protein accessionYP_001021292 
Protein GI124267288 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00973] 2-isopropylmalate synthase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0462257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.390804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACA AGCTCATCAT CTTCGACACC ACCTTGCGCG ACGGCGAACA GTCGCCCGGC 
GCCTCCATGA CCAAGGATGA GAAGCTGCGC ATCGCGCGCC AGCTGGAGCG TTTGCGGGTC
GACGTGATCG AGGCGGGTTT CGCGGCGTCG AGCAACGGCG ACTTCGAGGC GGTCCGGGCG
ATTGCCGACG TGATCAAGGA ATCGACCGTG TGCTCGCTGG CGCGCGCCAA TGACCGCGAC
ATCGCGCGGG CGGCCGAGGC GCTGAAGAGC GCTGCGCGTT CTCGCATCCA CACCTTCATC
GCGACCAGTG AACTGCACAT GGAGAAGAAG TTGCGGATGA CGCGCGAGCA GGTGCTGGAG
CAGGCCAGGC TGTCGATTCG CTTCGCCCGC AACCTGTGCG AGGACATCGA GTTTTCGCCG
GAGGATGGCT ACCGGTCCGA CCCGGACTTC CTGTGCCGTG TGATCGAGGC CGTGATCAAC
GAGGGCGCGA CCACCATCAA CGTGCCCGAC ACCGTCGGAT ACGGCATCCC CGAGCTGTAC
GGCAATTTCA TCAGGACCTT GCGGGAGCGG GTGCCGAATT CGGACAAGGC GGTCTGGTCG
GTTCACTGCC ACAACGACCT CGGTATGGCG GTGGCGAACT CGTTGGCTGG CGTGAAGATC
GGGGGTGCCC GCCAGATCGA ATGCACGATC AATGGGCTCG GCGAGCGCGC AGGCAACTGC
TCGCTCGAGG AAGTCGTGAT GGCGGTTCGC ACGCGGCGCG ACCATTTCGG GCTCGAAGTG
GGCATCGATA CCACGCAGAT CGTGCCGGCT TCGCGGCTGG TGTCGCAGAC GACTGGCTTC
ATCGTGCAGC CGAACAAGGC GGTCGTCGGC GCAAATGCCT TCGCGCACGC CTCCGGTATC
CACCAGGACG GCGTCCTGAA GGCGCGCGAC ACCTACGAGA TCATGCGCGC CGAGGACGTG
GGCTGGAGTG CCAACAAGAT CGTGCTCGGC AAGCTCAGCG GCCGGAACGC CTTCAAGCAG
CGCCTGCAGG AGCTCGGAAT CGAGCTCGAG TCCGAGACCG ACGTCAACGC GGCCTTTGCG
CGCTTCAAGG ATCTGGCCGA TCGCAAGAGC GACATCTTCG ACGAAGACAT CATCGCGCTG
GTCGGCGATG AGAGCGTGAC CCACGAGCAG GAGACGTACC GGCTGCTCTC GCTGGAGCAG
CAATCGGCGA CTGGGGAGCG TCCGCATGCG AAGGTGGCTT TCGCGGTCGG AGAGACCGAG
TTCCATGCCG AGAGCGAAGG CAACGGGCCG GTCGACGCGA GTCTCAAGGC CATCGAGTCG
AAGCTGAAAA GCGGCGCAGA AATGCTGCTC TATTCGGTCA ATGCCATCAC CTCGGGCAGC
ACAGAATCCC AGGGCGAGGT GACTGTGCGG CTGCAGCACG GCGGACGGGT GGTGAATGGC
GTGGGGGCGG ACCCGGACAT CGTGGTGGCC TCGGCCAAGG CCTACCTGTC GGCCCTGAAC
AAGCTGCACA GCAAGAACGA GCGCGTCGCC GCCCAGGGGT AA
 
Protein sequence
MADKLIIFDT TLRDGEQSPG ASMTKDEKLR IARQLERLRV DVIEAGFAAS SNGDFEAVRA 
IADVIKESTV CSLARANDRD IARAAEALKS AARSRIHTFI ATSELHMEKK LRMTREQVLE
QARLSIRFAR NLCEDIEFSP EDGYRSDPDF LCRVIEAVIN EGATTINVPD TVGYGIPELY
GNFIRTLRER VPNSDKAVWS VHCHNDLGMA VANSLAGVKI GGARQIECTI NGLGERAGNC
SLEEVVMAVR TRRDHFGLEV GIDTTQIVPA SRLVSQTTGF IVQPNKAVVG ANAFAHASGI
HQDGVLKARD TYEIMRAEDV GWSANKIVLG KLSGRNAFKQ RLQELGIELE SETDVNAAFA
RFKDLADRKS DIFDEDIIAL VGDESVTHEQ ETYRLLSLEQ QSATGERPHA KVAFAVGETE
FHAESEGNGP VDASLKAIES KLKSGAEMLL YSVNAITSGS TESQGEVTVR LQHGGRVVNG
VGADPDIVVA SAKAYLSALN KLHSKNERVA AQG