Gene Mpe_A2468 details

Gene Information

Locus tag: Mpe_A2468
Symbol:
ID: 4785665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2628621
End bp: 2629874
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 63%
IMG OID: 640091038
Product: isocitrate dehydrogenase
Protein accession: YP_001021658
Protein GI: 124267654
COG category: [C] Energy production and conversion
COG ID: [COG0538] Isocitrate dehydrogenases
TIGRFAM ID: [TIGR00183] isocitrate dehydrogenase, NADP-dependent, prokaryotic type
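
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2628621
end_bp = 2629874

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the script reproduces the reported Gene Length (1254 bp) and Protein Length (417 aa).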


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.240334
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.106657
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTACCAGC ACATCAAGGT GCCTGCCGAC GGCCAAAAGA TCACCGTCAA TGCCGATTTC 
TCGCTCAACG TGCCCGATCA GCCGATCATC CCCTTCATCG AGGGCGACGG CACCGGCTTC
GACATCACGC CGGTGATGAT CAAGGTGGTC GACGCGGCGG TCGAGAAAAG CTACGGTGGC
AAGCGCAAGA TCCACTGGAT GGAGATCTAC GCCGGCGAGA AGTCGACCAA GGTCTACGGC
CCCGACGTCT GGCTGCCGGA AGAGACGCTG CAGGTCCTGA AGGAGTACGT GGTCTCGATC
AAGGGCCCGC TCACCACGCC CGTGGGTGGT GGCATCCGCT CGCTCAACGT GGCGTTGCGC
CAGGAGCTCG ACCTCTACGT CTGCCTGCGC CCGATCCAGT ACTTCGAGGG GGTGCCGAGC
CCGGTGAAGG AGCCGCACAA GACCAACATG GTGATCTTCC GCGAGAACTC GGAGGACATC
TACGCCGGCA TCGAGTTCGA AGCCGAGAGC GAGAAGGCCA AGAAGCTCAT CAAGATCCTG
CAGGACGAAT TCGGTGTCAA GAAGATCCGC TTCCCCGCCA CATCGGGCAT CGGCATCAAG
CCGGTGTCGC GCGAGGGCAC CGAGCGCCTG GTGCGCAAGG CCATCCAGTA CGCGATCGAT
AACGACAAGC CCAGCGTGAC CATCGTGCAC AAGGGCAACA TCATGAAGTT CACCGAGGGC
GGTTTCCGCG ATTGGGCCTA TGGCCTGGCG CAGAAGGAGT TCGGCGCGCA GCCGATCGAC
GGCGGCCCGT GGTGCAAGTT CAAGAACCCG AAGACCGGCA AGGAGATCAC CGTCAAGGAC
TCGATCGCCG ACGCCTTCCT GCAGCAGATC CTGCTGCGCC CGGCCGAGTA CTCGGTGATC
GCCACGCTCA ACCTGAACGG GGACTACGTG TCCGACGCGC TGGCCGCGCA GGTGGGCGGC
ATCGGCATTG CGCCGGGGGC CAACCTGTCG GACTCGATCG CGATGTTCGA GGCCACCCAC
GGCACCGCGC CCAAGTACGC CGGCAAGGAC TACGTGAACC CGGGCTCCGA GATCCTGTCG
GCCGAGATGA TGCTGCGCCA CATGGGCTGG ACCGAGGCGG CCGACCTCGT CATCTCGTCG
ATGGAGAAGT CGATCCTGTC GAAGAAGGTC ACCTACGACT TCGCGCGCTT GCTGGACGGC
GCCACGCAGG TCAGCTGCTC GGGCTTCGGG CAGGTGATGA TCGACAACAT GTGA
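
A GC percentage such as the one reported for this gene could be computed along the following lines. This is a hedged sketch rather than the database's own method: the function name gc_percent is hypothetical, and the example runs on only the first 60 bases of the gene sequence, so its result is not expected to equal the whole-gene figure.

```python
def gc_percent(dna_text):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (such as the 10-base grouping used in this record)
    is removed before counting.
    """
    seq = "".join(dna_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only (a fragment, not the full CDS).
fragment = "ATGTACCAGC ACATCAAGGT GCCTGCCGAC GGCCAAAAGA TCACCGTCAA TGCCGATTTC"
fragment_gc = gc_percent(fragment)
print(round(fragment_gc, 1))
```

Passing the full 1254-base sequence to the same function would give the value to compare against the GC content field.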
 
Protein sequence
MYQHIKVPAD GQKITVNADF SLNVPDQPII PFIEGDGTGF DITPVMIKVV DAAVEKSYGG 
KRKIHWMEIY AGEKSTKVYG PDVWLPEETL QVLKEYVVSI KGPLTTPVGG GIRSLNVALR
QELDLYVCLR PIQYFEGVPS PVKEPHKTNM VIFRENSEDI YAGIEFEAES EKAKKLIKIL
QDEFGVKKIR FPATSGIGIK PVSREGTERL VRKAIQYAID NDKPSVTIVH KGNIMKFTEG
GFRDWAYGLA QKEFGAQPID GGPWCKFKNP KTGKEITVKD SIADAFLQQI LLRPAEYSVI
ATLNLNGDYV SDALAAQVGG IGIAPGANLS DSIAMFEATH GTAPKYAGKD YVNPGSEILS
AEMMLRHMGW TEAADLVISS MEKSILSKKV TYDFARLLDG ATQVSCSGFG QVMIDNM
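
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this in Python on the first and last lines of the gene sequence only. It assumes the standard amino acid assignments (which translation table 11 shares with the standard code; the two differ in permitted start codons), and the names CODON_TABLE and translate are illustrative. Both fragments are in frame because every preceding sequence line holds 60 bases, a multiple of three.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence in this record.
first_line = "ATGTACCAGC ACATCAAGGT GCCTGCCGAC GGCCAAAAGA TCACCGTCAA TGCCGATTTC"
last_line = "GCCACGCAGG TCAGCTGCTC GGGCTTCGGG CAGGTGATGA TCGACAACAT GTGA"

n_terminal = translate(first_line)
c_terminal = translate(last_line)
print(n_terminal)
print(c_terminal)
```

The two printed strings can be compared against the start and end of the protein sequence; the final TGA codon is a stop and contributes no residue.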