Gene Mpe_A1409 details


Gene Information

Locus tag: Mpe_A1409
Symbol:
ID: 4783922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1517727
End bp: 1520042
Gene Length: 2316 bp
Protein Length: 771 aa
Translation table: 11
GC content: 69%
IMG OID: 640089975
Product: putative NADH dehydrogenase I chain G
Protein accession: YP_001020606
Protein GI: 124266602
COG category: [C] Energy production and conversion
COG ID: [COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
TIGRFAM ID: [TIGR01973] NADH-quinone oxidoreductase, chain G
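
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names `span_length` and `coding_codons` are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (reasonable for an unspliced CDS, but not stated on this page).

```python
# Values copied from the Gene Information fields above.
START_BP = 1517727
END_BP = 1520042
GENE_LENGTH_BP = 2316
PROTEIN_LENGTH_AA = 771


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (an assumption here)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# 1520042 - 1517727 + 1 = 2316 bp, and 2316 / 3 - 1 = 771 aa.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under those assumptions the reported start, end, gene length, and protein length are mutually consistent.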


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0277288
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGAAA TCGAACTCGA CGGCCGCAAG GTCGAGGTGC AGGAAGGCAG CATGGTGATG 
CATGCCGCCG ACAAGGCCGG CACCTACATC CCGCACTTCT GCTATCACAA GAAGCTCAGC
ATCGCGGCCA ACTGCCGGAT GTGCCTGGTC GACATCGAGA AGGCGCCCAA GCCGATGCCG
GCCTGCGCCA CGCCGGTGAC GCAGGGCATG ATCGTGCGCA CCAAGAGCGA CAAGGCGCTC
AAGGCGCAGA GCTCGGTGAT GGAATTCCTG CTCATCAACC ACCCGCTCGA CTGCCCGATC
TGTGACCAGG GAGGCGAGTG CCAGCTGCAG GACCTGGCGG TGGGCTACGG CAAGTCGGCC
TCGCGCTACG AGGAAGAGAA GCGCGTCGTG CTGCACAAGG ACGTCGGACC GCTGATTTCC
ATGGAGGAGA TGAGTCGCTG CATTCACTGC ACCCGCTGCG TGCGCTTCGG CCAGGAAGTG
GCCGGCGTGA TGGAGCTGGG CATGATCCAT CGCGGCGAGC ACAGCGAGAT CACCACGGTG
GCCGGCGACA CGGTCGACTC CGAACTGTCG GGCAACATGA TCGACATCTG CCCGGTCGGT
GCGCTCACCA GCAAGCCGTT CCGCTACAGC GCCCGCACCT GGGAGCTGTC GCGACGCAAG
TCGGTCAGCC CGCACGATTC GACGGGCGCG AACCTGATCG TGCAGGTCAA GAACCACCAG
GTGATGCGTG TCGTGCCGCT TGAGAACGAG GCGGTCAACG AATGCTGGAT CGCCGACCGC
GACCGCTTCT CCTACGAAGC CCTGAACACT GACGAGCGCC TCACCGCGCC GATGATCAAG
CACGACGGGC AGTGGCGCAC CGTCGACTGG AATACCGCGC TGGAGTACGT CGCCAACGGG
CTGAAGTCGA TCCAGTCCGA GCACGGTGCC GCGTCCATCG GTGCGCTTGC CACCGCGCAC
AGCACGGTGG AAGAGCTTTA CCTGCTCGGG CAACTGGTGC GTGGCGTGGG TTCGGACAAC
ATCGACCACC GCTTGCGGCA TGCCGAGTTC GACACGGTCG AGAAGGCTCG CTGGCTCGGC
ACGAGCATCG AGTCGCTGTC GACGCTGGAC CGTGCCTTCG TCATCGGTTC CTTCCTGCGC
AAGGACCATC CGCTGTTCGC GCAGCGTCTG CGCCAGGCGG CCAAGCACGG CGCGCAGGTC
TCCAGCCTGC ATGCGCTGGC CGACGATTGG CTGATGCCGA TCTCCACGCA GTTCACCGCG
GCGCCGAGCG CCTGGGTGGC CGCACTGGCC GGTGTGGCCG CGGCGGTGGC CTCGCACACC
GGCGCCGCGG CGCCGGTGGC GGCCGAGGCG AGCGAGCCGG CCAAGGCGAT CGCCGCGTCA
CTGCTGAGCG GGCAGCGCAA GGCGGTGCTG CTCGGCAATG CCGCGGTCCA GCACCCGCAG
GCTTCCCGGC TGCTTGCACT CGCCCAGTTC ATCGCCGAGC AGACCGGTGC CACCTTCGGG
GTGCTCGGTG AGGCGGCCAA CAGCGTGGGC GCGCAACTTG TCGGGGCCCA GCCGCGCAGC
GGCGGGCTGA ACGCCGGGCA GATGCTGTCG CAACCGCTGA AGGCCTACCT GCTGCTCAAT
GCCGAACCGG TGCTCGACGC GGCCGATGGC CGGCAGGCGG CGGACACGCT GGCCCGGGCC
GGCATGGTGG TGTCTCTGAG CGCCTTCAGG AACGCCAATC TCGAGCACGC CGACGTGCTG
CTGCCGATCA CGCCGTTCAC CGAGACTTCG GGCAGCTTCG TCAACGCGGA AGGCCGCGTG
CAGGGCTTCC ACGGCGTCGT CCGGCCGCGC GGTGATGCCC GCCCGGCCTG GAAGGTCCTG
CGCGTGCTCG GCTCGATGCT GGGCCTGCCG GGCTTCGGCT TCGAGACCTC CGAGGAAGTC
AAGGCACAGG CGCTGGGTGA CGTGTCTGCA GCACTGGCGT CGCGGCTCGG CAACGCCAGC
CGTGCGAGCG TGGCGATTGC CGCAGATCGC CCCACGCTCG AGCGCGTGGC CGACGTGCCG
ATCTACGCAG GCGACGCCAT CGTGCGCCGT GCACCGTCGC TGCAGGCCAC CGCCGATGCA
CGCGCACCGC GTGCCGGGCT GCCGACGGCG CTGTGGCAGC GCCTCGGGTT GGTTGAGGGG
GGCCAGGTCC ACGTGCAGCA GGGCAGTGCA TCGCTGCGGC TCGCGGCCTA CCACGATGCC
ACGCTAGCAC CGACCGCCGT GCGGGTGCCC GCCGGCCATG CCGCCACGGC AGCACTGGGC
GCGATGTTCG GCGAGATCGC CGTCGAGAAG GCCTGA
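
A GC-content figure such as the one reported for this gene is typically the share of G and C bases in the sequence. The following is a minimal sketch of that calculation, not the database's own method; the function name `gc_percent` is hypothetical, and it assumes the sequence is pasted as shown here, in whitespace-separated blocks.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first line of the gene sequence only; paste the full
# sequence to compare against the value listed under Gene Information.
first_line = "ATGGTTGAAA TCGAACTCGA CGGCCGCAAG GTCGAGGTGC AGGAAGGCAG CATGGTGATG"
print(round(gc_percent(first_line), 1))
```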
 
Protein sequence
MVEIELDGRK VEVQEGSMVM HAADKAGTYI PHFCYHKKLS IAANCRMCLV DIEKAPKPMP 
ACATPVTQGM IVRTKSDKAL KAQSSVMEFL LINHPLDCPI CDQGGECQLQ DLAVGYGKSA
SRYEEEKRVV LHKDVGPLIS MEEMSRCIHC TRCVRFGQEV AGVMELGMIH RGEHSEITTV
AGDTVDSELS GNMIDICPVG ALTSKPFRYS ARTWELSRRK SVSPHDSTGA NLIVQVKNHQ
VMRVVPLENE AVNECWIADR DRFSYEALNT DERLTAPMIK HDGQWRTVDW NTALEYVANG
LKSIQSEHGA ASIGALATAH STVEELYLLG QLVRGVGSDN IDHRLRHAEF DTVEKARWLG
TSIESLSTLD RAFVIGSFLR KDHPLFAQRL RQAAKHGAQV SSLHALADDW LMPISTQFTA
APSAWVAALA GVAAAVASHT GAAAPVAAEA SEPAKAIAAS LLSGQRKAVL LGNAAVQHPQ
ASRLLALAQF IAEQTGATFG VLGEAANSVG AQLVGAQPRS GGLNAGQMLS QPLKAYLLLN
AEPVLDAADG RQAADTLARA GMVVSLSAFR NANLEHADVL LPITPFTETS GSFVNAEGRV
QGFHGVVRPR GDARPAWKVL RVLGSMLGLP GFGFETSEEV KAQALGDVSA ALASRLGNAS
RASVAIAADR PTLERVADVP IYAGDAIVRR APSLQATADA RAPRAGLPTA LWQRLGLVEG
GQVHVQQGSA SLRLAAYHDA TLAPTAVRVP AGHAATAALG AMFGEIAVEK A
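
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the usual NCBI base ordering (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs in the start codons it permits. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch does not handle alternative start codons, which is sufficient here only because this gene begins with ATG.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (first, second, third base
# each cycling through T, C, A, G). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGGTTGAAA TCGAACTCGA C"))  # MVEIELD
```

The result matches the first seven residues of the protein sequence, and translating the last four codons (GAG AAG GCC TGA) gives the final residues EKA followed by the stop.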