Gene Mpe_A1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1406 
Symbol 
ID4783919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1514617 
End bp1515870 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content65% 
IMG OID640089972 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_001020603 
Protein GI124266599 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.564759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.621774 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGA TCAAGAACTA CACCCTCAAT TTCGGGCCGC AGCATCCGGC GGCCCACGGC 
GTCCTGCGCC TGATCCTCGA GCTCGACGGC GAGGTGATCC AGCGCGCAGA TCCGCACATC
GGCCTGCTGC ACCGCGCGAC CGAGAAGCTC GCGGAGAGCA AGACCTACAT CCAGTCGCTG
CCCTACATGG ACCGTCTCGA CTACGTGTCC ATGATGTGCA ACGAGCACGC CTACTGCCTG
GCGATCGAGA AGCTGCTCGG CGTCGATGTG CCGGTCCGTG CCCAGTACAT CCGCGTGATG
TTCTCCGAGA TCACGCGCAT CCTGAACCAC CTGCTGTGGA TCGGCGCTCA CGGTCTCGAC
TGCGGTGCGA TGAACATCCT CATCTACGCC TTCCGCGAGC GCGAGGACCT GTTCGACATG
TACGAGGCGG TGTCGGGTGC GCGCATGCAC GCGGCCTACT TCCGTCCGGG CGGCGTGTAC
CGCGACCTGC CGGACACCAT GCCGCAATAC CGGGTCTCGA AGATCCGCAA CGCCAAGGCA
CTGGCGAAGA TGAACGAGAA CCGGCAGGGC TCGCTGCTGG ACTTCATCGA GGACTTCTGC
CGTCGGTTCC CCAAGAACGT CGACGATTAC GAGACGTTGC TCACCGACAA CCGCATCTGG
AAGCAGCGCA CCGTGGGCAT CGGCGTCGTG ACGCCGGAGC GCGCACTGAA CCTCGGCTTC
ACCGGTCCGA TGCTGCGCGG CTCGGGCATC GCCTGGGACC TGCGCAAGAC GCAGCCCTAC
GACGTCTACG ACCGCGTCGA TTTCGACATC CCGGTCGGTG CCGGCGGCGA TTGCTACGAC
CGCTACCTGG TGCGCGTCGA GGAGCTGCGC CAGTCGAACC GCATCATCCA GCAGTGCGCG
GCCTGGCTGC GCGCTAACCC CGGCCCGGTG ATCACCGACA ACCACAAGGT CGCGGCGCCG
GGGCGGGTGG ACATGAAGTC CAACATGGAA GAGCTGATTC ACCACTTCAA GCTCTTCACC
GAGGGCTTCC ACGTGCCCGA GGGCGAGGCC TACGCGGCCG TCGAGCACCC GAAGGGCGAG
TTCGGCATCT ACCTGGTGAG CGACGGTGCC AACAAGCCCT ACCGACTCAA GATCCGCGCG
CCCGGCTTCG CCCACCTCGC GGCGATGGAC GAGATGTCGC GCGGCCACAT GATCGCCGAC
GCCGTGGCGG TGATCGGCAC GATGGACATC GTTTTCGGCG AGATTGACCG ATGA
 
Protein sequence
MAEIKNYTLN FGPQHPAAHG VLRLILELDG EVIQRADPHI GLLHRATEKL AESKTYIQSL 
PYMDRLDYVS MMCNEHAYCL AIEKLLGVDV PVRAQYIRVM FSEITRILNH LLWIGAHGLD
CGAMNILIYA FREREDLFDM YEAVSGARMH AAYFRPGGVY RDLPDTMPQY RVSKIRNAKA
LAKMNENRQG SLLDFIEDFC RRFPKNVDDY ETLLTDNRIW KQRTVGIGVV TPERALNLGF
TGPMLRGSGI AWDLRKTQPY DVYDRVDFDI PVGAGGDCYD RYLVRVEELR QSNRIIQQCA
AWLRANPGPV ITDNHKVAAP GRVDMKSNME ELIHHFKLFT EGFHVPEGEA YAAVEHPKGE
FGIYLVSDGA NKPYRLKIRA PGFAHLAAMD EMSRGHMIAD AVAVIGTMDI VFGEIDR