Gene Mchl_5042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_5042 
Symbol 
ID7113645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp5389283 
End bp5391001 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content69% 
IMG OID643527736 
ProductRespiratory-chain NADH dehydrogenase domain 51 kDa subunit 
Protein accessionYP_002423735 
Protein GI218532919 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
[COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.558734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.200825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGG CAAGTGGGAC TGTCCGGAGC TTCGCGCATC CGGGCCGTGG CCGTAACGTC 
GCCCGCGCCG TGCCGAAGGG GCGTCAGGTC GATCCCCACG CCAAGGTTGA GATCGAGGAG
CTGCTCGGCA CCCGCTCGCG CCAGCGCGAC CTGCTGATCG AGCACCTGCA CCTGATCCAG
GACACCTACG GCCAGATCAG CGCCGATCAT CTCGCGGCGC TGGCCGACGA GATGAGCCTC
GCCTTCGCCG AGGTGTTCGA GACCGCGACC TTCTACGCGC ATTTCGACGT GGTGAAGGAG
GGCGAGGCCG ACATCCCGCG CCTGACGATC CGGGTCTGTG ACAGCATCAC CTGCGCGATG
TTCGGCGCCG ACGAGCTGCT GGAGACGCTG CAGCGCGAAC TGGCCTCGGA TGCGGTCCGC
GTCGTGCGCG CGCCCTGTGT CGGCCTGTGC GACCACGCCC CGGCGGTCGA GGTCGGGCAC
AACTTCCTGC ACCGGGCCGA CCTCGCCTCC GTGCGCGCCG CGGTCGAGGC CGAGGACACC
CACGCCCACA TCCCCACTTA CGTCGATTAC GACGCCTACC GGGCCGGCGG CGGCTACGCG
ACCCTGGAGC GGCTGCGCAG CGGCGAACTG TCGGTCGATG ACGTGCTGAA GGTGCTCGAC
GACGGCGGCC TGCGCGGCCT CGGCGGCGCT GGCTTCCCCA CCGGCCGCAA GTGGCGCTCC
GTGCGCGGCG AGCCCGGCCC CCGGCTGATG GCGGTCAACG GCGACGAGGG CGAGCCCGGG
ACCTTCAAGG ACCAGCTCTA CCTCAACACC GACCCGCACC GGTTCCTTGA GGGCATGCTG
ATCGGTGCCC ACGTCGTCGA GGCCGCCGAG GTCTACATCT ACCTGCGCGA CGAGTACCCG
ATCTCCCGCG AGATCCTGGC CCGCGAGATC GCGAAGCTCC CCGAGGGCGG CACCCGCATC
CACCTGCGCC GTGGGGCCGG CGCCTATATC TGCGGTGAGG AATCTTCGCT GATCGAGTCG
CTGGAGGGCA AGCGCGGCCT GCCGCGGCAC AAGCCGCCCT TCCCGTTCCA GGTCGGCCTG
TTCAACCGCC CGACGCTGAT CAACAACATC GAGACGCTGT TCTGGGTGCG CGACCTGATC
GAGCGCGGCG CCGAATGGTG GAAGAGCCAT GGCCGCAACG GCCGCGTCGG CCTACGCTCC
TACTCGGTTT CGGGCCGGGT CAAGGAGCCG GGCGTCAAGC TCGCGCCCGC CGGCCTGACC
ATCCAGGAAC TCATCGACGA GTATTGCGGC GGCATCTCTG ACGGCCACAG CTTCGCGGCC
TACCTGCCGG GCGGAGCCTC GGGCGGCATC CTGCCGGCCT CGATGAACGA CATCCCGCTC
GATTTCGGCA CGCTGGAAAA ATACGGCTGC TTCATCGGTT CGGCCGCGGT CGTGATCCTG
TCCGATCAGG ACGATGTGCG CGGTGCCGCG TTGAACCTGA TGAAGTTCTT CGAGGACGAG
TCCTGCGGGC AGTGCACGCC CTGCCGCTCG GGCACGCAGA AGGCCCGCAT GCTGATGGAG
AACGGCGTGT GGGACACCGA TCTCCTCGGC GAGCTGGCGC AGTGCATGCG CGACGCCTCG
ATCTGCGGTC TCGGTCAGGC GGCCTCGAAC CCCGTCAGCA CCGTGATCAA GTATTTCCCC
GATCTCTTCC CGGAGCCGCG GGCCGTGGCG GCCGAGTGA
 
Protein sequence
MSEASGTVRS FAHPGRGRNV ARAVPKGRQV DPHAKVEIEE LLGTRSRQRD LLIEHLHLIQ 
DTYGQISADH LAALADEMSL AFAEVFETAT FYAHFDVVKE GEADIPRLTI RVCDSITCAM
FGADELLETL QRELASDAVR VVRAPCVGLC DHAPAVEVGH NFLHRADLAS VRAAVEAEDT
HAHIPTYVDY DAYRAGGGYA TLERLRSGEL SVDDVLKVLD DGGLRGLGGA GFPTGRKWRS
VRGEPGPRLM AVNGDEGEPG TFKDQLYLNT DPHRFLEGML IGAHVVEAAE VYIYLRDEYP
ISREILAREI AKLPEGGTRI HLRRGAGAYI CGEESSLIES LEGKRGLPRH KPPFPFQVGL
FNRPTLINNI ETLFWVRDLI ERGAEWWKSH GRNGRVGLRS YSVSGRVKEP GVKLAPAGLT
IQELIDEYCG GISDGHSFAA YLPGGASGGI LPASMNDIPL DFGTLEKYGC FIGSAAVVIL
SDQDDVRGAA LNLMKFFEDE SCGQCTPCRS GTQKARMLME NGVWDTDLLG ELAQCMRDAS
ICGLGQAASN PVSTVIKYFP DLFPEPRAVA AE