Gene Mext_4581 details

Gene Information

Locus tag: Mext_4581
Symbol:
ID: 5835113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 5115258
End bp: 5116976
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 69%
IMG OID: 641370375
Product: respiratory-chain NADH dehydrogenase domain-containing protein
Protein accession: YP_001642020
Protein GI: 163853977
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
        [COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit
TIGRFAM ID:
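
The length fields in this record are consistent with the coordinates: 5116976 - 5115258 + 1 = 1719 bp, and 1719 / 3 = 573 codons, of which one is the stop codon, leaving 572 aa. The Python sketch below reproduces that arithmetic. It assumes 1-based, inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(5115258, 5116976)
print(length_bp)                  # 1719
print(protein_length(length_bp))  # 572
```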


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.531155
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGG CAAGTGGGAC TGTCCGGAGC TTCGCGCATC CGGGCCGTGG CCGTAACGTC 
GCCCGCGCCG TGCCGAAGGG GCGTCAGGTC GATCCCCACG CCAAGGTCGA GATCGAGGAA
CTGCTTGGCA CCCGCTCGCG CCAGCGCGAC CTGCTGATCG AGCACCTGCA CCTGATCCAG
GACACCTACG GCCAGATCAG CGCCGACCAT CTCGCGGCGC TGGCCGACGA GATGAGCCTC
GCCTTCGCCG AGGTGTTCGA GACCGCGACC TTCTACGCGC ATTTCGACGT GGTGAAGGAG
GGCGAGGCCG ACATCCCGCG CCTGACGATC CGGGTCTGCG ACAGCATCAC CTGCGCCATG
TTCGGCGCCG ACGAGTTGCT GGAGACGCTG CAGCGCGAAC TGGCCTCGGA TGCGGTCCGC
GTCGTGCGCG CGCCCTGTGT CGGCCTGTGC GACCACGCCC CGGCGGTCGA GGTCGGGCAC
AACTTCCTGC ACCGGGCCGA CCTCGCCTCC GTGCGCGCCG CGGTCGAGGC CGAGGACACC
CACGCCCACA TCCCCACCTA CGTCGATTAC GACGCCTACC GGGCCGGTGG CGGCTACGCG
ACCCTGGAGC GGCTGCGCAG CGGCGAACTG CCGGTCGATG ACGTGCTGAA GGTGCTCGAC
GACGGCGGCC TGCGCGGCCT CGGCGGCGCC GGCTTCCCCA CGGGCCGCAA GTGGCGCTCC
GTGCGCGGCG AGCCCGGCCC CCGGCTGATG GCGGTCAACG GCGACGAGGG CGAGCCCGGC
ACCTTCAAGG ACCAGCTCTA CCTCAACACC GACCCGCACC GCTTCCTTGA GGGCATGCTG
ATCGGTGCCC ACGTCGTCGA GGCCGCCGAG GTCTACATCT ACCTGCGCGA CGAGTATCCG
ATCTCCCGCG AGATCCTGGC CCGCGAGATC GCGAAGCTCC CCGAGGGCGG CACCCGCATC
CACCTGCGCC GGGGCGCGGG CGCCTATATC TGCGGCGAGG AATCCTCGCT GATCGAGTCG
CTGGAGGGCA AGCGCGGCCT GCCGCGGCAC AAGCCGCCCT TCCCGTTCCA GGTCGGCCTG
TTCAACCGGC CGACGCTGAT CAACAACATC GAGACGCTGT TCTGGGTGCG CGACCTGATC
GAGCGCGGCG CCGAATGGTG GAAGAGCCAT GGCCGCAACG GCCGCGTCGG CCTGCGCTCC
TACTCGGTTT CGGGCCGGGT CAAGGAGCCG GGCGTCAAGC TCGCGCCCGC CGGCCTGACC
ATCCAGGAAC TCATCGACGA GTATTGCGGC GGCATCTCTG ACGGCCACAG CTTCGCGGCC
TACCTGCCGG GCGGAGCCTC GGGCGGCATC CTGCCGGCCT CGATGAACGA CATCCCGCTC
GATTTCGGCA CGCTCGAAAA ATACGGCTGC TTCATCGGCT CGGCCGCGGT CGTGATCCTG
TCCGATCAGG ATGATGTGCG CGGTGCCGCG TTGAACCTGA TGAAGTTCTT CGAGGACGAG
TCCTGCGGGC AGTGCACGCC CTGCCGCTCG GGCACGCAGA AGGCCCGCAT GCTGATGGAG
AACGGCGTGT GGGACACCGA TCTCCTCGGC GAACTGGCGC AGTGCATGCG CGACGCCTCG
ATCTGCGGTC TCGGTCAGGC GGCCTCGAAC CCCGTCAGCA CCGTGATCAA GTACTTCCCC
GATCTCTTCC CGGAGCCGCG GGCCGTGGCG GCCGAGTGA
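
The GC content field can be recomputed from a sequence such as the one above. This is a minimal Python sketch, assuming GC content means the fraction of G and C among the A, C, G and T bases, with the spaces between 10-base blocks ignored. How the record rounds its percentage is not stated, and the function name is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C among A/C/G/T characters; whitespace is ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence: 5 of its 10 bases are G.
print(gc_content("ATGAGTGAGG"))  # 0.5
```

Passing the full gene sequence, pasted as one string, yields the gene-wide fraction for comparison with the reported percentage.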
 
Protein sequence
MSEASGTVRS FAHPGRGRNV ARAVPKGRQV DPHAKVEIEE LLGTRSRQRD LLIEHLHLIQ 
DTYGQISADH LAALADEMSL AFAEVFETAT FYAHFDVVKE GEADIPRLTI RVCDSITCAM
FGADELLETL QRELASDAVR VVRAPCVGLC DHAPAVEVGH NFLHRADLAS VRAAVEAEDT
HAHIPTYVDY DAYRAGGGYA TLERLRSGEL PVDDVLKVLD DGGLRGLGGA GFPTGRKWRS
VRGEPGPRLM AVNGDEGEPG TFKDQLYLNT DPHRFLEGML IGAHVVEAAE VYIYLRDEYP
ISREILAREI AKLPEGGTRI HLRRGAGAYI CGEESSLIES LEGKRGLPRH KPPFPFQVGL
FNRPTLINNI ETLFWVRDLI ERGAEWWKSH GRNGRVGLRS YSVSGRVKEP GVKLAPAGLT
IQELIDEYCG GISDGHSFAA YLPGGASGGI LPASMNDIPL DFGTLEKYGC FIGSAAVVIL
SDQDDVRGAA LNLMKFFEDE SCGQCTPCRS GTQKARMLME NGVWDTDLLG ELAQCMRDAS
ICGLGQAASN PVSTVIKYFP DLFPEPRAVA AE
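
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal Python sketch under stated assumptions: translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. This gene begins with ATG, so the sketch does not special-case alternative start codons such as GTG or TTG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a CDS in frame 1, dropping one trailing stop codon if present.

    Whitespace (such as the gaps between 10-base blocks) is removed first.
    Codons containing characters other than A/C/G/T raise KeyError.
    """
    seq = "".join(sequence.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGAGTGAGG CAAGTGGGAC TGTCCGGAGC TTCGCGCATC CGGGCCGTGG CCGTAACGTC"
print(translate_cds(first_line))  # MSEASGTVRSFAHPGRGRNV
```

The printed residues match the first 20 residues of the protein sequence listed above.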