Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4581 |
Symbol | |
ID | 5835113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5115258 |
End bp | 5116976 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641370375 |
Product | respiratory-chain NADH dehydrogenase domain-containing protein |
Protein accession | YP_001642020 |
Protein GI | 163853977 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.531155 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGG CAAGTGGGAC TGTCCGGAGC TTCGCGCATC CGGGCCGTGG CCGTAACGTC GCCCGCGCCG TGCCGAAGGG GCGTCAGGTC GATCCCCACG CCAAGGTCGA GATCGAGGAA CTGCTTGGCA CCCGCTCGCG CCAGCGCGAC CTGCTGATCG AGCACCTGCA CCTGATCCAG GACACCTACG GCCAGATCAG CGCCGACCAT CTCGCGGCGC TGGCCGACGA GATGAGCCTC GCCTTCGCCG AGGTGTTCGA GACCGCGACC TTCTACGCGC ATTTCGACGT GGTGAAGGAG GGCGAGGCCG ACATCCCGCG CCTGACGATC CGGGTCTGCG ACAGCATCAC CTGCGCCATG TTCGGCGCCG ACGAGTTGCT GGAGACGCTG CAGCGCGAAC TGGCCTCGGA TGCGGTCCGC GTCGTGCGCG CGCCCTGTGT CGGCCTGTGC GACCACGCCC CGGCGGTCGA GGTCGGGCAC AACTTCCTGC ACCGGGCCGA CCTCGCCTCC GTGCGCGCCG CGGTCGAGGC CGAGGACACC CACGCCCACA TCCCCACCTA CGTCGATTAC GACGCCTACC GGGCCGGTGG CGGCTACGCG ACCCTGGAGC GGCTGCGCAG CGGCGAACTG CCGGTCGATG ACGTGCTGAA GGTGCTCGAC GACGGCGGCC TGCGCGGCCT CGGCGGCGCC GGCTTCCCCA CGGGCCGCAA GTGGCGCTCC GTGCGCGGCG AGCCCGGCCC CCGGCTGATG GCGGTCAACG GCGACGAGGG CGAGCCCGGC ACCTTCAAGG ACCAGCTCTA CCTCAACACC GACCCGCACC GCTTCCTTGA GGGCATGCTG ATCGGTGCCC ACGTCGTCGA GGCCGCCGAG GTCTACATCT ACCTGCGCGA CGAGTATCCG ATCTCCCGCG AGATCCTGGC CCGCGAGATC GCGAAGCTCC CCGAGGGCGG CACCCGCATC CACCTGCGCC GGGGCGCGGG CGCCTATATC TGCGGCGAGG AATCCTCGCT GATCGAGTCG CTGGAGGGCA AGCGCGGCCT GCCGCGGCAC AAGCCGCCCT TCCCGTTCCA GGTCGGCCTG TTCAACCGGC CGACGCTGAT CAACAACATC GAGACGCTGT TCTGGGTGCG CGACCTGATC GAGCGCGGCG CCGAATGGTG GAAGAGCCAT GGCCGCAACG GCCGCGTCGG CCTGCGCTCC TACTCGGTTT CGGGCCGGGT CAAGGAGCCG GGCGTCAAGC TCGCGCCCGC CGGCCTGACC ATCCAGGAAC TCATCGACGA GTATTGCGGC GGCATCTCTG ACGGCCACAG CTTCGCGGCC TACCTGCCGG GCGGAGCCTC GGGCGGCATC CTGCCGGCCT CGATGAACGA CATCCCGCTC GATTTCGGCA CGCTCGAAAA ATACGGCTGC TTCATCGGCT CGGCCGCGGT CGTGATCCTG TCCGATCAGG ATGATGTGCG CGGTGCCGCG TTGAACCTGA TGAAGTTCTT CGAGGACGAG TCCTGCGGGC AGTGCACGCC CTGCCGCTCG GGCACGCAGA AGGCCCGCAT GCTGATGGAG AACGGCGTGT GGGACACCGA TCTCCTCGGC GAACTGGCGC AGTGCATGCG CGACGCCTCG ATCTGCGGTC TCGGTCAGGC GGCCTCGAAC CCCGTCAGCA CCGTGATCAA GTACTTCCCC GATCTCTTCC CGGAGCCGCG GGCCGTGGCG GCCGAGTGA
|
Protein sequence | MSEASGTVRS FAHPGRGRNV ARAVPKGRQV DPHAKVEIEE LLGTRSRQRD LLIEHLHLIQ DTYGQISADH LAALADEMSL AFAEVFETAT FYAHFDVVKE GEADIPRLTI RVCDSITCAM FGADELLETL QRELASDAVR VVRAPCVGLC DHAPAVEVGH NFLHRADLAS VRAAVEAEDT HAHIPTYVDY DAYRAGGGYA TLERLRSGEL PVDDVLKVLD DGGLRGLGGA GFPTGRKWRS VRGEPGPRLM AVNGDEGEPG TFKDQLYLNT DPHRFLEGML IGAHVVEAAE VYIYLRDEYP ISREILAREI AKLPEGGTRI HLRRGAGAYI CGEESSLIES LEGKRGLPRH KPPFPFQVGL FNRPTLINNI ETLFWVRDLI ERGAEWWKSH GRNGRVGLRS YSVSGRVKEP GVKLAPAGLT IQELIDEYCG GISDGHSFAA YLPGGASGGI LPASMNDIPL DFGTLEKYGC FIGSAAVVIL SDQDDVRGAA LNLMKFFEDE SCGQCTPCRS GTQKARMLME NGVWDTDLLG ELAQCMRDAS ICGLGQAASN PVSTVIKYFP DLFPEPRAVA AE
|
| |