Gene Information |
Locus tag | Mext_1079 |
Symbol | |
ID | 5832767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1175730 |
End bp | 1177856 |
Gene Length | 2127 bp |
Protein Length | 708 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641366873 |
Product | NADH dehydrogenase subunit G |
Protein accession | YP_001638554 |
Protein GI | 163850511 |
COG category | [C] Energy production and conversion |
COG ID | [COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) |
TIGRFAM ID | [TIGR01973] NADH-quinone oxidoreductase, chain G |
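The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record above (Mext_1079).
length_bp = gene_length(1175730, 1177856)
print(length_bp)                  # 2127
print(protein_length(length_bp))  # 708
```

Under these assumptions the computed values agree with the listed gene length (2127 bp) and protein length (708 aa).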
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.697159 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTGCC GATGGCGGCG GAGTGAGTGC CGCCCCGATG CTCACCTTCG AGATGACCCG ATGACCAAAA TCCTCATCGA CGGCACCGAG GTCGATGTTC CGGCCGACTA CACCCTGCTT CAGGCCTGCG AGATCGCGGG CGCGGAGATC CCGCGCTTCT GCTTCCACGA GCGGCTGTCG ATCGCCGGCA ATTGCCGCAT GTGTCTGGTC GAGCTGAAGG GCGCGCCCAA GCCCGTAGCC TCCTGCGCCT ACGCGGTGAA GGATTGCCGG CCCGGCCCCA ACGGCGAGCC GCCGGAGGTG CTGACGCGCT CCGGCACCAC GAAGAAGGCG CGTGAGGGGG TGATGGAGTT CCTTCTCATC AACCACCCGC TCGATTGCCC GATCTGCGAC CAGGGCGGCC ATTGCGACCT GCAGGATCAG GCGATGGCCT ACGGCGTCGA CTCGACCCGC TACAGCGAGA ACAAGCGGGC GGTCGAGGAA AAGTATATCG GCCCGCTGGT GCGCACGGCG ATGAACCGCT GCATCCACTG CACCCGCTGC GTCCGCTTCC TCGCGGAAGT GGCCGGCGTG CCGGATCTCG GCGCCATCGG CCGCGGCGAG GACATGGAGA TCACCAGCTA CCTCGAAGAA GCGATGGGCT CGGAGCTTCA GGGCAACGTC GCCGACCTCT GCCCCGTCGG CGCGCTGGTG CACAAGCCGC AGAGCTACAA CGTGCGCCCG TGGGAGCTGC ACAAGACCGA GTCCGTCGAT GTGATGGATG CGGTCGGCTC GGCGATTCGC ATCGATGCCC GCGGCCGCGA GGTGATGCAG ATCGAGCCGC GGATCAGCGA GGAGATCAAT GAGGAGTGGA TCTCCGACAA GACCCGCCAC GTCGTCGACG GCCTGCGGCT GCAGCGCCTC GACCGCCCAT TCCTGCGCGA GAACGGCCGC CTGCGTCCCG CTTCCTGGGG TGAGGCGTTC TCGGCGATTG CCGCCAAGGT GAAGGGCGCC GATCCCAAGC GCGTCGGCGC CCTCGTCGGC GACCTCGCGG GTGCGGAGGA GATCTTCGCT CTCAAGGCGC TGATGGGTTC GCTCGGCGTC ACCAATCTCG ACGCGCGCCA GACCGGCGAG GCGATCGATC CGGCCTGGGG CCGGGCCGCC TACACGCTCG GCGCGACGAT CCCCGGTATC GAGCAGGCGG ACGCGATCCT GATCGTCGGC GCCAACCCGC GCACCGAGGC TTCGCTGCTC AACGTGCGCA TCCGCAAGCG CTGGCGCATG GCTCCGGTCT CGATCGGCCT GATCCTCGAT GAGCAGCCGG ATCTGACCTA CCCCTACACC TATCTCGGCG CCGGAACCGA CACGCTGGCC TCGATCGCCA AGGGCGAGCA CAGCTTCCTC GATGTGCTGA GACAAGCCGA GCGTCCGCTC GTCATCGTCG GCGAGGGCGG GCTCGGTTCG CTCGCCGCCG CCGCCGCGTT GGCGAAGGAT GTCGGCGCGG TGACCGACGG CTGGAACGGC TTCGGCGTGC TCAACACGGC CGCTGCCCGC GTCGGTGCTC TCGATCTCGG CTTCGTGCCG GGGGAGGGGG GCTTGAGCTT CTCGCAAATG CTGGAGCCGG GCGCGCTGGA CGTGGTGTTC AACCTCGGTG CCGACGAGCG GGTGATCGGG CCGGGCGCCT TCGTGATCTA TCAGGGCACC CACGGCGATG CCGGCGCGAG CCGCGCCGAC GTGATCCTGC CGGGCGCCGC CTACGCCGAG AAGAGCGCGA CCTACGTCAA CCTCGAAGGC CGGGTGCAGA TGGCCAACCG CGCCGGCTTC 
CCGCCGGGCG ACGCGCGCGA GGATTGGGCG ATCCTGCGCG CCCTGTCCGA CGTGCTCGGC AAGCGCCTGC CCTACGATTC GCTCGCCGCT TTGCGTAAGG CGATGTATGC AGCCCATCCG CATCTCGCGG CGGTCGGACA GGTCGAGCCG TCCGATGCGG CGGCCACGCT TGATCGCCTT GCCGCACTGC CGGCCGCGAC GGAGAAGGCA ACCTTCTCCT CACCGGTCGC CGACTTCTAC CTCACCAACC CGATCGCCCG CGCCTCGCGG GTGCTCGCCG AGTGTTCCGG CCTCGCCCGA GGTCGCGCCC TCGAAGCGGC GGAATAG
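The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The sketch below shows one way a GC percentage like the one reported in the GC content field could be recomputed; `gc_percent` is a hypothetical helper, and how the database itself counts or rounds the value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# The first two ten-base blocks of the gene sequence above.
print(gc_percent("ATGCGGTGCC GATGGCGGCG"))  # 75.0
```

Passing the full gene sequence string instead of the two-block excerpt gives the value for the whole gene.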
|
Protein sequence | MRCRWRRSEC RPDAHLRDDP MTKILIDGTE VDVPADYTLL QACEIAGAEI PRFCFHERLS IAGNCRMCLV ELKGAPKPVA SCAYAVKDCR PGPNGEPPEV LTRSGTTKKA REGVMEFLLI NHPLDCPICD QGGHCDLQDQ AMAYGVDSTR YSENKRAVEE KYIGPLVRTA MNRCIHCTRC VRFLAEVAGV PDLGAIGRGE DMEITSYLEE AMGSELQGNV ADLCPVGALV HKPQSYNVRP WELHKTESVD VMDAVGSAIR IDARGREVMQ IEPRISEEIN EEWISDKTRH VVDGLRLQRL DRPFLRENGR LRPASWGEAF SAIAAKVKGA DPKRVGALVG DLAGAEEIFA LKALMGSLGV TNLDARQTGE AIDPAWGRAA YTLGATIPGI EQADAILIVG ANPRTEASLL NVRIRKRWRM APVSIGLILD EQPDLTYPYT YLGAGTDTLA SIAKGEHSFL DVLRQAERPL VIVGEGGLGS LAAAAALAKD VGAVTDGWNG FGVLNTAAAR VGALDLGFVP GEGGLSFSQM LEPGALDVVF NLGADERVIG PGAFVIYQGT HGDAGASRAD VILPGAAYAE KSATYVNLEG RVQMANRAGF PPGDAREDWA ILRALSDVLG KRLPYDSLAA LRKAMYAAHP HLAAVGQVEP SDAAATLDRL AALPAATEKA TFSSPVADFY LTNPIARASR VLAECSGLAR GRALEAAE
|
| |