Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2301 |
Symbol | |
ID | 5835650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2550268 |
End bp | 2551668 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641368100 |
Product | pyridine nucleotide-disulphide oxidoreductase dimerisation region |
Protein accession | YP_001639767 |
Protein GI | 163851724 |
COG category | [C] Energy production and conversion |
COG ID | [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes |
TIGRFAM ID | [TIGR01424] glutathione-disulfide reductase, plant |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0108496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00487639 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGAGA CACCGATGAG CGAGTCCTTC GACGTCGACC TGTTCGTGAT CGGTGGCGGT TCGGGCGGGG TGCGGGCGGC CCGCATCGCC GCGGGCCACG GCGCGCGGGT GATGCTGGCC GAGGAGTACC GCGTCGGCGG CACCTGCGTG ATCCGCGGCT GCGTCCCGAA GAAGCTGATG GTCTATGCCG GCCGCTTCAC CGACGAGTTC GAGGACGCCG CCGGCTTTGG CTGGCACCTC GAGACGCCGC GCTTTGACTG GGCCGTTCTG AAGCGCTCCC GCGACGCGGA GGTCGCGCGG CTGGAGGGCA TCTACGGCCG CAACCTCGCG GGCGCCGGGG TCGAGGTCGT GGCCGACCGC GCGGTGATCG AGGACCCCCA TACGGTGCGC CTCGTGCACG CGGACCGCAC GGTCCGGGCC CGCTTCATCC TGATTGCGAC GGGCGCCACA CCGGTGCGTG AGCCGCTGAT CCCCGGTGCG GAACTCGCTA TCGATTCCAA CGGCGTGTTC GAGTTGGAGA CCCAGCCCGA GCGCATCCTC GTGGTCGGCG GCGGCTACAT CGCCGTGGAA TTCGCGGGCG TCTTCGCCAG CCTCGGCTCC AAGACCACGC TGCTCCATCG CGGACAAAGC CTGCTGCGCG GCTTCGACCC TGAGATCGCC GATGCGCTGG GCGAGGCCTA TGCCAAGCGG ATGGATCTAC GCTTGGGGCA GACCGTCGAG CGCCTGGAGC GCGACGGCTC GGCGATCCGC GCCACCCTGA ACGGGGGCGA GAGCCTCACC GTCGATTGCG TGCTGGTGGC CACCGGCCGG CGCCCGAACG TCGCCGGGCT CGGGCTGGAA CGGGTCGGGA TCGAACTCGA CGAACGCGGC GCGATTCCCG TCGAGGCGGA TTCGCGCACC CGGGTGCCGT CGATCTACGC CGTCGGCGAC GTGAACGGCC GCGCGGCGCT GACCCCCGTG GCGATTCGTG AGGGCCACGC CTTCGCCGAC ACGGTGTTCG GCAACAAGCC CTGGTGCGTC GATCACCGCC TGATTGCGAC CGCCGTGTTC TCGACGCCGG AGATCGGCGT GATCGGCCAC AACGAGGACG TGGCCCGGCG CTGCTACGGG GAGATTGACG TCTACAAGGC GAGCTTCCGC CCGATGAAAG CGACGCTCTC GGGCCGCGAC GAGCGGGTGA TCATGAAGAT TCTGGTGGAC CGCGCCAGCG ACCGCGTGGT CGGCGTCCAC GTGCTCGGCA CGGATGCCGG CGAGATCATC CAAGCGGTCG GCATCGCCGT GACCATGGGC GCGACCAAGG CCGATTTCGA CCGCACCATC GCCGTGCATC CGACGCTCGG CGAGGAACTG GTGACGATGC GGACGCCCTT CGTGGTGAAG CATCCCGTCG GCGTGGGCTA G
|
Protein sequence | MSETPMSESF DVDLFVIGGG SGGVRAARIA AGHGARVMLA EEYRVGGTCV IRGCVPKKLM VYAGRFTDEF EDAAGFGWHL ETPRFDWAVL KRSRDAEVAR LEGIYGRNLA GAGVEVVADR AVIEDPHTVR LVHADRTVRA RFILIATGAT PVREPLIPGA ELAIDSNGVF ELETQPERIL VVGGGYIAVE FAGVFASLGS KTTLLHRGQS LLRGFDPEIA DALGEAYAKR MDLRLGQTVE RLERDGSAIR ATLNGGESLT VDCVLVATGR RPNVAGLGLE RVGIELDERG AIPVEADSRT RVPSIYAVGD VNGRAALTPV AIREGHAFAD TVFGNKPWCV DHRLIATAVF STPEIGVIGH NEDVARRCYG EIDVYKASFR PMKATLSGRD ERVIMKILVD RASDRVVGVH VLGTDAGEII QAVGIAVTMG ATKADFDRTI AVHPTLGEEL VTMRTPFVVK HPVGVG
|
| |