Gene Mext_2301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2301 
Symbol 
ID5835650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2550268 
End bp2551668 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content70% 
IMG OID641368100 
Productpyridine nucleotide-disulphide oxidoreductase dimerisation region 
Protein accessionYP_001639767 
Protein GI163851724 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID[TIGR01424] glutathione-disulfide reductase, plant 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0108496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00487639 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGAGA CACCGATGAG CGAGTCCTTC GACGTCGACC TGTTCGTGAT CGGTGGCGGT 
TCGGGCGGGG TGCGGGCGGC CCGCATCGCC GCGGGCCACG GCGCGCGGGT GATGCTGGCC
GAGGAGTACC GCGTCGGCGG CACCTGCGTG ATCCGCGGCT GCGTCCCGAA GAAGCTGATG
GTCTATGCCG GCCGCTTCAC CGACGAGTTC GAGGACGCCG CCGGCTTTGG CTGGCACCTC
GAGACGCCGC GCTTTGACTG GGCCGTTCTG AAGCGCTCCC GCGACGCGGA GGTCGCGCGG
CTGGAGGGCA TCTACGGCCG CAACCTCGCG GGCGCCGGGG TCGAGGTCGT GGCCGACCGC
GCGGTGATCG AGGACCCCCA TACGGTGCGC CTCGTGCACG CGGACCGCAC GGTCCGGGCC
CGCTTCATCC TGATTGCGAC GGGCGCCACA CCGGTGCGTG AGCCGCTGAT CCCCGGTGCG
GAACTCGCTA TCGATTCCAA CGGCGTGTTC GAGTTGGAGA CCCAGCCCGA GCGCATCCTC
GTGGTCGGCG GCGGCTACAT CGCCGTGGAA TTCGCGGGCG TCTTCGCCAG CCTCGGCTCC
AAGACCACGC TGCTCCATCG CGGACAAAGC CTGCTGCGCG GCTTCGACCC TGAGATCGCC
GATGCGCTGG GCGAGGCCTA TGCCAAGCGG ATGGATCTAC GCTTGGGGCA GACCGTCGAG
CGCCTGGAGC GCGACGGCTC GGCGATCCGC GCCACCCTGA ACGGGGGCGA GAGCCTCACC
GTCGATTGCG TGCTGGTGGC CACCGGCCGG CGCCCGAACG TCGCCGGGCT CGGGCTGGAA
CGGGTCGGGA TCGAACTCGA CGAACGCGGC GCGATTCCCG TCGAGGCGGA TTCGCGCACC
CGGGTGCCGT CGATCTACGC CGTCGGCGAC GTGAACGGCC GCGCGGCGCT GACCCCCGTG
GCGATTCGTG AGGGCCACGC CTTCGCCGAC ACGGTGTTCG GCAACAAGCC CTGGTGCGTC
GATCACCGCC TGATTGCGAC CGCCGTGTTC TCGACGCCGG AGATCGGCGT GATCGGCCAC
AACGAGGACG TGGCCCGGCG CTGCTACGGG GAGATTGACG TCTACAAGGC GAGCTTCCGC
CCGATGAAAG CGACGCTCTC GGGCCGCGAC GAGCGGGTGA TCATGAAGAT TCTGGTGGAC
CGCGCCAGCG ACCGCGTGGT CGGCGTCCAC GTGCTCGGCA CGGATGCCGG CGAGATCATC
CAAGCGGTCG GCATCGCCGT GACCATGGGC GCGACCAAGG CCGATTTCGA CCGCACCATC
GCCGTGCATC CGACGCTCGG CGAGGAACTG GTGACGATGC GGACGCCCTT CGTGGTGAAG
CATCCCGTCG GCGTGGGCTA G
 
Protein sequence
MSETPMSESF DVDLFVIGGG SGGVRAARIA AGHGARVMLA EEYRVGGTCV IRGCVPKKLM 
VYAGRFTDEF EDAAGFGWHL ETPRFDWAVL KRSRDAEVAR LEGIYGRNLA GAGVEVVADR
AVIEDPHTVR LVHADRTVRA RFILIATGAT PVREPLIPGA ELAIDSNGVF ELETQPERIL
VVGGGYIAVE FAGVFASLGS KTTLLHRGQS LLRGFDPEIA DALGEAYAKR MDLRLGQTVE
RLERDGSAIR ATLNGGESLT VDCVLVATGR RPNVAGLGLE RVGIELDERG AIPVEADSRT
RVPSIYAVGD VNGRAALTPV AIREGHAFAD TVFGNKPWCV DHRLIATAVF STPEIGVIGH
NEDVARRCYG EIDVYKASFR PMKATLSGRD ERVIMKILVD RASDRVVGVH VLGTDAGEII
QAVGIAVTMG ATKADFDRTI AVHPTLGEEL VTMRTPFVVK HPVGVG