Gene Mext_1646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1646 
SymbolsucA 
ID5833002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1836547 
End bp1839537 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content67% 
IMG OID641367444 
Product2-oxoglutarate dehydrogenase E1 component 
Protein accessionYP_001639116 
Protein GI163851073 
COG category[C] Energy production and conversion 
COG ID[COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes 
TIGRFAM ID[TIGR00239] 2-oxoglutarate dehydrogenase, E1 component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.44941 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGCC AGGACGCGAA CGAAGCGCTT CTTCGAACCT CCTTCCTCTA CGGCGCCAAC 
GCCGCCTGGA TCGAGGAGCT GCAGGCGGCC TATGCCCGCG ACCCGAACTC GGTCGATCCC
GAGTGGCAGC GCTTCTTCAA GGACCTGGGC GAGGACGACG CCCTGGTGAA GAAGAACGCC
GAGGGCGCCT CCTGGGCCAA GCCGAACTGG CCGGTCGTGG CCAACGGCGA GATCGTCTCG
GCGCTCGACG GCAATTGGGG CGCTCTCGAA AAGACGTTCG GCGAGAAGAT CCAGGCCAAG
GCCCAGCCCG GCAAGCCCGG CGACTCGACC AAGGGCGCGG CCATCGTCGC GGCCACGGGC
GTTTCCGTCG AGCAGGCCAC CAAGGATTCC GTGCGCGCGA TCATGCTGAT CCGCGCCTAC
CGCATGCGCG GCCACCTCCA CGCCAAGCTC GACCCGATCG GGCTCGCCCC GCGCGGCGAC
CACGAGGAGC TGCACCCGCA GCATTACGGC TTCCAGGAGA GCGACTGGGA CCGCAAGATC
TTCCTCGACA ACGTGCTCGG CATGGAATTC TCGACGATCC GCGAGATCGT CGCGATCCTG
GAGCGTACCT ACTGCCAGAC GCTCGGCGTC GAGTTCATGC ACATCTCCGA TCCTGAGGAG
AAGGCGTGGA TCCAGGAGCG CATCGAGGGC AAGGACAAGG AAATCTCGTT CACGCCGGAA
GGCCGGCGGG CGATCCTGAA CAAGCTGATC GAGGCCGAGG GCTTCGAGAA GTTCCTCGAT
CTCAAATACA CCGGCACCAA GCGCTTCGGC CTCGACGGCG GCGAGTCGAT GGTCCCGGCC
ATGGAGCAGA TCATCAAGCG CGGCGGCGCG CTCGGCATCG AGGAGATCGT GCTCGGCATG
GCCCATCGCG GCCGGCTGAA CGTGCTCACC AACGTGATGG CTAAGCCCTT CCGGGCGGTG
TTCCACGAGT TCAAGGGCGG CTCGGCCTCA CCCGCCGAGG TCGAAGGCTC GGGCGACGTG
AAGTACCATC TCGGCGCCTC GTCCGACCGC GCCTTCGACG ACAACACCGT TCACCTCTCG
CTCACCGCCA ACCCGTCCCA CCTCGAGATC GTCGATCCGG TGGTGCTCGG AAAGGTGCGG
GCCAAGCAGG ACCAGAAGGC CAAGCCGAAC GTCGAGCGCC GCCGCGTGCT GCCGCTCCTC
ATCCACGGCG ACGCGGCCTT TGCCGGCCAG GGCGTGGTCG CGGAATGCCT CGGCCTGTCC
GGCCTGAAGG GTCACCGCAC CGGCGGCTCG ATCCACTTCA TCATCAACAA CCAGATCGGC
TTCACCACCG ATCCGCGCTT CTCGCGCTCC TCGCCCTATC CGTCCGACGT GGCGAAGATG
GTGGAGGCGC CGATCTTCCA CTGCAACGGC GACGACCCGG AGGCGGTGAC CTTCGCGGCG
AAGGTCGCGG TCGAGTACCG GCAGAAATTC GGCAAGCCGG TCGTGATCGA CATGCTGTGC
TACCGCCGCT TCGGCCACAA CGAGGGCGAC GAGCCGGCCT TCACCCAGCC GAAGATGTAC
CAGCGGATCC GCAAGCATCC GACTGCACTG GAGACCTACG GCAAGAAGCT CGTCGCCCAG
GGTGACCTGA CCCAGGAGCA GCTCGACGCA CGCAAGGCCG AGTTCCGCGC GATGCTGGAA
AGCGAGCTCG AGGTCGCGGG CGGCTACAAG GCCAACAAGG CCGACTGGCT CGACGGCCGC
TGGTCCGGCT TCAAGGCCGT GCGCGAGGAC GTGGACGATC CCCGCCGCGG CCGCACCGGG
GTGCCGCTCG AGACGCTGCG CGACATCGCC ACCCGGATCA CCACGCCCCC GCCGGGCTTC
CACCTGCACC GCACGATCCA GCGCTTCTTC GACAACCGCG CCAAGGCGGT CGAGACGGGC
GTCGGCATCG ATTGGGCTAC CGCCGAGGCG CTCGCCTTCG GCTCGCTGCT GATCGAGGGC
CACCGGGTCC GGCTCTCGGG CCAGGACGTC GAGCGCGGCA CCTTCTCCCA GCGCCACGCC
GTGGTGATCG ATCAGGAGAA CGAGCAGCGC TACACGCCGC TCAACTCCCT GCGCGAGGGG
CAGGCGAACC TGGAGGTCAT CAACTCGATG CTCTCCGAGG AGGCCGTGCT CGGCTTCGAG
TACGGCTACT CGCTCGCCGA GCCGAACTCC CTGGTGCTGT GGGAGGCGCA GTTCGGCGAC
TTCGCCAACG GCGCGCAGGT CGTCATCGAC CAGTTCATCT CATCGGGCGA GCGCAAGTGG
CTGCGCATGT CCGGCCTCGT GATGCTGCTG CCCCACGGCT ACGAGGGCCA GGGGCCGGAG
CACTCGTCCG CCCGTCTGGA GCGCTATCTC CAGATGTGCG CCGAGGACAA CATGCAGGTC
GCCAACTGCT CGACGCCCTC GAACTACTTC CACATCCTGC GCCGTCAGTT GAAGCGCGAC
TTCCGCAAGC CGCTGATCCT GATGACGCCG AAATCGCTGC TGCGCCACAA GCGGGCGGTC
TCGAAAATCG AGGACATCGC GGACGGCTCG ACCTTCCACC GCATCCTGTG GGACGACGCC
GAGCACGACG AGAACGGCGT GAAGCTCGTG CGCGACGACA AGATCCGCCG CGTCGTGCTG
TGCTCGGGCA AGGTCTATTA CGACCTCTAC GAGGAGCGGG AGAAGCGCGG CGTCAACGAC
GTCTACCTGA TGCGCGTCGA GCAGCTCTAC CCGTTCCCGC TCAAGGCGCT GGCCAACGAG
ATGACCCGCT TCCGCAACGC GGAGGTGGTG TGGTGCCAGG AAGAGCCCAA GAACATGGGC
TCGTGGACCT TCGTCGAGCC CTATCTCGAT TGGGTGCTGG GCCAGGCCGG CTCCGCCTCG
AAGCGCGCTC GCTATGTCGG CCGCCCGGCC TCGGCCTCGA CCGCGGTCGG CCTGATGTCG
AAGCACCTCG CCCAGCTCCA GGCCTTCCTC AACGAAGCGC TGGCGGTCTG A
 
Protein sequence
MARQDANEAL LRTSFLYGAN AAWIEELQAA YARDPNSVDP EWQRFFKDLG EDDALVKKNA 
EGASWAKPNW PVVANGEIVS ALDGNWGALE KTFGEKIQAK AQPGKPGDST KGAAIVAATG
VSVEQATKDS VRAIMLIRAY RMRGHLHAKL DPIGLAPRGD HEELHPQHYG FQESDWDRKI
FLDNVLGMEF STIREIVAIL ERTYCQTLGV EFMHISDPEE KAWIQERIEG KDKEISFTPE
GRRAILNKLI EAEGFEKFLD LKYTGTKRFG LDGGESMVPA MEQIIKRGGA LGIEEIVLGM
AHRGRLNVLT NVMAKPFRAV FHEFKGGSAS PAEVEGSGDV KYHLGASSDR AFDDNTVHLS
LTANPSHLEI VDPVVLGKVR AKQDQKAKPN VERRRVLPLL IHGDAAFAGQ GVVAECLGLS
GLKGHRTGGS IHFIINNQIG FTTDPRFSRS SPYPSDVAKM VEAPIFHCNG DDPEAVTFAA
KVAVEYRQKF GKPVVIDMLC YRRFGHNEGD EPAFTQPKMY QRIRKHPTAL ETYGKKLVAQ
GDLTQEQLDA RKAEFRAMLE SELEVAGGYK ANKADWLDGR WSGFKAVRED VDDPRRGRTG
VPLETLRDIA TRITTPPPGF HLHRTIQRFF DNRAKAVETG VGIDWATAEA LAFGSLLIEG
HRVRLSGQDV ERGTFSQRHA VVIDQENEQR YTPLNSLREG QANLEVINSM LSEEAVLGFE
YGYSLAEPNS LVLWEAQFGD FANGAQVVID QFISSGERKW LRMSGLVMLL PHGYEGQGPE
HSSARLERYL QMCAEDNMQV ANCSTPSNYF HILRRQLKRD FRKPLILMTP KSLLRHKRAV
SKIEDIADGS TFHRILWDDA EHDENGVKLV RDDKIRRVVL CSGKVYYDLY EEREKRGVND
VYLMRVEQLY PFPLKALANE MTRFRNAEVV WCQEEPKNMG SWTFVEPYLD WVLGQAGSAS
KRARYVGRPA SASTAVGLMS KHLAQLQAFL NEALAV