Gene Information |
Locus tag | Mext_1646 |
Symbol | sucA |
ID | 5833002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1836547 |
End bp | 1839537 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641367444 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_001639116 |
Protein GI | 163851073 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
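
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1836547
end_bp = 1839537
gene_length_bp = 2991
protein_length_aa = 996

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length, and protein length agree."""
    return (
        span_bp == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


if __name__ == "__main__":
    print("span (bp):", span_bp)
    print("expected protein length (aa):", expected_protein_length)
    print("consistent:", record_is_consistent())
```

Under these assumptions the three fields agree: the coordinate span equals the stated gene length, and one codon fewer than the codon count equals the stated protein length.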
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.44941 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGCC AGGACGCGAA CGAAGCGCTT CTTCGAACCT CCTTCCTCTA CGGCGCCAAC GCCGCCTGGA TCGAGGAGCT GCAGGCGGCC TATGCCCGCG ACCCGAACTC GGTCGATCCC GAGTGGCAGC GCTTCTTCAA GGACCTGGGC GAGGACGACG CCCTGGTGAA GAAGAACGCC GAGGGCGCCT CCTGGGCCAA GCCGAACTGG CCGGTCGTGG CCAACGGCGA GATCGTCTCG GCGCTCGACG GCAATTGGGG CGCTCTCGAA AAGACGTTCG GCGAGAAGAT CCAGGCCAAG GCCCAGCCCG GCAAGCCCGG CGACTCGACC AAGGGCGCGG CCATCGTCGC GGCCACGGGC GTTTCCGTCG AGCAGGCCAC CAAGGATTCC GTGCGCGCGA TCATGCTGAT CCGCGCCTAC CGCATGCGCG GCCACCTCCA CGCCAAGCTC GACCCGATCG GGCTCGCCCC GCGCGGCGAC CACGAGGAGC TGCACCCGCA GCATTACGGC TTCCAGGAGA GCGACTGGGA CCGCAAGATC TTCCTCGACA ACGTGCTCGG CATGGAATTC TCGACGATCC GCGAGATCGT CGCGATCCTG GAGCGTACCT ACTGCCAGAC GCTCGGCGTC GAGTTCATGC ACATCTCCGA TCCTGAGGAG AAGGCGTGGA TCCAGGAGCG CATCGAGGGC AAGGACAAGG AAATCTCGTT CACGCCGGAA GGCCGGCGGG CGATCCTGAA CAAGCTGATC GAGGCCGAGG GCTTCGAGAA GTTCCTCGAT CTCAAATACA CCGGCACCAA GCGCTTCGGC CTCGACGGCG GCGAGTCGAT GGTCCCGGCC ATGGAGCAGA TCATCAAGCG CGGCGGCGCG CTCGGCATCG AGGAGATCGT GCTCGGCATG GCCCATCGCG GCCGGCTGAA CGTGCTCACC AACGTGATGG CTAAGCCCTT CCGGGCGGTG TTCCACGAGT TCAAGGGCGG CTCGGCCTCA CCCGCCGAGG TCGAAGGCTC GGGCGACGTG AAGTACCATC TCGGCGCCTC GTCCGACCGC GCCTTCGACG ACAACACCGT TCACCTCTCG CTCACCGCCA ACCCGTCCCA CCTCGAGATC GTCGATCCGG TGGTGCTCGG AAAGGTGCGG GCCAAGCAGG ACCAGAAGGC CAAGCCGAAC GTCGAGCGCC GCCGCGTGCT GCCGCTCCTC ATCCACGGCG ACGCGGCCTT TGCCGGCCAG GGCGTGGTCG CGGAATGCCT CGGCCTGTCC GGCCTGAAGG GTCACCGCAC CGGCGGCTCG ATCCACTTCA TCATCAACAA CCAGATCGGC TTCACCACCG ATCCGCGCTT CTCGCGCTCC TCGCCCTATC CGTCCGACGT GGCGAAGATG GTGGAGGCGC CGATCTTCCA CTGCAACGGC GACGACCCGG AGGCGGTGAC CTTCGCGGCG AAGGTCGCGG TCGAGTACCG GCAGAAATTC GGCAAGCCGG TCGTGATCGA CATGCTGTGC TACCGCCGCT TCGGCCACAA CGAGGGCGAC GAGCCGGCCT TCACCCAGCC GAAGATGTAC CAGCGGATCC GCAAGCATCC GACTGCACTG GAGACCTACG GCAAGAAGCT CGTCGCCCAG GGTGACCTGA CCCAGGAGCA GCTCGACGCA CGCAAGGCCG AGTTCCGCGC GATGCTGGAA AGCGAGCTCG AGGTCGCGGG CGGCTACAAG GCCAACAAGG CCGACTGGCT CGACGGCCGC TGGTCCGGCT TCAAGGCCGT GCGCGAGGAC GTGGACGATC CCCGCCGCGG CCGCACCGGG 
GTGCCGCTCG AGACGCTGCG CGACATCGCC ACCCGGATCA CCACGCCCCC GCCGGGCTTC CACCTGCACC GCACGATCCA GCGCTTCTTC GACAACCGCG CCAAGGCGGT CGAGACGGGC GTCGGCATCG ATTGGGCTAC CGCCGAGGCG CTCGCCTTCG GCTCGCTGCT GATCGAGGGC CACCGGGTCC GGCTCTCGGG CCAGGACGTC GAGCGCGGCA CCTTCTCCCA GCGCCACGCC GTGGTGATCG ATCAGGAGAA CGAGCAGCGC TACACGCCGC TCAACTCCCT GCGCGAGGGG CAGGCGAACC TGGAGGTCAT CAACTCGATG CTCTCCGAGG AGGCCGTGCT CGGCTTCGAG TACGGCTACT CGCTCGCCGA GCCGAACTCC CTGGTGCTGT GGGAGGCGCA GTTCGGCGAC TTCGCCAACG GCGCGCAGGT CGTCATCGAC CAGTTCATCT CATCGGGCGA GCGCAAGTGG CTGCGCATGT CCGGCCTCGT GATGCTGCTG CCCCACGGCT ACGAGGGCCA GGGGCCGGAG CACTCGTCCG CCCGTCTGGA GCGCTATCTC CAGATGTGCG CCGAGGACAA CATGCAGGTC GCCAACTGCT CGACGCCCTC GAACTACTTC CACATCCTGC GCCGTCAGTT GAAGCGCGAC TTCCGCAAGC CGCTGATCCT GATGACGCCG AAATCGCTGC TGCGCCACAA GCGGGCGGTC TCGAAAATCG AGGACATCGC GGACGGCTCG ACCTTCCACC GCATCCTGTG GGACGACGCC GAGCACGACG AGAACGGCGT GAAGCTCGTG CGCGACGACA AGATCCGCCG CGTCGTGCTG TGCTCGGGCA AGGTCTATTA CGACCTCTAC GAGGAGCGGG AGAAGCGCGG CGTCAACGAC GTCTACCTGA TGCGCGTCGA GCAGCTCTAC CCGTTCCCGC TCAAGGCGCT GGCCAACGAG ATGACCCGCT TCCGCAACGC GGAGGTGGTG TGGTGCCAGG AAGAGCCCAA GAACATGGGC TCGTGGACCT TCGTCGAGCC CTATCTCGAT TGGGTGCTGG GCCAGGCCGG CTCCGCCTCG AAGCGCGCTC GCTATGTCGG CCGCCCGGCC TCGGCCTCGA CCGCGGTCGG CCTGATGTCG AAGCACCTCG CCCAGCTCCA GGCCTTCCTC AACGAAGCGC TGGCGGTCTG A
|
Protein sequence | MARQDANEAL LRTSFLYGAN AAWIEELQAA YARDPNSVDP EWQRFFKDLG EDDALVKKNA EGASWAKPNW PVVANGEIVS ALDGNWGALE KTFGEKIQAK AQPGKPGDST KGAAIVAATG VSVEQATKDS VRAIMLIRAY RMRGHLHAKL DPIGLAPRGD HEELHPQHYG FQESDWDRKI FLDNVLGMEF STIREIVAIL ERTYCQTLGV EFMHISDPEE KAWIQERIEG KDKEISFTPE GRRAILNKLI EAEGFEKFLD LKYTGTKRFG LDGGESMVPA MEQIIKRGGA LGIEEIVLGM AHRGRLNVLT NVMAKPFRAV FHEFKGGSAS PAEVEGSGDV KYHLGASSDR AFDDNTVHLS LTANPSHLEI VDPVVLGKVR AKQDQKAKPN VERRRVLPLL IHGDAAFAGQ GVVAECLGLS GLKGHRTGGS IHFIINNQIG FTTDPRFSRS SPYPSDVAKM VEAPIFHCNG DDPEAVTFAA KVAVEYRQKF GKPVVIDMLC YRRFGHNEGD EPAFTQPKMY QRIRKHPTAL ETYGKKLVAQ GDLTQEQLDA RKAEFRAMLE SELEVAGGYK ANKADWLDGR WSGFKAVRED VDDPRRGRTG VPLETLRDIA TRITTPPPGF HLHRTIQRFF DNRAKAVETG VGIDWATAEA LAFGSLLIEG HRVRLSGQDV ERGTFSQRHA VVIDQENEQR YTPLNSLREG QANLEVINSM LSEEAVLGFE YGYSLAEPNS LVLWEAQFGD FANGAQVVID QFISSGERKW LRMSGLVMLL PHGYEGQGPE HSSARLERYL QMCAEDNMQV ANCSTPSNYF HILRRQLKRD FRKPLILMTP KSLLRHKRAV SKIEDIADGS TFHRILWDDA EHDENGVKLV RDDKIRRVVL CSGKVYYDLY EEREKRGVND VYLMRVEQLY PFPLKALANE MTRFRNAEVV WCQEEPKNMG SWTFVEPYLD WVLGQAGSAS KRARYVGRPA SASTAVGLMS KHLAQLQAFL NEALAV
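
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as text in space-separated 10-base groups as shown in this record; `gc_fraction` is a hypothetical helper name, and the example input is only the first two groups of the gene sequence, so its result is not the whole-gene value.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces, line breaks) used for grouping is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # First two 10-base groups of the gene sequence in this record.
    prefix = "ATGGCACGCC AGGACGCGAA"
    print(f"GC fraction of prefix: {gc_fraction(prefix):.2f}")
```

Passing the full gene sequence text, including both lines of the Sequence field, would give the whole-gene fraction for comparison with the GC content field.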