Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0716 |
Symbol | sucA |
ID | 8136031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 856948 |
End bp | 859638 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644868333 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_003020548 |
Protein GI | 253699359 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 7.8642e-20 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGATAT TGGACAACCT GAGTCCGCTT TGGATCGAGA ACCAGTACGA TGTATGGAAG AAGGATCCGG AGCAGCTTTC CGAGGCGTGG CGCGCCTTCT TCAACGGCTT CGAACTGGGA GCCAATCAAC CGCCCCAAGA GGTTGCCGCT CTTGGGCTTA ATGAGGTTCT GAAGCACTCG GGGGTCCAAT CCCTCATCTA CCGCTACCGG GACATCGGGC ACCTGCTTGC CTGCACCGAC CCGCTTTCCC CCTGCCAGAT AGAACACCCG CTCCTCTCCC TGGACGCCTT CGGCCTGGAG CCGTCCGACC TCGATAAGAC CTTCCATACC AAACGTTTCA TGAAGCAGAG CGCCACCCTT AGGGAGATCC TGCAGGTGAT GCGCGGCACC TACTGCGGTT CCATCGGCGT CGAATTCATG CACCTGCAAA ACCCCGAAGA GCGGCAGTGG CTCATCGACC GGATGGAGCC CGTGGCGAAC CGGGGACACT TCGACGCGCA AAAGCGGCTG AGGCTCTTGA AGAAGCTCAA GGAGGCGGCG CTTTTCGAAC GCGAACTGCA CAAGAAGTTC CCGGGACAGA CCCGCTTCTC GCTGGAAGGG GGGGACGCCC TTATCCCGCT GCTGGACGCA GCCGTGGTGA ACGCCGCCGA GCTCGGCGTC ACCGACGTCG TCTTCGGGAT GCCCCACCGC GGCCGCCTCA ACGTTTTGGC CAACATCTTC GGCATGCCTT ACGAGAACCT CTTCGCCGAA TTCGGCGACA ACAGGGAATA TGGCGTCGTC GGCGAGGGGG ACGTCAAGTA CCACAAGGGA TACTCCGTAG ATCTCACCTT AACCGGCGAG CGGGCCATAC ACTTAACCCT CACCTCCAAT CCGAGCCACC TCGAGGCGAT CGACCCGGTG GTGCAGGGTA AGTGCCGCGC CCGCCAGGAC CGGGTGGGGG AGGGGGCGGA GGGCCGGGTG CTGCCGCTTC TGATACACGG GGACGCGGCT TTCGCCGGGC AGGGGGTGGT GGCCGAGACC TTGAACCTCT CGCAGCTTGC CGGCTACAGG ACCGGCGGCA CCCTGCACGT GGTCCTCAAC AACCAGATCG GTTTCACCAC CTCGGCCGCC GACGCCCGGT CCAGCCACTA CGCCACCGAC GTGGCCAAGA TGGTGCGGGC GCCTGTCTTC CATGTCTACG GCGACGACGC CGAGGCGGTG GTGCGCATAG CCCAGCTGGC GCTTGAGTAC CGGGACCGCT ACAGAAAGGA CGTGGTGGTC GAGGTGATCT GCTACCGCAG ACACGGCCAC AACGAGGGAG ACGAGCCCTA CTTCACTCAG CCGCTCATGT ACGAGCAGAT CAGGCTGCGC CCCCCGCTGC ACTCGCTCTA CGAGATGGAG CTTTTGGGGG AGGGATTCGC CGAGGAGGAG TTGAAGGAGG TCGAGAACGA GGTGGCGCAG CGCCTGGCAC AGGCAGGGGG AAAAAATGCG GAGCCGGTCG AGAGCGCCTT CCTGGCACGT TGGAGCGGCA TGAAACCCGG AACCGAAAAG GGCTCCGTCT CCACCGCGGT GGCGGCGGCT TCGCTCATGG AACTCTCGGA AAAGCTCAAC CTCATCCCCG ACGGCTTCCA GCCGCACCCC AAGGTGGCGG GGATCCTGCA GAAGCGGCGC GAGGCGGTAT TGAAGGGTGG ACCGCTCGAC TGGGGGAACG TGGAGGCCCT GGCCTTCGGC ACCCTGCTGG CGCAGGGTAT CCCGGTCCGC CTTTCGGGGC AGGACGTCAG GCGCGGCACC TTCAGCCACC GGCACGCCGT CCTCTTCGAC CAGCAAAACG GCCGCTCCTA TCTCCCCTTG GCGGGAGTCG GCGCGCAAGG GGCGCTCTTC TGCGTCTACG ACAGCATGCT GGCCGAGTTC TCCGTCTTGG GGTTCGAGTA CGGCTACTCC ATCGAGGCCC CCGAGGCGCT CACCATCTGG GAGGCGCAGT ACGGCGACTT CGCCAACGGC GCCCAGGTGA TCATCGACCA GTTCATCGCA AGCGGCGAAG CCAAGTGGGG GCGCTCCAGC GGGCTAGTGC TGATGCTCCC TCACGGCTAC GAGGGGCAGG GGGCCGAGCA TTCCAGCGCC CGCATAGAGC GCTTCCTCGA ACTGGCGGCA GCCGGGAACA TCCAGGTCGC CTACCCCACC ACGCCCGCCC AGCTCTTCCA CGTGCTGCGG CGGCAGATGC TGCAGCCTTT CCGCAAGCCG CTCATCCTTT TCACCCCCAA GAGCCTGCTG CGGCACCCCG ACTGCGTCTC CAAACTGGAG GAGCTTTCCT CCGGCGGTTT CAAAGAGGTG ATCGCGGAGC CCCCGGCGGG CGGAGAGGTG CAAGAGGTGC TTCTTTGCAG CGGCAAGATC TACTACGACC TTCTGGGGCG GATCAGGAAG GACCAGTTGC AGGGGCGTGC GCTGGTGCGT ATCGAGCAGC TCTACCCCCT GCCTATGGGC CTTCTGCGGG AGGAGTTGCA GCGCTATCCC TCCGGCGTCC GTTACAGCTG GGTGCAGGAG GAGCCTCGCA ACATGGGAGG GTGGCGATTC CTGCACGAGC CTCTTTGCGA GATACTCGGG GTGGTGCCCC GGTACGTCGG GCGGCCGGAA GCAGCGGCGC CGGCCTCCGG CTCACACCGG CTGGACCGGG TGGAGCAGGA GCGCATCATA GAGGACGCGC TGCAGCATTG A
|
Protein sequence | MGILDNLSPL WIENQYDVWK KDPEQLSEAW RAFFNGFELG ANQPPQEVAA LGLNEVLKHS GVQSLIYRYR DIGHLLACTD PLSPCQIEHP LLSLDAFGLE PSDLDKTFHT KRFMKQSATL REILQVMRGT YCGSIGVEFM HLQNPEERQW LIDRMEPVAN RGHFDAQKRL RLLKKLKEAA LFERELHKKF PGQTRFSLEG GDALIPLLDA AVVNAAELGV TDVVFGMPHR GRLNVLANIF GMPYENLFAE FGDNREYGVV GEGDVKYHKG YSVDLTLTGE RAIHLTLTSN PSHLEAIDPV VQGKCRARQD RVGEGAEGRV LPLLIHGDAA FAGQGVVAET LNLSQLAGYR TGGTLHVVLN NQIGFTTSAA DARSSHYATD VAKMVRAPVF HVYGDDAEAV VRIAQLALEY RDRYRKDVVV EVICYRRHGH NEGDEPYFTQ PLMYEQIRLR PPLHSLYEME LLGEGFAEEE LKEVENEVAQ RLAQAGGKNA EPVESAFLAR WSGMKPGTEK GSVSTAVAAA SLMELSEKLN LIPDGFQPHP KVAGILQKRR EAVLKGGPLD WGNVEALAFG TLLAQGIPVR LSGQDVRRGT FSHRHAVLFD QQNGRSYLPL AGVGAQGALF CVYDSMLAEF SVLGFEYGYS IEAPEALTIW EAQYGDFANG AQVIIDQFIA SGEAKWGRSS GLVLMLPHGY EGQGAEHSSA RIERFLELAA AGNIQVAYPT TPAQLFHVLR RQMLQPFRKP LILFTPKSLL RHPDCVSKLE ELSSGGFKEV IAEPPAGGEV QEVLLCSGKI YYDLLGRIRK DQLQGRALVR IEQLYPLPMG LLREELQRYP SGVRYSWVQE EPRNMGGWRF LHEPLCEILG VVPRYVGRPE AAAPASGSHR LDRVEQERII EDALQH
|
| |