Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_29770 |
Symbol | sucA |
ID | 7761878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3078251 |
End bp | 3081082 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643805850 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_002800118 |
Protein GI | 226945045 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGATA GCGTTATGCA GCGCATGTGG AACAGTGCCC ACCTTTCCGG TGGAAATGCT GCTTATGTGG AAGAGCTCTA TGAGCTTTAC CTGCACGACC CCAACGCTGT GCCCGAAGAG TGGCGCACCT ATTTCGAGAA GCTGCCCGCC GAAGCCGGTA CCTCCACCGA TGTTCCGCAC GCTCCCGTCC GCGACCAGTT CGTTCTGCTG GCCAAGAACC AGCGGCGTGC ACAGCCGGTC GCAACCTCCA GCGTAAGCAC CGAACACGAG AAGAAGCAGG TCGAAGTCCT GCGTCTGATC CAGGCTTACC GCACGCGCGG CCATCAGGCC TCGCAACTGG ATCCGCTCGG TCTGTGGCAG CGCACTGCGC CTTCCGACCT GTCGATCACC CATTACGGGC TGACCAACGC CGATCTGGAT ACCCCCTTCC GTACCGGAGA GCTCTACATC GGTAAGGAAG AGGCGACCCT ACGCGAAATC CTGCAGGCAT TGCAGGAGAC ATATTGCCGC ACTATCGGCG CCGAATTCAC CCATATCGTC GATTCCGAGC AGCGCAACTG GTTCGCCCAG CGCCTGGAAA GCGTACGCGG TCGTCCGGTG TACTCCAAGG AGGCGAAAAG CCACCTGCTC GAGCGCCTGA GCGCTGCCGA AGGCCTGGAA AAATACCTGG GCACCAAATA TCCGGGCACC AAGCGCTTCG GTTTGGAAGG CGGCGAGAGC CTGGTGCCGG TCGTCGACGA GATCATCCAG CGCTCAGGCT CCTACGGCAC CAAGGAAGTC GTCATCGGCA TGGCTCACCG CGGCCGTCTG AACCTGCTGG TCAACGCGCT GGGCAAGAAT CCACGCGACC TGTTCGACGA GTTCGAAGGC AAGCATCTGG TCGAACTGGG CTCCGGTGAC GTGAAGTACC ACCAGGGCTT CTCCTCCAAC GTGATGACCT CGGGTGGCGA AGTGCACCTG GCCATGGCGT TCAACCCCTC GCACCTGGAA ATCGTCTCCC CGGTGGTCGA GGGCTCCGTG CGTGCCCGCC AGGACCGTCG CGTCGACGCT ACCGGCGAGA AGGTGGTACC GATCTCCATC CACGGCGACT CGGCCTTTGC CGGTCAGGGC GTGGTGATGG AAACCTTCCA GATGTCGCAG ATCCGCGGCT ACAAGACGGG CGGTACCATT CACATCGTGG TCAACAACCA GGTCGGCTTC ACCACCAGTA ACCCGGTCGA CACCCGTTCG ACCGAGTACT GCACCGATCC GGCGAAGATG ATTCAGGCGC CGGTACTGCA CGTCAACGGC GACGATCCGG AAGCCGTGCT GTTCGTGACC CAACTGGCCG TCGACTATCG CATGCAGTTC AAGCGCGACG TAGTCATCGA TCTGGTCTGC TACCGCCGTC GCGGTCACAA CGAGGCCGAC GAGCCGAGCG GCACCCAGCC GCTGATGTAC CAGAAGATCG CCAAGCAGCC CACCACCCGC GAGCTGTATG CCGACGCGCT GGTCAAGGAG GGCAGCCTGA GCCAGGAAGA AGTCCAGGCC AAGGTCGACG AATACCGTAC CGCGCTGGAT AACGGTCAGC ACGTGCTCAA GAGCCTGGTC AAGGAGCCGA ACACCGAGCT GTTCGTCGAC TGGACCCCCT ATCTGGGCCA TGCCTGGACC GCTCGTCACG ACACCAGCTT CGAGCTGAAG ACCCTGCAGG AACTGAACGC CAAGCTGCTG CAGATCCCGG AAGGTTTCGT GGTCCAGCGC CAGGTTGCCA AGATCCTCGA GGATCGCGGC CGCATGGGCG TCGGCGCCAT GCCGATCAAC TGGGGCTGCG CCGAGACCCT GGCCTACGCC ACTCTGCTGA AGGAAGGCCA TCCGGTACGC ATCACCGGTC AGGACGTCGG CCGTGGCACC TTCTCGCACC GCCATGCCGC GCTGCACAAC CAGAAGGATG CCAGCCGCTA CATCCCGCTG CAGAACCTCT ACGAGGGACA GCCGAAGTTC GAGCTGTATG ATTCCTTCCT CTCGGAAGAG GCCGTGCTGG CCTTCGAATA CGGCTATGCC ACCACCACGC CGAACGCGCT GGTGATCTGG GAAGCCCAGT TCGGCGACTT CGCCAACGGT GCCCAGGTGG TGATCGACCA GTTCATCAGC AGCGGCGAGA CCAAGTGGGG ACGCCTGTGC GGCCTGACCA TGCTGCTGCC GCACGGCTAC GAGGGCCAGG GTCCGGAGCA CTCTTCCGCA CGTCTGGAGC GCTACCTGCA ACTGTGCGCC GAGCAGAACA TCCAGGTCTG CGTGCCGACC ACCCCGGCGC AGGTCTACCA CATGCTGCGT CGCCAGGTGA TCCGCCCGCT GCGCAAGCCG CTGGTGGCCT TGACGCCGAA GTCGCTGCTG CGTCACAAAT CGGCGATCTC CACCCTGGAA GATCTGGCTC TCGGCTCCTT CCATCCGGTC CTGCCGGAGG TCGATAGCCT CGATCCGAAG AAGGTCGAGC GTCTGGTCCT GTGCAGCGGC AAGGTCTACT ACGACCTGCT GGACAAGCGC CATGCCGAAG GTCGCGAGGA CATCGCCATC GTCCGTATCG AGCAGCTCTA TCCGTTCCCG GAAGAGGAAC TGGCCGAGGT CATGGCGCCG TACACCAACC TCAAGCATGT GGTCTGGTGT CAGGAAGAGC CGATGAACCA GGGCGCCTGG TACTGCAGTC AGCATCACAT GCGTCGCGTC GCCAGCGCGC ACAAGAAGGA GCTGTTCCTC CAGTATGCTG GTCGCGAGGC GTCGGCCGCT CCGGCTTGCG GTTACGCGTC GATGCATGCC GAGCAGCAGG AAAAACTGCT GCAGGACGCT TTCACTGTTT AA
|
Protein sequence | MQDSVMQRMW NSAHLSGGNA AYVEELYELY LHDPNAVPEE WRTYFEKLPA EAGTSTDVPH APVRDQFVLL AKNQRRAQPV ATSSVSTEHE KKQVEVLRLI QAYRTRGHQA SQLDPLGLWQ RTAPSDLSIT HYGLTNADLD TPFRTGELYI GKEEATLREI LQALQETYCR TIGAEFTHIV DSEQRNWFAQ RLESVRGRPV YSKEAKSHLL ERLSAAEGLE KYLGTKYPGT KRFGLEGGES LVPVVDEIIQ RSGSYGTKEV VIGMAHRGRL NLLVNALGKN PRDLFDEFEG KHLVELGSGD VKYHQGFSSN VMTSGGEVHL AMAFNPSHLE IVSPVVEGSV RARQDRRVDA TGEKVVPISI HGDSAFAGQG VVMETFQMSQ IRGYKTGGTI HIVVNNQVGF TTSNPVDTRS TEYCTDPAKM IQAPVLHVNG DDPEAVLFVT QLAVDYRMQF KRDVVIDLVC YRRRGHNEAD EPSGTQPLMY QKIAKQPTTR ELYADALVKE GSLSQEEVQA KVDEYRTALD NGQHVLKSLV KEPNTELFVD WTPYLGHAWT ARHDTSFELK TLQELNAKLL QIPEGFVVQR QVAKILEDRG RMGVGAMPIN WGCAETLAYA TLLKEGHPVR ITGQDVGRGT FSHRHAALHN QKDASRYIPL QNLYEGQPKF ELYDSFLSEE AVLAFEYGYA TTTPNALVIW EAQFGDFANG AQVVIDQFIS SGETKWGRLC GLTMLLPHGY EGQGPEHSSA RLERYLQLCA EQNIQVCVPT TPAQVYHMLR RQVIRPLRKP LVALTPKSLL RHKSAISTLE DLALGSFHPV LPEVDSLDPK KVERLVLCSG KVYYDLLDKR HAEGREDIAI VRIEQLYPFP EEELAEVMAP YTNLKHVVWC QEEPMNQGAW YCSQHHMRRV ASAHKKELFL QYAGREASAA PACGYASMHA EQQEKLLQDA FTV
|
| |