Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2991 |
Symbol | |
ID | 8138334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3476670 |
End bp | 3478127 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644870589 |
Product | succinic semialdehyde dehydrogenase |
Protein accession | YP_003022778 |
Protein GI | 253701589 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01780] succinate-semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1.6539300000000001e-18 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTAGTTC TCAAGGACAG CAATCTTTTC CAGCAGCTCT GCTACATAAA CGGCGCGTGG ACCGGCGCTG ACAGCGGCGA GACCATCGAT GTCACCAACC CCGCCACCGG CGAGAAACTC GGAACCATCC CCAAGATGGG GGGCGCCGAG ACCCGACGCG CCATCGAAGC CGCCAACGCA GCGTTTCCCA AGTGGCGCTC CAAGACGGCG CAGGAGCGCT CCACCATCCT CAGGCGCTGG TCCGAACTGC TCCTGGAGCA CCAGGAGGAC TTAGCCGTTT TGATGACCGC AGAACAGGGG AAGCCGCTGG CAGAGTCGCG GGGCGAGACC GTTTACGCCG CGTCCTTTCT GGAGTGGTTC GCGGAGGAGG CGAAACGGAT CTACGGCGAC GTGATCCCGC CGCATCAAAG CGATAAGAGG ATCGTGGTCC TGAAGGAGCC GATCGGCGTC TGCGCCGCCA TTACCCCCTG GAATTTCCCC TCCGCGATGA TCACCAGGAA GGCGGGGCCC GCGCTCGCCG CCGGGTGCAC CATGGTGGTA AAGCCCGCGA CCGCGACCCC GTATTCGGCG CTGGCACTGG CAGAGCTTGC CCGCCGCGCC GGGGTCCCCG AGGGCGTCTT CTCCGTGGTC ACCGGCTCGG CTGCGGGGAT CGGCGGGGAG ATGACGGCGA ACCCCATCGT GCGCAAGCTC ACCTTCACCG GTTCCACCGA GATCGGGAAG AAGCTGATGG CCGAGTGCGC CGGCACGGTG AAGAAGGTCT CCATGGAGCT CGGCGGCAAC GCCCCCTTCA TAGTCTTCGA CGACGCCGAC ATCGACGCCG CGGTCGAGGG GGCGCTCATC TCCAAGTACC GCAACACCGG CCAGACCTGC GTCTGCACCA ACCGTTTCCT GGTCCAGGAC GGGGTCTACG ACCGGTTCGC GGAAAAATTG GCGCGAGCCG TCGCCAACAT GAAGGTCGGA GACGGCCTGA AAGGCGAGAC CCAGCAGGGT CCGCTGATCG ACATGAAAGC GGTGGAGAAG GTGGAGGAAC ATATCCAGGA CGCGCTTGCC GGCGGGGCGC GCGTGGTGAC CGGCGGCAAG CGCCATGCGC TGGGGGGGAG CTTCTTCGAG CCTACCGTCC TGACCGACGT GAAGCCCGGG ATGCTGGTGG CGAAAGAGGA GACCTTCGGC CCGTTGGCGC CGCTGTTCCG CTTTAAGACC GAAGAGGAGG CGGTACACAT GGCAAACGAC ACCGAGTTCG GCCTCGCCGC CTACTTCTAC AGCCAGGACG TCTCCAGGGT CTGGCGGGTC GCCGAGGCCA TCGAATATGG CATCGTCGGC ATCAACACCG GTCTCATCTC CACCACCGTC GCCCCCTTCG GCGGCGTAAA GGAGTCCGGC ATCGGGCGCG AAGGATCAAA GTACGGCATC GAGGACTTCC TCGAGGTCAA GTACCTCTGC ATAGGCGGCG TGAAGTAG
|
Protein sequence | MLVLKDSNLF QQLCYINGAW TGADSGETID VTNPATGEKL GTIPKMGGAE TRRAIEAANA AFPKWRSKTA QERSTILRRW SELLLEHQED LAVLMTAEQG KPLAESRGET VYAASFLEWF AEEAKRIYGD VIPPHQSDKR IVVLKEPIGV CAAITPWNFP SAMITRKAGP ALAAGCTMVV KPATATPYSA LALAELARRA GVPEGVFSVV TGSAAGIGGE MTANPIVRKL TFTGSTEIGK KLMAECAGTV KKVSMELGGN APFIVFDDAD IDAAVEGALI SKYRNTGQTC VCTNRFLVQD GVYDRFAEKL ARAVANMKVG DGLKGETQQG PLIDMKAVEK VEEHIQDALA GGARVVTGGK RHALGGSFFE PTVLTDVKPG MLVAKEETFG PLAPLFRFKT EEEAVHMAND TEFGLAAYFY SQDVSRVWRV AEAIEYGIVG INTGLISTTV APFGGVKESG IGREGSKYGI EDFLEVKYLC IGGVK
|
| |