Gene GM21_2991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2991 
Symbol 
ID8138334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3476670 
End bp3478127 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content65% 
IMG OID644870589 
Productsuccinic semialdehyde dehydrogenase 
Protein accessionYP_003022778 
Protein GI253701589 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value1.6539300000000001e-18 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAGTTC TCAAGGACAG CAATCTTTTC CAGCAGCTCT GCTACATAAA CGGCGCGTGG 
ACCGGCGCTG ACAGCGGCGA GACCATCGAT GTCACCAACC CCGCCACCGG CGAGAAACTC
GGAACCATCC CCAAGATGGG GGGCGCCGAG ACCCGACGCG CCATCGAAGC CGCCAACGCA
GCGTTTCCCA AGTGGCGCTC CAAGACGGCG CAGGAGCGCT CCACCATCCT CAGGCGCTGG
TCCGAACTGC TCCTGGAGCA CCAGGAGGAC TTAGCCGTTT TGATGACCGC AGAACAGGGG
AAGCCGCTGG CAGAGTCGCG GGGCGAGACC GTTTACGCCG CGTCCTTTCT GGAGTGGTTC
GCGGAGGAGG CGAAACGGAT CTACGGCGAC GTGATCCCGC CGCATCAAAG CGATAAGAGG
ATCGTGGTCC TGAAGGAGCC GATCGGCGTC TGCGCCGCCA TTACCCCCTG GAATTTCCCC
TCCGCGATGA TCACCAGGAA GGCGGGGCCC GCGCTCGCCG CCGGGTGCAC CATGGTGGTA
AAGCCCGCGA CCGCGACCCC GTATTCGGCG CTGGCACTGG CAGAGCTTGC CCGCCGCGCC
GGGGTCCCCG AGGGCGTCTT CTCCGTGGTC ACCGGCTCGG CTGCGGGGAT CGGCGGGGAG
ATGACGGCGA ACCCCATCGT GCGCAAGCTC ACCTTCACCG GTTCCACCGA GATCGGGAAG
AAGCTGATGG CCGAGTGCGC CGGCACGGTG AAGAAGGTCT CCATGGAGCT CGGCGGCAAC
GCCCCCTTCA TAGTCTTCGA CGACGCCGAC ATCGACGCCG CGGTCGAGGG GGCGCTCATC
TCCAAGTACC GCAACACCGG CCAGACCTGC GTCTGCACCA ACCGTTTCCT GGTCCAGGAC
GGGGTCTACG ACCGGTTCGC GGAAAAATTG GCGCGAGCCG TCGCCAACAT GAAGGTCGGA
GACGGCCTGA AAGGCGAGAC CCAGCAGGGT CCGCTGATCG ACATGAAAGC GGTGGAGAAG
GTGGAGGAAC ATATCCAGGA CGCGCTTGCC GGCGGGGCGC GCGTGGTGAC CGGCGGCAAG
CGCCATGCGC TGGGGGGGAG CTTCTTCGAG CCTACCGTCC TGACCGACGT GAAGCCCGGG
ATGCTGGTGG CGAAAGAGGA GACCTTCGGC CCGTTGGCGC CGCTGTTCCG CTTTAAGACC
GAAGAGGAGG CGGTACACAT GGCAAACGAC ACCGAGTTCG GCCTCGCCGC CTACTTCTAC
AGCCAGGACG TCTCCAGGGT CTGGCGGGTC GCCGAGGCCA TCGAATATGG CATCGTCGGC
ATCAACACCG GTCTCATCTC CACCACCGTC GCCCCCTTCG GCGGCGTAAA GGAGTCCGGC
ATCGGGCGCG AAGGATCAAA GTACGGCATC GAGGACTTCC TCGAGGTCAA GTACCTCTGC
ATAGGCGGCG TGAAGTAG
 
Protein sequence
MLVLKDSNLF QQLCYINGAW TGADSGETID VTNPATGEKL GTIPKMGGAE TRRAIEAANA 
AFPKWRSKTA QERSTILRRW SELLLEHQED LAVLMTAEQG KPLAESRGET VYAASFLEWF
AEEAKRIYGD VIPPHQSDKR IVVLKEPIGV CAAITPWNFP SAMITRKAGP ALAAGCTMVV
KPATATPYSA LALAELARRA GVPEGVFSVV TGSAAGIGGE MTANPIVRKL TFTGSTEIGK
KLMAECAGTV KKVSMELGGN APFIVFDDAD IDAAVEGALI SKYRNTGQTC VCTNRFLVQD
GVYDRFAEKL ARAVANMKVG DGLKGETQQG PLIDMKAVEK VEEHIQDALA GGARVVTGGK
RHALGGSFFE PTVLTDVKPG MLVAKEETFG PLAPLFRFKT EEEAVHMAND TEFGLAAYFY
SQDVSRVWRV AEAIEYGIVG INTGLISTTV APFGGVKESG IGREGSKYGI EDFLEVKYLC
IGGVK