Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dbac_0971 |
Symbol | |
ID | 8376629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfomicrobium baculatum DSM 4028 |
Kingdom | Bacteria |
Replicon accession | NC_013173 |
Strand | - |
Start bp | 1051185 |
End bp | 1054220 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645000210 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003157502 |
Protein GI | 256828774 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.893461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGG GAAGAAGAGA ATTCGTCAAA CTCACGGCCG CTGCGACAGC GGTCACGGCC TTCGGGGGCC TCGGGTTTGA CCTGGCGCCG ACCAAGGCCC ATGCTCAGCT TTTGAGCTTG CGCAAAGGCA AGGAAACCAC CTCGGTCTGC TGCTACTGTT CGGTAGGCTG CGGACTCATC GTCACCACCG ACGAGAAAAC AGGGCGCGCC ATAAACATCG AAGGCGACCC GGATCATCCC ATCAACGAAG GCGCCCTCTG TGCCAAAGGC GCGGCCATTT ACCAGCTGGC GGAAAATCCC CAGCGCATCA CCAAGGTGCA GTACCGTGCA CCCGGCAGCG ACAAGTGGGA AGAAAAGAGC TGGGATTGGG CCATCACGGA GATCGCCAAG CGCGTCAAGA AAGTCCGTGA CGAATCCTTC ACCGAAAAGA ACGCCAAGGG CGAAGTCGTC AACCGCACCG AAGGCCTGGC CTCCGTCGGC TCGGCCGCCA TGGACAACGA GGAATGCTGG ATCTACCAGG CCTTCCTGCG CAGTCTGGGG CTAACGTACA TAGAGCATCA GGCCCGTATT TGACACAGCG CAACTGTTGC GGCTCTGGCA GAGTCGTTCG GACGCGGCGC GATGACCAAC CACTGGATCG ACTTCAAGAA CAGTGATTGC ATTTTGATAA TGGGCAGCAA CGCTGCCGAA AACCACCCCA TCTGTTGGAA ATGGGTCACC AGGGCGCAGG AAGCAGGCGC GACGGTCATT CACGTCGACC CGCGTTACAC CCGCACCTCC TCCAAGTCGG ACATGTACGT TCCCCTGCGG TCGGGCTCCG ACATCGGTGT CCTGGGCGGC ATGATCAAGT ACATTCTGGA CAATGACCTG ACTCAGAAGG AATACGTTCT CCACTATACC AACGCCTCTT TTCTCGTTTC CAAGGATTTC AGCTTCAAGG ACGGCCTGTT CTCCGGCTTC GACGAGGCTG GACGCAAGTA CGACAAGAAG TCCTGGGACT TTGAAATGGA CGAGAACGGC GTGCCCAAGC GGGACATGAC CTTGAAAGAT CCCCGCTGCG TGTATCAGAT CCTGAAAAAA CACTACGAGC GTTACGACTC CAAGAAAGTT TCCGCCATCT CCGGTTCTCC CGAAGACCAG CTGCTGGCCT TCTACAAGGC GTACACCGCC ACGGGCAAAC CGGACAAGGC AGGCACCATC CTGTATGCCA TGGGCTGGAC CCAGCACACC GTCGGCGTGC AGAACATACG CGCCATGAGC ATCATCCAGC TGCTGCTGGG CAACATGGGC GTGGCCGGCG GCGGCGTGAA CGCGCTGCGC GGCGAATCCA ACGTCCAGGG CTCCACCGAC CAGGCCCTGC TCTTCCACAT CATCCCCGGT TACATGGCCA CCCCGCGTGC CAACTGGGCG ACCCTGGAAG ACTACAACAA GGCCAACACT CCGGTCAGCA AGGACCCCAA GAGCGTCAAC TGGTGGCAGC ACAAACCCAA GTACTTAGCC AGCCTGCTCA AGTCCTTCTA CAAGGAAGCC GACCTGCCCA CCGCCTACAC CTGGATGCCC AAGCTCGATG TAGGACAGAA TGCCTCCTGG CTGGTACTCT TCGATAAGAT GCTCAAAGGC CAGTTCAAGG GCTTCTTCGC CTGGGGCATG AACCCCGCCG CCAGTGGCGC GGACTCCAAC AAGACCCGCG AAGCCCTGTC CAAGCTGGAC TGGATGGTCA ACGTCAACTT GTTCGAGAAC GAGACTGGTT CCTTCTGGAA GGGCCCGGGC GTTGATCCCA AGAAGATCAA GACCGAAGTC TTCTTCCTGC CCTGCTCGGT CTCGGTAGAA AAGGAAGGCT CCATCTCCAA CTCCGGCCGC TGGATGCAGT GGCGCTACGC CGGTCCCAAG CCCATGGGCG AGTCCAAGCC CGACGGCGAC ATCATCTATG AACTGGCGCT CAAAATCCGC GAGCTGTACG AGAAGGAAAA GGGCGCGTTC CCCGATCCGG TCCTCGGCCT CAACGTCGAC GATTGGGGCG ATGGACACGT TTTCGATCCG CACAAGGTCG CAAAGCTCAT CAACGGTTAC TTCCTGAAAG ACACGACCAT CAAGGGCGCT GACGGCAAGG AAACCGTGTA CAAAGCGGGC AGCCAGGTTC CGAGCTTCGT GATGCTGCAG GCCGACGGCT CGACCTGCGC CGGTAACTGG ATCTACACCG CCTCCTACAC CGACAAGGGC AACATGTCCG AACGCCGCGA CAAGACCCAG ACCCCGGAAC AGGCCAATAT CGGCCTCTTC CCCGGATGGA CCTGGTCCTG GCCGGTCAAC CGCCGCATCA TCTACAACAG GGCTTCCGTC GACATGACCG GCAAACCCTG GGCACCGCAG AAGCCTGTCC TGGCATGGGA CGGCGCGAAC AAGAAGTGGC TCATCGACGT GGTCGATGGC GGCGGAGACG CAGGCGCCAA GTATCCCTTC ATCATGCAGA AGCATGGCGT GGGACAGATC TATGGCCCGG GCCTCAACGA CGGTCCGTTG CCCGAATACT ACGAACCCCT GGAATGTCCG GTCACCGAGC ATCCCTTCTC CACGCAGCTC AACAGCCCCA CGGCCCTCAT GTTTGCCGAA GACGCTCACA AGCGCACCAC CTGCGACCCG CGCTTCCCGT TCGTGTGCAC CTCGTACCGG GTCACCGAGC ACTGGCAGAC GGGCGTACTG ACCCGCTGGC TGCCCTGGCT GACGGAAGCC CAGCCGCAGA TGTTCTGCGA AATGAGCGAG GAACTGGCTG AACTCAAAGG GATCAAGAAC GGCGAAAAGG TCATTCTGGA GAATCCGCGC GGACAGCTCT GGGCCATCGC CATCGTCACC AAGCGCTTGA AGCCTTTTGA CGTCATGGGC AACCCTGTCC ATGTCGTGGG TATCCCCTGG CACTACGGTT GGGTCTGGCC CAAGGACGGC GGGGATGCGG CCAACCTGCT GACTCCTGCT GTCGGCGATC CGAACACCGG CATCCCGGAA ACCAAGGCCT TCATGGTCAA CGTGAAGAAA GCGTAA
|
Protein sequence | MKLGRREFVK LTAAATAVTA FGGLGFDLAP TKAHAQLLSL RKGKETTSVC CYCSVGCGLI VTTDEKTGRA INIEGDPDHP INEGALCAKG AAIYQLAENP QRITKVQYRA PGSDKWEEKS WDWAITEIAK RVKKVRDESF TEKNAKGEVV NRTEGLASVG SAAMDNEECW IYQAFLRSLG LTYIEHQARI UHSATVAALA ESFGRGAMTN HWIDFKNSDC ILIMGSNAAE NHPICWKWVT RAQEAGATVI HVDPRYTRTS SKSDMYVPLR SGSDIGVLGG MIKYILDNDL TQKEYVLHYT NASFLVSKDF SFKDGLFSGF DEAGRKYDKK SWDFEMDENG VPKRDMTLKD PRCVYQILKK HYERYDSKKV SAISGSPEDQ LLAFYKAYTA TGKPDKAGTI LYAMGWTQHT VGVQNIRAMS IIQLLLGNMG VAGGGVNALR GESNVQGSTD QALLFHIIPG YMATPRANWA TLEDYNKANT PVSKDPKSVN WWQHKPKYLA SLLKSFYKEA DLPTAYTWMP KLDVGQNASW LVLFDKMLKG QFKGFFAWGM NPAASGADSN KTREALSKLD WMVNVNLFEN ETGSFWKGPG VDPKKIKTEV FFLPCSVSVE KEGSISNSGR WMQWRYAGPK PMGESKPDGD IIYELALKIR ELYEKEKGAF PDPVLGLNVD DWGDGHVFDP HKVAKLINGY FLKDTTIKGA DGKETVYKAG SQVPSFVMLQ ADGSTCAGNW IYTASYTDKG NMSERRDKTQ TPEQANIGLF PGWTWSWPVN RRIIYNRASV DMTGKPWAPQ KPVLAWDGAN KKWLIDVVDG GGDAGAKYPF IMQKHGVGQI YGPGLNDGPL PEYYEPLECP VTEHPFSTQL NSPTALMFAE DAHKRTTCDP RFPFVCTSYR VTEHWQTGVL TRWLPWLTEA QPQMFCEMSE ELAELKGIKN GEKVILENPR GQLWAIAIVT KRLKPFDVMG NPVHVVGIPW HYGWVWPKDG GDAANLLTPA VGDPNTGIPE TKAFMVNVKK A
|
| |