Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dhaf_4271 |
Symbol | |
ID | 7261296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfitobacterium hafniense DCB-2 |
Kingdom | Bacteria |
Replicon accession | NC_011830 |
Strand | - |
Start bp | 4520720 |
End bp | 4523770 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643564188 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002460712 |
Protein GI | 219670277 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.12828 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGTAA CTCGTCGTGG GTTCCTTAAG CTTTCTGGGG CATCCCTGTT TGCTTTGGCT TCCGGTCTTG GCTTTGATCC ACAGATCGCC CAAGCCCAGG GATTTAGTCT TAGAATTGAA GGAACAACTA AGATTCCTAG CATTTGCCAT TTCTGTTCCG GCGGGTGCGG TTTGCTGCTT CATATTAAAG ACGAAAAACT CGTTTATCTG GATGGAGACC CGGATAATCC GGTGAATTCC GGTGCCTTGT GTCCGAAAGG GGCCAGTTTG GGTCATGTGG CCAATGCCAA GGACCGTGTC ACAAAGCCCA GATACAGAGC TCCCGGGTCC AGCGATTGGC AGGATATTTC CTGGGATGAA GCGATTAATA AGATCGCCTC AAAGATCAAG GAAGTTCGTG ACACAACCTG GATGGCTACG GAAGAAATTG GTGGTACAGC CTACAATGTA AACCGCGCTG ATGGCATTGC CGTTCTCGGT TCAGCCGAAG TGGATAATGA AGAATCCTAC CTGATCAAGA AACTGTCCGA GCTCATCGGA ACACCTTATA ACGAACACCA GGCCCGGATA TGACACGCTC CCACGGTGGC AAGTTTGTCA CCTTCATTTG GCCGCGGAGC TATGACCAAT TCCTGGACGG ATATGCAAAA CACGAAATGC TTCCTGATTG CGGGCAGCAA CTGTGCGGAG AATCATCCCA TAGCTATGCG TTGGATCAAT AAAGCCAAAG AAAACGGCGC CAAGGTGATC GTGGTGGATC CGCGGTTTAC CCGTACCGCC TCCCAAGCAG ATATCTTTGC CCAGGTTCGT CCCGGTGCGG ATATTGCTTA TTTAAATGCA ATCATCAATT ATATCTTAGA GAACAAACTC TATGATCAAG ACTATGTACT CAACCATACC AATGGGCTGT ACAAGATCAG TAAAGACTTT AAATTTGCCG ATGGACTGTT TTCCGGATTT GATCCCGAGA CCAAAAAGTA TAATTTTGAC AGCTGGGCTT ATCAGCTTGA TGCCGAAAAC AAACCGGTGA AAGCGGAAAG TCTGGATGAT CCTGATTGCG TATTTGGCAA GCTTAAAGAG CATTTCTCCC GTTACACTCT GGAAGTAGGG GCCGATATCA GCGGTATACC TGCTGAAAAA ATTAAAGAAA TTGCTGATAC CTTCTGCAAT ACCAGACCAG GCAGCATTCT TTATGCACTG GGCATGACCC AACACACCAC CGGCGTTCAG GGAATCCGCA GTTACGCGAT CATCCAGCTG CTTTTGGGCA ACGTAGGTAA AGCAGGCAGC GGCATCCAAG CCTTGCGTGG TGAGCCCAAT GTTCAAGGCT CCACCGATAT GGCCAATCTC TTTAATAACC TGCCGGGTTA CTTGCCGGCC CCGGTTCATA CGGATAAGGA TCTGCGCAGT TATCTGGTGC GCAGCGGCTC GGCTTTTGAA AGGCATATTA TCTCTCAGCT CAAAGCTTGG TTTGGCGAAA ATGCCACCAA AGAAAATGAT TATTGCTTTA ATTACCTGCC TAAGTACAAC TCAGGCAAGA ACTACAGCAT GGTAAAACTT TGGGAAGCTG CCAATAGCGG ACAATTCAAA ATGCTGCTTA ATTTTGGCTC GAATTCCATG GTATCCATCC CTAACCGGCA AATCGTCCGT GAAGGCTTGG CAAAGCTGGA TATGCTGGTC ATCGCGGATG TTTATGAGGT TGAAACTGCC CAATTCTGGC GTGAAAAGGA TCCGACCACA GGCGAGCTCC TGGTCAATCC GGCCAAAATC AACACAGAAG TCATCCTGCT TCCCGCAGCT TTCGTCTATG AAAAAGGGGG CACGCTATCC AACTCCGGCC GCTGGATTCA GTGGAAGGAT GCTGCTCTGA AGCCGCCAGG GGAAGCTAAG CCTGACCTGG ATATTCTGGA TCATATCTAC CATAAACTTA AAGAACTTTA TGCCGGCAGT ACCGATCCTA AGGATGAACC CATTCTCAAG GCCCGGTGGG ACTATGGTCA TGAACCGGAT CCGCTGAAAG TTCTCCAGGA GATCAGTGGT TATGATGAAA CAACTGGTAA AGTGCTGCCT ACCCTGGCTG ATTATTTAAA AGCTCCTATC GGCTCAGCTT CGTCAGGCTG TTGGATTTAT GCCGGTGTTA CAGGCAATGG CAACCTGGCT GCCCGCCGGG ACAACAGTGA TCCCTCGGGG CTTGGTCTAT ACCGCAATTG GAGCTTCTCC TGGCCGGGTA ACATCCGCAT CCTTTATAAT CGCGGCTCCT GTGATATGAA CGGTCAGCCT CTGGATGAGA ACCGCAAGCT GATTTGGTGG GATGCGGCCA AGAATTCCTG GGAAGGCAAT GACGGTGCCG ATGTGCCGGA CAAAACCAAA GGCCCGGATA CCCCGGAAGG GAAACAGGTT TTCCGCATGA ATCCTGAAGG AGTAGGCCGC TTATTCACTG CGAAATATTT CAGTGGAATT CCTGCCACAC CTGCAGCAGA TGGCTTGCCC CATATTGGCG TTAGACCGGC CGGTCAATGT AATGACGGTC CTTTGCCGGA GTTTTATGAG CCGGTGGAAA GTCCGACGGT CAATAGTCTG CATCCGGATG TAAGCTCTAA TCCTACCGTG CCCATCCCGA ACTTCCTGCC TGGTGTTACG AACCATGGCA GCAAGGAAGA TTTCCCTTAT GTACTAACGA CCTATGCCTT AGTTGAGCAT TTCTGCGCCG GCGGGATCAC GAGAAATATC CCCATGCTCA ATGAGCTGAT GCCTCAGCCC TTTGCCGAGA TCAGCAAAAA TCTGGCCCAA AAGATCGGAG TCAAAGAAGG GGACATGGTA GAAGTATCTT CTGCCCGTGG CAAAGTTCAA GTAGTCGCCC TGGTAACGGA CAGAATTCAG ACCTTAAAGA TCAACGGTCA GGATTCCGAA ACCATTGGTA TGCCTTGGAG TTGGGGCTTC GCATCCCTAA GTCCCGGACC GACGACCAAC AACCTGACCA TCAGCGCCAT CGATCCCACT GCAGGCACCC CGGAATATAA ATGCTGCCTG GTCAACATAA GGAGGGCGTA G
|
Protein sequence | MEVTRRGFLK LSGASLFALA SGLGFDPQIA QAQGFSLRIE GTTKIPSICH FCSGGCGLLL HIKDEKLVYL DGDPDNPVNS GALCPKGASL GHVANAKDRV TKPRYRAPGS SDWQDISWDE AINKIASKIK EVRDTTWMAT EEIGGTAYNV NRADGIAVLG SAEVDNEESY LIKKLSELIG TPYNEHQARI UHAPTVASLS PSFGRGAMTN SWTDMQNTKC FLIAGSNCAE NHPIAMRWIN KAKENGAKVI VVDPRFTRTA SQADIFAQVR PGADIAYLNA IINYILENKL YDQDYVLNHT NGLYKISKDF KFADGLFSGF DPETKKYNFD SWAYQLDAEN KPVKAESLDD PDCVFGKLKE HFSRYTLEVG ADISGIPAEK IKEIADTFCN TRPGSILYAL GMTQHTTGVQ GIRSYAIIQL LLGNVGKAGS GIQALRGEPN VQGSTDMANL FNNLPGYLPA PVHTDKDLRS YLVRSGSAFE RHIISQLKAW FGENATKEND YCFNYLPKYN SGKNYSMVKL WEAANSGQFK MLLNFGSNSM VSIPNRQIVR EGLAKLDMLV IADVYEVETA QFWREKDPTT GELLVNPAKI NTEVILLPAA FVYEKGGTLS NSGRWIQWKD AALKPPGEAK PDLDILDHIY HKLKELYAGS TDPKDEPILK ARWDYGHEPD PLKVLQEISG YDETTGKVLP TLADYLKAPI GSASSGCWIY AGVTGNGNLA ARRDNSDPSG LGLYRNWSFS WPGNIRILYN RGSCDMNGQP LDENRKLIWW DAAKNSWEGN DGADVPDKTK GPDTPEGKQV FRMNPEGVGR LFTAKYFSGI PATPAADGLP HIGVRPAGQC NDGPLPEFYE PVESPTVNSL HPDVSSNPTV PIPNFLPGVT NHGSKEDFPY VLTTYALVEH FCAGGITRNI PMLNELMPQP FAEISKNLAQ KIGVKEGDMV EVSSARGKVQ VVALVTDRIQ TLKINGQDSE TIGMPWSWGF ASLSPGPTTN NLTISAIDPT AGTPEYKCCL VNIRRA
|
| |