Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_0813 |
Symbol | |
ID | 3755730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 833077 |
End bp | 836121 |
Gene Length | 3045 bp |
Protein Length | 1014 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637781678 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_387309 |
Protein GI | 78355860 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATCA GCCGGCGTAG TTTTATAAAG GCCACCGCGG CAAGCGGCAT CGCTGCCGCA TTCGGCGGAC TGGGTCTGTC GCTTGTTCCC TCCGTATCAC AGGCAAAACT GCTGCCCCTG CGCTGGGCAA AACAGACCAC CTCGGTCTGC TGTTTCTGTG CCGTGGGATG CGGACTGCTG GTACACACCG ACAAAACATC GCACCGCGCC GTAAACGTGG AAGGCAACCC CGACCACCCC GTCAACGAAG GTTCACTCTG CTCCAAAGGG GCCTCTATCT ATCAGGTGGC CGAAAATGCT GAACGGCCCG CAACGCCCAT GTACCGCGCT CCCTACAGTG ACGCGTGGCA GCCTGTCTCT TGGGACTGGG CACTGACCGA AATTGCCAGA CGCATAAAAA AAGACCGTGA TGCTTCCTTC CGCTTTCAGA ATGACATGGG GCTGACAGTG AACCGCTGCG ACGGTATCGC CTCTGTGGGC TCGGCGGCGC TGGATAACGA AGAATGCTGG GTGTTCCAGT CCATCCTGCG CGCGCTGGGA CTTGTCTGGA TAGAACATCA GGCACGCATC TGCCACAGCT CCACGGTTCC TGCACTGGCG GAATCTTTCG GGCGGGGCGC CATGACCAAC CACTGGAACG ATATACAGCA CAGTGACTGC GTGCTGGTGA TGGGCAGCAA TGCGGCGGAA AACCATCCCA TATCATTCAA ATGGGTGCTG CGTGCCATGG AAAAAGGGGC AAAGCTTATC AGCATCGACC CGCGCTTCAC CCGCACATCG GCCAAGGCCG ACCTGTATGT GCAGATACGC GCCGGTACCG ACATCGCCGT GCTGGGCGGG CTTATCAACT ACATCATCGA AAACGACCTG ATCCAGCGCG ACTATGTGGT CAGCCACACC AACGCGCCGT TCCTCGTGTC CGATTCGTTC TCTTTCAAGG ACGGGCTGTT CAGCGGATAC AAGGCAGGCA GCAAGGACAG CCATTATCAG GGGCGCTACA ACAAGTCCCT GTGGGATTTT GCCAAGGACG AAAACGGGCT GCCGCTGAAA GATGAAACCC TGGCGCACCC GCGCTGCGTA TACCAGCTGC TCAAAAAACA CTACTCGCGC TATGACATAG ACACAGTGGT CAGCGTGAGC GGCGCGGACA AGCAGGGCCT GCTGGAATTT TACCGCCAGT ATGCCGCCAC CGGCAAGCCG GACAAGGCAG GCACCATAAT GTACGCCATG GGCTGGACAC AGCACACGGT GGGCACCCAG TACATACGCA CCATGGCTAT GGTGCAGCTG CTGCTGGGCA ACATCGGCGT GGCCGGAGGC GGAGTGAACG CCCTGCGCGG CGAATCGAAC GTGCAGGGGT CCACGGACCA CGCGCTGCTC TGGCAGAGCC TGCCCGGTTA CCTTGCCGTG CCCGACGCCA CCCATACATC GTACAAAAAG CATCTTGAAA TAAAAACCGC CCCCCATCTG GCAGCGGCCA AAGATCCCAA AAGCGCCGCA TGGTGGCAGT ATTACCCCAA ATACATGGCC AGCTTTCTCA AATCCATGTA TCCGCAGGCA GAGCTTGCCG ATGCATATAA CTGGCTGCCC AAGGCCGATC AGGGAAAAAC CTACACATGG CTCGAGCTGT TTGACGCCAT GCACGACGGC GAATTCAAGG GGTTCTTCGC CTGGGGACAG AACCCCGCCT GTTCCGGCGC CCACGCCGGA AAAAACCGCG AAGCCATGGC GAAACTGGAC TGGATGGTCA ACGTCAATAT TTTTGACAAT GAAACAGGCT CGTTCTGGCG CGGTCCGGGC ATGCAGCCCG AAAAGATCAA GACGGAAGTA TTCTTTCTGC CCTGCTGTGT ATCCATAGAA AAAGAAGGCT CCATCACCAA CTCGGGCCGG TGGATGCAGT GGCGCTATGC AGGCCCCGAC CCGAGAGGCG CCGCCAGATC CGACGGGCAT ATAATGGTTG AACTGATGGA GAAAGTCCGT GCTCTGTACG CAGAAGAAGG CGGTACCTTC ACCGCGCCCA TCGAAGCGCT TTCGCTGGAT ATGTGGCGTG ACCAAAAAGG CTACAACCCG CATAACGTGG CCAGACTCAT CAACGGCACA TTCCTGCGGG ACGTGACCAT CAAGGGCACT ACCTACAAAA AAGGCCAGCA GGTGCCCAGT TTTGCGCTGC TGCAGGACGA CGGCTCGACC TGCTCCGGCA ACTGGCTGTA CTGCGCTTCG TACACCGATG AAGGCAACAT GGGCGAACGC CAGAGCAGGC AGCAGAGCCC GGAACAGGAA AAGCTCGGCC TTTTCCCCAA CTGGACATGG TGCTGGCCGC TCAACAGGCG CATACTGTAC AACCGCGCTT CCTGCGACCT GACAGGCAAG CCCTATAACC CGCAGATGCC CGTCATCAGC TGGACAGGCG AAAAATGGAC AGGCGACGTA CCCGACGGCG GCTGGAAACC CGGAGAAAAA TACTCGTTCA TCATGAAGCC GCACGGCCAC GGACGCATCT ACGGCCCCGG ACTGGAAGAC GGGCCCTTCC CCGAGCACTA CGAACCCATG GAAACCCCGC TCAAATCGCA TCCGCTCTCG CGCCAGCGCA GCAATCCGGC CTGCCTGTCC TTCAATGACG AATACAAGGC CGTGGCAGAC CCGAAATTCC CTTATGTGGC AACCACATTC AGGGTTACCG AACACTGGCA GACGGGCCTT ATGACCCGCC ACATGCCCTG GCTGCTGGAA GCACAGCCGC AGATGTTTGT GGAGCTGAGC GAAGAACTGG CCCGGAAGCT GGGCGTAGGT AACGGTGACA AGGTTGTAGT GGAAAGCGCA CGCGGCAGCA TCTGGGCAGT GGGCATGGTG ACTGAACGCG TCAAGCCGCT CAGAATTCTG GGTAAAACCG TGCACCAGAT AAGCATGCCG TGGTGTTTCG GCTGGTTCAT GCCGCACGAC GGCAGCGGGG GAGATTCCTC CAACCTGCTT ACAGCGGCCG TGGGCGACGC CAATACCGGC ATTCCCGAAA CCAAGGTCTT CATGGCCAAT GTGCGCAAGG CGTAA
|
Protein sequence | MHISRRSFIK ATAASGIAAA FGGLGLSLVP SVSQAKLLPL RWAKQTTSVC CFCAVGCGLL VHTDKTSHRA VNVEGNPDHP VNEGSLCSKG ASIYQVAENA ERPATPMYRA PYSDAWQPVS WDWALTEIAR RIKKDRDASF RFQNDMGLTV NRCDGIASVG SAALDNEECW VFQSILRALG LVWIEHQARI CHSSTVPALA ESFGRGAMTN HWNDIQHSDC VLVMGSNAAE NHPISFKWVL RAMEKGAKLI SIDPRFTRTS AKADLYVQIR AGTDIAVLGG LINYIIENDL IQRDYVVSHT NAPFLVSDSF SFKDGLFSGY KAGSKDSHYQ GRYNKSLWDF AKDENGLPLK DETLAHPRCV YQLLKKHYSR YDIDTVVSVS GADKQGLLEF YRQYAATGKP DKAGTIMYAM GWTQHTVGTQ YIRTMAMVQL LLGNIGVAGG GVNALRGESN VQGSTDHALL WQSLPGYLAV PDATHTSYKK HLEIKTAPHL AAAKDPKSAA WWQYYPKYMA SFLKSMYPQA ELADAYNWLP KADQGKTYTW LELFDAMHDG EFKGFFAWGQ NPACSGAHAG KNREAMAKLD WMVNVNIFDN ETGSFWRGPG MQPEKIKTEV FFLPCCVSIE KEGSITNSGR WMQWRYAGPD PRGAARSDGH IMVELMEKVR ALYAEEGGTF TAPIEALSLD MWRDQKGYNP HNVARLINGT FLRDVTIKGT TYKKGQQVPS FALLQDDGST CSGNWLYCAS YTDEGNMGER QSRQQSPEQE KLGLFPNWTW CWPLNRRILY NRASCDLTGK PYNPQMPVIS WTGEKWTGDV PDGGWKPGEK YSFIMKPHGH GRIYGPGLED GPFPEHYEPM ETPLKSHPLS RQRSNPACLS FNDEYKAVAD PKFPYVATTF RVTEHWQTGL MTRHMPWLLE AQPQMFVELS EELARKLGVG NGDKVVVESA RGSIWAVGMV TERVKPLRIL GKTVHQISMP WCFGWFMPHD GSGGDSSNLL TAAVGDANTG IPETKVFMAN VRKA
|
| |