Gene Information |
Locus tag | Dret_0226 |
Symbol | |
ID | 8418030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 282666 |
End bp | 285683 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 645036791 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003197106 |
Protein GI | 258404364 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type; [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
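The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The record does not state either convention, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 282666
end_bp = 285683
reported_gene_length = 3018      # bp
reported_protein_length = 1005   # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in whole codons; the final codon is assumed to be the stop
# codon, which contributes no residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, codons, protein_length)
```

Under these assumptions the computed gene length and protein length agree with the reported 3018 bp and 1005 aa.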
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.174026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0776388 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTCA CCCGAAGACA TTTTCTGAAG CTCTCCGCTT CGGCTGCGGC AGTCACCGCA TTCGGCGGGC TGGGATTCAG CCTGAAGCCG ACCGCGGCAC AGGCTCAGCT GCTGAAATTG CGCTGGGCCA AGGAAACCAC ATCCATCTGT TGTTATTGCG CGGTAGGGTG TGGACTGATC GTCCATACCT CCCAGGAAGG ACAGGGCCGG GCCATCAATG TCGAAGGCGA TCCGGACCAC CCCGTCAGTG AAGGCTCCTT GTGCGCCAAA GGGGCGGCCA TTTTCAACCT GGGCGAGAAC GAAGACCGCA TCACTTCGGT TCTTTATCGC GCTCCGGGCA GCGAGAAGTG GCAGGAAACA TCCTGGGATT GGGCCCTGGA CACCATCGCC AAACGGGTCA AGGAAACCCG TGACGCCACG TTTACCCGGA CTAATGCCCA AGGGCAGGAA GTCAATCGGT GCAACGGCTT GGCCTCTGTT GGCTCCGCCG CCATCGACAA CGAAGAGTGC TGGGTCTATC AGGCCATGCT GCGCTCTCTG GGCCTGGTGT ATATCGAGCA CCAGGCGCGT ATCTGACACT CCGCAACGGT AGCGGCTCTG GCAGAGTCGT TCGGACGCGG TGCGATGACC AATCACTGGA TCGACATCAA AAACAGTGAT TGCATTTTGA TCATGGGCAG TAACGCTGCC GAAAACCACC CTGTCTCCTT CAAGTGGGTG ACCAAGGCCC AGGAAAAAGG GGCCCAACTG ATCCATGTCG ACCCCAGGTA CACGCGGACT TCGGCCAAGG CCGATATCTA CGCCCCCTTG CGTTCCGGTT CCGACATCGC GTTTCTTGGC GGTTTGATCA AATATCTGAC CGACAAGGAA ATGGTGAACT GGGAATACGT CATCAATTAT ACCAACGCGA CATTTATCCT GAGCGACGAG TACGGCTTCG AAGACGGCCT TTTTGCCGGC TTTGATCCCA AGACCAAGAG TTACGATAAA TCCAAGTGGA GCTTTGTCCT CGACGAAAAC GGCGTCCCCA AACGGGACAC CAACCTGGCG GATCCCCGGT GTGTCTACAA CCTCATGCGC AAACACTACG AACGCTACAC CCTGGATAAG GTCTCCAAGG CGACCGGGAC GCCCAAGGAA GACCTGCTCA AAGTGTACAA AGCGTACGCG GCGTCCTATA AAGCCGACAA ATCCGCGACG ATCATGTACG CCATGGGCTG GACCCAGCAT ACCGTCGGCG TCCAGAACAT CCGCGCCATG GCCATGATCC AGCTTCTGCT GGGCAATATT GGCGTGGCTG GCGGCGGCGT GAACGCCCTG CGCGGCGAGT CCAATGTGCA GGGGTCCACG GACCATTGCC TCCTGTACCA CATTCTGCCG GGGTATCTGA AGACGCCCAA GGCGTCGCAA CCGACGCTCC AGGCCTATAA TGAAGCCTAC ACTCCGGTCA GCAATGACCC CAAATCCGCC AATTGGTGGC AGCATTATCC GAAGTACTCG GCCAGCTTGA TCAAGGCCAT GTACAAGGAC GCCCCGATTG AAAAAGGGTA CAAATGGCTG CCCAAACTTG ACGACGGCAA AGGGTATTCC TTCCTGGAAC TCTTTGACGC CATGTACAGA GAAGAGATCA AAGGCTTTTT CGCCTGGGGA CAAAACCCCG CCAGCGGTCT GGCCAACTCG AACAAATCCC GTGAAGCGCT GTCCAAATTG GACTGGATGG TCGTGACCAA CATCTTCGAC AATGAAACAG CCTCGTTTTG GAAGGGCCCG AACATGGATC CCAAGTCCGT GGACACCGAG 
GTCTTCTTCC TGCCGTGCGC TGTGTCTATC GAGAAGGAAG GTTCGATCAC CAACTCTGGA CGCTGGATGC AGTGGCGGTA CGAAGGGCCG AAGCCCCTGC CGAACACCAA GACAGACGGG GACATGATCG TCGAGCTGAC CAAACGGCTC CAAAAGCTCT ACGCCAATGA AGGCGGCACC TACAGTGAGC CGATCGTCAA TCTGAGCACC GAACTGTGGG AAAAGAACGG CAAATACGAT CCACACAAGG TGGCCAAGCT GATCAACGGT TTCTTCCTCA AGGACGTCAC CGTCCGCGGC AAATCCTTCA AGGCCGGGGA TCAGGTCCCG AGTTTCGCCT ATCTCCTGGA AGACGGGACC ACGACCTCGG GCAACTGGCT GTACTGCAAT TCGTACACCA ATGAGGGCAA TATGGCCGCC CGGCGCGACA AATCCCAGAC CAAGATGCAG GCCAATATCG GTCTGTATCC GAATTGGTCC TGGTGTTGGC CGGTCAATCG GCGGATCATC TACAACCGGG CTTCCGTGGA TCTCAAAGGC AAACCGTACG CGCCCGACAA ACCGGTCATC AAATGGACCG GGGACAGCTG GGCCGGCGAT GTTCCCGACG GTGGCTGGCC TCCGGGCGAA AAGCACGCCT TTATCATGCG CAAGCATGGC TTTGGTCAGA TTTTCGGCCC CGGCCGGGCT GATGGACCGT TCCCGGAATA CTACGAACCC TTGGAGTGCC CGCTGGAAGA ACATCCGTTC TCCTCGCAAC TGCACAATCC AACGGCGCTG ACCTTTGAGG GGGCCATGGA CAAACGGCGT TCCTGCGATC CGCGCTATCC GTTTGTCGGC ACGACCTATC GGGTCACCGA ACACTGGCAA AGCGGAGTCA TGACCCGTTG GCAGCCGTGG CTTATCGAAG CCGAACCGGA ACTGTTCGTG GAAATGAGCC CGGAACTGGC CAAGATGCGC GGCATCGAGA ACGGGGAACG AGTCATCGTG GAATCCGCCC GGGGTCAGGT CAAAGCTGTG GCCATGGTTA CCCCGCGGAT GCAGCCCTTT ACGATTATGG GGCAGGTCAT CCACCAGATC GGGCTCCCCT GGCATTACGG TTGGGTCTAC CCCAAAGACA GCGGTGACGC GGCCAATCTG CTCACACCGT CTGTCGGGGA TGCGAATACC GGTATTCCCG AAACCAAGGC CTTCATGGTC AATGTTCGCA AGATTTAA
|
Protein sequence | MTFTRRHFLK LSASAAAVTA FGGLGFSLKP TAAQAQLLKL RWAKETTSIC CYCAVGCGLI VHTSQEGQGR AINVEGDPDH PVSEGSLCAK GAAIFNLGEN EDRITSVLYR APGSEKWQET SWDWALDTIA KRVKETRDAT FTRTNAQGQE VNRCNGLASV GSAAIDNEEC WVYQAMLRSL GLVYIEHQAR IUHSATVAAL AESFGRGAMT NHWIDIKNSD CILIMGSNAA ENHPVSFKWV TKAQEKGAQL IHVDPRYTRT SAKADIYAPL RSGSDIAFLG GLIKYLTDKE MVNWEYVINY TNATFILSDE YGFEDGLFAG FDPKTKSYDK SKWSFVLDEN GVPKRDTNLA DPRCVYNLMR KHYERYTLDK VSKATGTPKE DLLKVYKAYA ASYKADKSAT IMYAMGWTQH TVGVQNIRAM AMIQLLLGNI GVAGGGVNAL RGESNVQGST DHCLLYHILP GYLKTPKASQ PTLQAYNEAY TPVSNDPKSA NWWQHYPKYS ASLIKAMYKD APIEKGYKWL PKLDDGKGYS FLELFDAMYR EEIKGFFAWG QNPASGLANS NKSREALSKL DWMVVTNIFD NETASFWKGP NMDPKSVDTE VFFLPCAVSI EKEGSITNSG RWMQWRYEGP KPLPNTKTDG DMIVELTKRL QKLYANEGGT YSEPIVNLST ELWEKNGKYD PHKVAKLING FFLKDVTVRG KSFKAGDQVP SFAYLLEDGT TTSGNWLYCN SYTNEGNMAA RRDKSQTKMQ ANIGLYPNWS WCWPVNRRII YNRASVDLKG KPYAPDKPVI KWTGDSWAGD VPDGGWPPGE KHAFIMRKHG FGQIFGPGRA DGPFPEYYEP LECPLEEHPF SSQLHNPTAL TFEGAMDKRR SCDPRYPFVG TTYRVTEHWQ SGVMTRWQPW LIEAEPELFV EMSPELAKMR GIENGERVIV ESARGQVKAV AMVTPRMQPF TIMGQVIHQI GLPWHYGWVY PKDSGDAANL LTPSVGDANT GIPETKAFMV NVRKI
|
| |
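The protein sequence contains a U (selenocysteine) where the gene sequence has an in-frame TGA codon, in line with the COG0243 annotation ("typically selenocysteine-containing"). The sketch below translates two short excerpts copied from the Gene sequence field using the amino-acid assignments of translation table 11. It is a minimal illustration, not the database's annotation pipeline: the function name `translate` and the `tga_as_sec` flag are hypothetical, and the flag recodes every in-frame TGA in its input, whereas real selenocysteine recoding depends on sequence context that this sketch ignores.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, in the conventional
# TCAG codon order (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, tga_as_sec=False):
    """Translate an in-frame DNA string codon by codon.

    Spaces (as used in the record's 10-base blocks) are ignored.
    If tga_as_sec is True, every TGA codon is written as 'U'
    (selenocysteine) instead of '*'. This is an assumption made for
    illustration; it does not model how the cell chooses which TGA
    codons to recode.
    """
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if tga_as_sec and codon == "TGA":
            residues.append("U")
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


# First 30 bases of the gene sequence.
START_EXCERPT = "ATGACTTTCA CCCGAAGACA TTTTCTGAAG"

# 60-base excerpt spanning the in-frame TGA codon.
SEC_EXCERPT = (
    "GGCCTGGTGT ATATCGAGCA CCAGGCGCGT "
    "ATCTGACACT CCGCAACGGT AGCGGCTCTG"
)

print(translate(START_EXCERPT))                 # MTFTRRHFLK
print(translate(SEC_EXCERPT))                   # GLVYIEHQARI*HSATVAAL
print(translate(SEC_EXCERPT, tga_as_sec=True))  # GLVYIEHQARIUHSATVAAL
```

With the flag set, the second excerpt reproduces the `GLVYIEHQAR IUHSATVAAL` stretch of the protein sequence; without it, a plain table 11 translation would stop at that codon.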