Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4280 |
Symbol | fdoG |
ID | 6144178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4379453 |
End bp | 4382503 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619101 |
Product | formate dehydrogenase-O, major subunit, selenocysteine-containing |
Protein accession | YP_001746225 |
Protein GI | 170683127 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTCA GCAGAAGGCA GTTCTTTAAG ATCTGCGCTG GCGGTATGGC AGGCACCACG GCAGCGGCAC TGGGTTTTGC ACCCAGCGTA GCACTCGCGG AAACCCGGCA GTATAAACTG CTGCGCACCC GCGAAACCCG TAATACCTGC ACCTATTGTT CCGTAGGCTG TGGGCTGTTG ATGTACAGCC TCGGTGACGG AGCAAAAAAC GCCAAAGCAT CTATCTTCCA TATCGAAGGT GACCCGGATC ACCCGGTCAA CCGCGGAGCA CTTTGTCCGA AAGGCGCTGG CCTGGTGGAT TTCATCCACT CCGAAAGCCG CCTGAAGTTT CCGGAATACC GTGCGCCAGG TTCTGATAAA TGGCAACAAA TCAGTTGGGA AGAGGCGTTT GATCGCATCG CCAAACTGAT GAAAGAAGAC CGCGATGCGA ACTACATTGC GCAAAACGCC GAAGGCGTGA CTGTTAACCG CTGGCTCTCC ACCGGGATGC TGTGTGCTTC CGCGTCGAGC AACGAAACCG GCTATTTAAC ACAAAAATTC TCCCGCGCGC TGGGTATGCT CGCGGTCGAC AACCAGGCGC GTGTCTGACA CGGACCAACG GTAGCAAGTC TTGCTCCAAC ATTTGGTCGC GGTGCGATGA CCAACCACTG GGTCGACATC AAGAATGCCA ACCTCGTCGT GGTGATGGGC GGTAACGCCG CTGAAGCTCA CCCGGTCGGG TTCCGCTGGG CGATGGAAGC CAAAATTCAC AACGGCGCGA AGCTGATTGT GATCGATCCT CGCTTTACAC GTACGGCGGC GGTGGCTGAC TACTATGCCC CTATTCGTTC CGGGACTGAC ATTGCTTTCC TGTCAGGCGT ATTGCTGTAC CTGCTGAACA ATGAAAAATT CAACCGCGAA TACACCGAAG CCTATACCAA CGCCAGCCTG ATCGTGCGTG AGGATTACGG CTTTGAAGAT GGCCTGTTCA CCGGCTACGA CGCGGAAAAA CGCAAGTACG ATAAATCCAG CTGGACCTAC GAGCTGGACG AAAACGGCTT CGCCAAACGC GATACCACGC TGCAACATCC GCGCTGCGTG TGGAACTTGC TGAAACAGCA CGTTTCCCGT TATACGCCAG ATGTGGTTGA AAACATCTGT GGTACGCCGA AAGACGCGTT CCTGAAAGTC TGCGAATACA TCGCAGAGAC CAGTGCTCAC GATAAAACTG CCTCGTTCCT TTACGCGCTC GGCTGGACGC AACACTCCAT TGGTGCGCAA AACATTCGTA CGATGGCGAT GATCCAGCTG CTGCTCGGCA ACATGGGGAT GGCAGGCGGC GGCGTTAACG CCCTGCGCGG TCACTCCAAT ATTCAGGGGC TGACCGACCT GGGGCTGCTG TCGCAGAGTC TGCCAGGTTA CATGACGTTG CCAAGCGAGA AGCAGACCGA TCTGCAAACT TACCTTACCG CCAACACGCC GAAACCACTG CTGGAAGGCC AGGTTAACTA CTGGGGCAAT TACCCGAAAT TCTTCGTCTC CATGATGAAG GCCTTCTTTG GTGATAAAGC GACGGCGGAA AATAACTGGG GCTTTGACTG GTTGCCGAAG TGGGATAAAG GCTACGACGT CCTGCAATAC TTCGAGATGA TGAAAGAGGG CAAGGTCAAT GGCTATATCT GCCAGGGCTT TAACCCTGTT GCCTCATTCC CGAACAAAAA CAAAGTGATC GGCTGTCTGT CGAAACTGAA GTTCCTCGTG ACCATCGACC CGCTGAACAC TGAAACCTCT AACTTCTGGC AGAACCACGG TGAACTGAAC GAAGTTGACT CGTCGAAGAT CCAGACCGAA GTGTTCCGTC TGCCATCGAC CTGCTTCGCG GAAGAGAACG GTTCAATCGT CAACTCTGGC CGCTGGTTGC AGTGGCACTG GAAAGGTGCG GACGCCCCGG GGATTGCGCT GACTGACGGC GAGATCCTCT CCGGTATCTT CCTGCGCTTG CGCAAGATGT ATGCCGAACA GGGTGGCGCG AACCCGGACC AGGTGCTGAA CATGACCTGG AACTACGCCA TTCCGCATGA GCCATCTTCA GAAGAAGTGG CGATGGAGAG TAACGGTAAG GCGCTGGCCG ATATTACCGA TCCGGCAACC GGTGCGGTTA TCGTCAAGAA AGGCCAACAA CTTAGCTCGT TCGCCCAACT GCGCGATGAC GGTACAACCT CCTGTGGCTG CTGGATTTTC GCCGGTAGCT GGACGCCGGA AGGCAACCAG ATGGCGCGTC GTGATAACGC CGATCCGTCT GGCCTCGGTA ACACGCTGGG CTGGGCATGG GCATGGCCGC TTAACCGCCG CATTCTGTAT AACCGCGCCT CCGCAGATCC GCAGGGTAAC CCATGGGATC CGAAGCGTCA GTTGCTGAAA TGGGACGGCA CTAAGTGGAC CGGCTGGGAT ATTCCGGATT ACAGCGCAGC GCCTCCGGGC AGTGGCGTCG GGCCGTTTAT CATGCAGCAG GAAGGCATGG GGCGTCTGTT TGCCCTCGAT AAGATGGCGG AAGGTCCGTT CCCGGAACAC TACGAGCCGT TTGAAACGCC GCTGGGAACT AACCCGCTGC ATCCAAACGT TATCTCGAAC CCGGCTGCGC GAATCTTTAA AGACGACGCC GAAGCATTGG GTAAAGCCGA TAAGTTCCCG TATGTCGGAA CCACCTATCG TCTGACCGAG CACTTCCACT ACTGGACCAA ACACGCGCTG TTGAACGCGA TTTTGCAACC AGAGCAGTTT GTGGAAATCG GGGAGTCGCT GGCGAATAAA CTTGGCATTG CCCAGGGCGA TACTGTGAAA GTCTCCTCCA ACCGTGGCTA TATCAAAGCC AAAGCGGTAG TGACCAAACG TATTCGCACG CTGAAAGCAA ACGGCAAAGA TATCGATACC ATCGGTATTC CGATTCACTG GGGCTATGAA GGTGTTGCGA AAAAAGGCTT TATTGCCAAT ACGTTGACGC CATTCGTCGG TGATGCGAAC ACGCAGACGC CGGAGTTTAA GTCCTTCCTT GTGAATGTGG AAAAGGTGTA A
|
Protein sequence | MQVSRRQFFK ICAGGMAGTT AAALGFAPSV ALAETRQYKL LRTRETRNTC TYCSVGCGLL MYSLGDGAKN AKASIFHIEG DPDHPVNRGA LCPKGAGLVD FIHSESRLKF PEYRAPGSDK WQQISWEEAF DRIAKLMKED RDANYIAQNA EGVTVNRWLS TGMLCASASS NETGYLTQKF SRALGMLAVD NQARVUHGPT VASLAPTFGR GAMTNHWVDI KNANLVVVMG GNAAEAHPVG FRWAMEAKIH NGAKLIVIDP RFTRTAAVAD YYAPIRSGTD IAFLSGVLLY LLNNEKFNRE YTEAYTNASL IVREDYGFED GLFTGYDAEK RKYDKSSWTY ELDENGFAKR DTTLQHPRCV WNLLKQHVSR YTPDVVENIC GTPKDAFLKV CEYIAETSAH DKTASFLYAL GWTQHSIGAQ NIRTMAMIQL LLGNMGMAGG GVNALRGHSN IQGLTDLGLL SQSLPGYMTL PSEKQTDLQT YLTANTPKPL LEGQVNYWGN YPKFFVSMMK AFFGDKATAE NNWGFDWLPK WDKGYDVLQY FEMMKEGKVN GYICQGFNPV ASFPNKNKVI GCLSKLKFLV TIDPLNTETS NFWQNHGELN EVDSSKIQTE VFRLPSTCFA EENGSIVNSG RWLQWHWKGA DAPGIALTDG EILSGIFLRL RKMYAEQGGA NPDQVLNMTW NYAIPHEPSS EEVAMESNGK ALADITDPAT GAVIVKKGQQ LSSFAQLRDD GTTSCGCWIF AGSWTPEGNQ MARRDNADPS GLGNTLGWAW AWPLNRRILY NRASADPQGN PWDPKRQLLK WDGTKWTGWD IPDYSAAPPG SGVGPFIMQQ EGMGRLFALD KMAEGPFPEH YEPFETPLGT NPLHPNVISN PAARIFKDDA EALGKADKFP YVGTTYRLTE HFHYWTKHAL LNAILQPEQF VEIGESLANK LGIAQGDTVK VSSNRGYIKA KAVVTKRIRT LKANGKDIDT IGIPIHWGYE GVAKKGFIAN TLTPFVGDAN TQTPEFKSFL VNVEKV
|
| |