Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1700 |
Symbol | fdnG |
ID | 6146074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1701344 |
End bp | 1704391 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616576 |
Product | formate dehydrogenase, nitrate inducible, alpha subunit, selenocysteine-containing |
Protein accession | YP_001743754 |
Protein GI | 170684244 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.3444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCA GTCGCAGACA ATTTTTTAAA ATCTGCGCGG GCGGTATGGC TGGAACAACA GTAGCGGCAT TGGGCTTTGC CCCAAAGCAA GCACTGGCTC AGGCGCGAAA CTACAAATTA TTACGCGCTA AAGAGATCCG TAACACCTGC ACATACTGTT CCGTAGGTTG CGGGCTATTG ATGTATAGCC TGGGTGATGG CGCGAAAAAC GCCAGAGAAG CGATTTATCA CATTGAAGGT GACCCGGATC ATCCGGTAAG CCGTGGTGCG CTGTGCCCGA AAGGGGCCGG TTTGCTGGAT TACGTCAACA GCGAAAACCG TCTGCGCTAC CCGGAATATC GTGCGCCAGG TTCTGATAAA TGGCAGCGTA TTAGCTGGGA TGAAGCATTC TCCCGTATTG CGAAGCTGAT GAAAGCTGAC CGTGACGCTA ACTTTATTGA AAAGAACGAG CAGGGCGTAA CGGTAAACCG TTGGCTTTCT ACCGGTATGC TGTGTGCCTC CGGTGCCAGC AACGAAACCG GGATGCTGAC TCAAAAATTT GCCCGCTCCC TCGGGATGCT GGCGGTAGAC AACCAGGCGC GCGTCTGACA CGGACCAACG GTAGCAAGTC TTGCTCCAAC ATTTGGTCGC GGTGCGATGA CCAACCACTG GGTGGATATC AAAAACGCTA ACGTCGTGAT GGTGATGGGC GGTAACGCTG CTGAAGCACA TCCCGTCGGT TTCCGCTGGG CGATGGAAGC GAAAAACAAC AACGATGCAA CCTTGATCGT TGTCGATCCC CGTTTTACGC GTACCGCTTC TGTGGCGGAT ATTTACGCAC CTATTCGTTC CGGTACGGAC ATTACGTTCC TGTCTGGCGT TTTGCGCTAC CTGATCGAAA ACAACAAAAT CAACGCCGAA TATGTTAAGC ATTACACCAA CGCCAGCCTG CTGGTGCGTG ATGATTTTGC TTTCGAAGAC GGTCTGTTCA GCGGCTACGA CGCTGAAAAA CGGCAATACG ATAAATCGTC CTGGAACTAT CAGTTCGATG AAAACGGCTA TGCGAAACGC GATGAAACAC TGACTCATCC GCGCTGTGTG TGGAACCTGC TGAAAGCGCA CGTTTCCCGC TACACGCCGG ACGTAGTAGA AAACATTTGC GGTACGCCAA AAGCCGACTT CCTGAAAGTG TGTGAAGTGC TGGCCTCCAC CAGCGCACCG GATCGCACAA CCACCTTCCT GTACGCGCTG GGCTGGACGC AGCACACCGT GGGTGCGCAG AACATCCGTA CGATGGCGAT GATCCAGTTG CTGCTCGGTA ACATGGGTAT GGCCGGTGGC GGCGTGAACG CATTGCGTGG TCACTCCAAC ATTCAGGGCC TGACTGACTT AGGCCTGCTC TCTACCAGCC TGCCAGGTTA TCTGACGCTA CCGTCAGAAA AACAGGTTGA TTTGCAGTCG TATCTGGAAG CGAATACGCC GAAAGCGACG CTGGCTGATC AGGTGAACTA CTGGAGCAAC TATCCGAAGT TCTTCGTTAG CCTGATGAAA TCTTTCTACG GCGATGCCGC GCAGAAAGAG AACAACTGGG GCTACGACTG GCTGCCGAAG TGGGACCAGA CCTACGACGT CATCAAGTAT TTCAACATGA TGGACGAAGG CAAAGTCACC GGTTATTTCT GCCAGGGCTT TAACCCGGTT GCGTCCTTCC CGGACAAAAA CAAAGTGGTG AGCTGCCTGA GCAAGCTGAA GTACATGGTG GTAATCGATC CGCTGGTGAC TGAAACTTCT ACCTTCTGGC AGAACCACGG CGAGTCGAAC GATGTCGATC CGGCGTCTAT TCAGACTGAA GTATTCCGTC TGCCTTCGAC CTGCTTTGCT GAAGAAGATG GTTCTATCGC TAACTCCGGT CGCTGGTTGC AGTGGCACTG GAAAGGTCAG GACGCGCCGG GCGAAGCGCG TAACGACGGC GAAATTCTGG CGGGTATCTA CCATCATCTG CGCGAGCTGT ACCAGGCCGA AGGTGGTAAA GGCGTAGAAC CGCTGATGAA GATGAGCTGG AACTACAAGC AGCCGCACGA ACCGCAATCT GACGAAGTAG CTAAAGAGAA CAACGGCTAC GCGCTGGAAG ATCTCTATGA TGCCAATGGC GTGCTGATTG CGAAGAAAGG TCAGTTGCTG AGTAGCTTTG CGCATCTGCG TGATGACGGT ACAACCGCAT CGTCTTGCTG GATCTACACC GGTAGCTGGA CAGAGCAGGG CAACCAGATG GCTAACCGCG ATAACTCCGA CCCATCTGGT CTGGGGAATA CGCTGGGATG GGCCTGGGCG TGGCCGCTCA ACCGTCGCGT ACTCTACAAC CGTGCTTCGG CGGATATCAA CGGTAAACCG TGGGATCCGA AACGGATGCT GATCCAGTGG AACGGCAGCA AGTGGACGGG TAACGATATT CCGGACTTCG GCAATGCCGC GCCAGGTACA CCAACCGGAC CGTTTATCAT GCAGCCGGAA GGGATGGGAC GCCTGTTTGC CATCAACAAA ATGGCGGAAG GTCCGTTCCC GGAACACTAC GAGCCAATCG AAACGCCGCT GGGCACTAAC CCGCTGCATC CGAACGTGGT GTCTAACCCG GTCGTTCGTC TGTATGAACA AGACGCGCTG CGGATGGGTA AAAAAGAGCA GTTCCCGTAT GTGGGTACGA CCTATCGTCT GACCGAGCAC TTCCACACAT GGACCAAGCA CGCGTTGCTC AACGCAATTG CTCAGCCGGA ACAGTTTGTG GAAATCAGCG AAACGCTGGC GGCGGCGAAA GGCATTAATA ATGGCGATCG TGTCACTGTC TCAAGCAAGC GTGGCTTTAT CCGCGCGGTG GCGGTGGTAA CGCGCCGTCT GAAACCGCTG AATGTAAACG GTCAGCAGGT TGAAACGGTG GGTATTCCGA TCCACTGGGG CTTTGAGGGT GTCGCGCGTA AAGGTTATAT CGCTAACACT CTGACGCCGA ATGTCGGTGA TGCAAACTCG CAAACGCCGG AATATAAAGC GTTCTTAGTC AACATCGAGA AGGCGTAA
|
Protein sequence | MDVSRRQFFK ICAGGMAGTT VAALGFAPKQ ALAQARNYKL LRAKEIRNTC TYCSVGCGLL MYSLGDGAKN AREAIYHIEG DPDHPVSRGA LCPKGAGLLD YVNSENRLRY PEYRAPGSDK WQRISWDEAF SRIAKLMKAD RDANFIEKNE QGVTVNRWLS TGMLCASGAS NETGMLTQKF ARSLGMLAVD NQARVUHGPT VASLAPTFGR GAMTNHWVDI KNANVVMVMG GNAAEAHPVG FRWAMEAKNN NDATLIVVDP RFTRTASVAD IYAPIRSGTD ITFLSGVLRY LIENNKINAE YVKHYTNASL LVRDDFAFED GLFSGYDAEK RQYDKSSWNY QFDENGYAKR DETLTHPRCV WNLLKAHVSR YTPDVVENIC GTPKADFLKV CEVLASTSAP DRTTTFLYAL GWTQHTVGAQ NIRTMAMIQL LLGNMGMAGG GVNALRGHSN IQGLTDLGLL STSLPGYLTL PSEKQVDLQS YLEANTPKAT LADQVNYWSN YPKFFVSLMK SFYGDAAQKE NNWGYDWLPK WDQTYDVIKY FNMMDEGKVT GYFCQGFNPV ASFPDKNKVV SCLSKLKYMV VIDPLVTETS TFWQNHGESN DVDPASIQTE VFRLPSTCFA EEDGSIANSG RWLQWHWKGQ DAPGEARNDG EILAGIYHHL RELYQAEGGK GVEPLMKMSW NYKQPHEPQS DEVAKENNGY ALEDLYDANG VLIAKKGQLL SSFAHLRDDG TTASSCWIYT GSWTEQGNQM ANRDNSDPSG LGNTLGWAWA WPLNRRVLYN RASADINGKP WDPKRMLIQW NGSKWTGNDI PDFGNAAPGT PTGPFIMQPE GMGRLFAINK MAEGPFPEHY EPIETPLGTN PLHPNVVSNP VVRLYEQDAL RMGKKEQFPY VGTTYRLTEH FHTWTKHALL NAIAQPEQFV EISETLAAAK GINNGDRVTV SSKRGFIRAV AVVTRRLKPL NVNGQQVETV GIPIHWGFEG VARKGYIANT LTPNVGDANS QTPEYKAFLV NIEKA
|
| |