Gene EcSMS35_4280 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4280 
SymbolfdoG 
ID6144178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4379453 
End bp4382503 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content55% 
IMG OID641619101 
Productformate dehydrogenase-O, major subunit, selenocysteine-containing 
Protein accessionYP_001746225 
Protein GI170683127 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTCA GCAGAAGGCA GTTCTTTAAG ATCTGCGCTG GCGGTATGGC AGGCACCACG 
GCAGCGGCAC TGGGTTTTGC ACCCAGCGTA GCACTCGCGG AAACCCGGCA GTATAAACTG
CTGCGCACCC GCGAAACCCG TAATACCTGC ACCTATTGTT CCGTAGGCTG TGGGCTGTTG
ATGTACAGCC TCGGTGACGG AGCAAAAAAC GCCAAAGCAT CTATCTTCCA TATCGAAGGT
GACCCGGATC ACCCGGTCAA CCGCGGAGCA CTTTGTCCGA AAGGCGCTGG CCTGGTGGAT
TTCATCCACT CCGAAAGCCG CCTGAAGTTT CCGGAATACC GTGCGCCAGG TTCTGATAAA
TGGCAACAAA TCAGTTGGGA AGAGGCGTTT GATCGCATCG CCAAACTGAT GAAAGAAGAC
CGCGATGCGA ACTACATTGC GCAAAACGCC GAAGGCGTGA CTGTTAACCG CTGGCTCTCC
ACCGGGATGC TGTGTGCTTC CGCGTCGAGC AACGAAACCG GCTATTTAAC ACAAAAATTC
TCCCGCGCGC TGGGTATGCT CGCGGTCGAC AACCAGGCGC GTGTCTGACA CGGACCAACG
GTAGCAAGTC TTGCTCCAAC ATTTGGTCGC GGTGCGATGA CCAACCACTG GGTCGACATC
AAGAATGCCA ACCTCGTCGT GGTGATGGGC GGTAACGCCG CTGAAGCTCA CCCGGTCGGG
TTCCGCTGGG CGATGGAAGC CAAAATTCAC AACGGCGCGA AGCTGATTGT GATCGATCCT
CGCTTTACAC GTACGGCGGC GGTGGCTGAC TACTATGCCC CTATTCGTTC CGGGACTGAC
ATTGCTTTCC TGTCAGGCGT ATTGCTGTAC CTGCTGAACA ATGAAAAATT CAACCGCGAA
TACACCGAAG CCTATACCAA CGCCAGCCTG ATCGTGCGTG AGGATTACGG CTTTGAAGAT
GGCCTGTTCA CCGGCTACGA CGCGGAAAAA CGCAAGTACG ATAAATCCAG CTGGACCTAC
GAGCTGGACG AAAACGGCTT CGCCAAACGC GATACCACGC TGCAACATCC GCGCTGCGTG
TGGAACTTGC TGAAACAGCA CGTTTCCCGT TATACGCCAG ATGTGGTTGA AAACATCTGT
GGTACGCCGA AAGACGCGTT CCTGAAAGTC TGCGAATACA TCGCAGAGAC CAGTGCTCAC
GATAAAACTG CCTCGTTCCT TTACGCGCTC GGCTGGACGC AACACTCCAT TGGTGCGCAA
AACATTCGTA CGATGGCGAT GATCCAGCTG CTGCTCGGCA ACATGGGGAT GGCAGGCGGC
GGCGTTAACG CCCTGCGCGG TCACTCCAAT ATTCAGGGGC TGACCGACCT GGGGCTGCTG
TCGCAGAGTC TGCCAGGTTA CATGACGTTG CCAAGCGAGA AGCAGACCGA TCTGCAAACT
TACCTTACCG CCAACACGCC GAAACCACTG CTGGAAGGCC AGGTTAACTA CTGGGGCAAT
TACCCGAAAT TCTTCGTCTC CATGATGAAG GCCTTCTTTG GTGATAAAGC GACGGCGGAA
AATAACTGGG GCTTTGACTG GTTGCCGAAG TGGGATAAAG GCTACGACGT CCTGCAATAC
TTCGAGATGA TGAAAGAGGG CAAGGTCAAT GGCTATATCT GCCAGGGCTT TAACCCTGTT
GCCTCATTCC CGAACAAAAA CAAAGTGATC GGCTGTCTGT CGAAACTGAA GTTCCTCGTG
ACCATCGACC CGCTGAACAC TGAAACCTCT AACTTCTGGC AGAACCACGG TGAACTGAAC
GAAGTTGACT CGTCGAAGAT CCAGACCGAA GTGTTCCGTC TGCCATCGAC CTGCTTCGCG
GAAGAGAACG GTTCAATCGT CAACTCTGGC CGCTGGTTGC AGTGGCACTG GAAAGGTGCG
GACGCCCCGG GGATTGCGCT GACTGACGGC GAGATCCTCT CCGGTATCTT CCTGCGCTTG
CGCAAGATGT ATGCCGAACA GGGTGGCGCG AACCCGGACC AGGTGCTGAA CATGACCTGG
AACTACGCCA TTCCGCATGA GCCATCTTCA GAAGAAGTGG CGATGGAGAG TAACGGTAAG
GCGCTGGCCG ATATTACCGA TCCGGCAACC GGTGCGGTTA TCGTCAAGAA AGGCCAACAA
CTTAGCTCGT TCGCCCAACT GCGCGATGAC GGTACAACCT CCTGTGGCTG CTGGATTTTC
GCCGGTAGCT GGACGCCGGA AGGCAACCAG ATGGCGCGTC GTGATAACGC CGATCCGTCT
GGCCTCGGTA ACACGCTGGG CTGGGCATGG GCATGGCCGC TTAACCGCCG CATTCTGTAT
AACCGCGCCT CCGCAGATCC GCAGGGTAAC CCATGGGATC CGAAGCGTCA GTTGCTGAAA
TGGGACGGCA CTAAGTGGAC CGGCTGGGAT ATTCCGGATT ACAGCGCAGC GCCTCCGGGC
AGTGGCGTCG GGCCGTTTAT CATGCAGCAG GAAGGCATGG GGCGTCTGTT TGCCCTCGAT
AAGATGGCGG AAGGTCCGTT CCCGGAACAC TACGAGCCGT TTGAAACGCC GCTGGGAACT
AACCCGCTGC ATCCAAACGT TATCTCGAAC CCGGCTGCGC GAATCTTTAA AGACGACGCC
GAAGCATTGG GTAAAGCCGA TAAGTTCCCG TATGTCGGAA CCACCTATCG TCTGACCGAG
CACTTCCACT ACTGGACCAA ACACGCGCTG TTGAACGCGA TTTTGCAACC AGAGCAGTTT
GTGGAAATCG GGGAGTCGCT GGCGAATAAA CTTGGCATTG CCCAGGGCGA TACTGTGAAA
GTCTCCTCCA ACCGTGGCTA TATCAAAGCC AAAGCGGTAG TGACCAAACG TATTCGCACG
CTGAAAGCAA ACGGCAAAGA TATCGATACC ATCGGTATTC CGATTCACTG GGGCTATGAA
GGTGTTGCGA AAAAAGGCTT TATTGCCAAT ACGTTGACGC CATTCGTCGG TGATGCGAAC
ACGCAGACGC CGGAGTTTAA GTCCTTCCTT GTGAATGTGG AAAAGGTGTA A
 
Protein sequence
MQVSRRQFFK ICAGGMAGTT AAALGFAPSV ALAETRQYKL LRTRETRNTC TYCSVGCGLL 
MYSLGDGAKN AKASIFHIEG DPDHPVNRGA LCPKGAGLVD FIHSESRLKF PEYRAPGSDK
WQQISWEEAF DRIAKLMKED RDANYIAQNA EGVTVNRWLS TGMLCASASS NETGYLTQKF
SRALGMLAVD NQARVUHGPT VASLAPTFGR GAMTNHWVDI KNANLVVVMG GNAAEAHPVG
FRWAMEAKIH NGAKLIVIDP RFTRTAAVAD YYAPIRSGTD IAFLSGVLLY LLNNEKFNRE
YTEAYTNASL IVREDYGFED GLFTGYDAEK RKYDKSSWTY ELDENGFAKR DTTLQHPRCV
WNLLKQHVSR YTPDVVENIC GTPKDAFLKV CEYIAETSAH DKTASFLYAL GWTQHSIGAQ
NIRTMAMIQL LLGNMGMAGG GVNALRGHSN IQGLTDLGLL SQSLPGYMTL PSEKQTDLQT
YLTANTPKPL LEGQVNYWGN YPKFFVSMMK AFFGDKATAE NNWGFDWLPK WDKGYDVLQY
FEMMKEGKVN GYICQGFNPV ASFPNKNKVI GCLSKLKFLV TIDPLNTETS NFWQNHGELN
EVDSSKIQTE VFRLPSTCFA EENGSIVNSG RWLQWHWKGA DAPGIALTDG EILSGIFLRL
RKMYAEQGGA NPDQVLNMTW NYAIPHEPSS EEVAMESNGK ALADITDPAT GAVIVKKGQQ
LSSFAQLRDD GTTSCGCWIF AGSWTPEGNQ MARRDNADPS GLGNTLGWAW AWPLNRRILY
NRASADPQGN PWDPKRQLLK WDGTKWTGWD IPDYSAAPPG SGVGPFIMQQ EGMGRLFALD
KMAEGPFPEH YEPFETPLGT NPLHPNVISN PAARIFKDDA EALGKADKFP YVGTTYRLTE
HFHYWTKHAL LNAILQPEQF VEIGESLANK LGIAQGDTVK VSSNRGYIKA KAVVTKRIRT
LKANGKDIDT IGIPIHWGYE GVAKKGFIAN TLTPFVGDAN TQTPEFKSFL VNVEKV