Gene EcSMS35_4542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4542 
SymbolfdhF 
ID6146683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4642827 
End bp4644974 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content54% 
IMG OID641619358 
Productformate dehydrogenase H, alpha subunit, selenocysteine-containing 
Protein accessionYP_001746470 
Protein GI170683812 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG TCGTCACGGT TTGCCCCTAT TGCGCATCAG GTTGCAAAAT CAACCTGGTC 
GTCGATAACG GCAAAATCGT CCGGGCGGAG GCAGCGCAGG GGAAAACCAA CCAGGGTACC
CTGTGTCTGA AGGGTTATTA TGGCTGGGAC TTCATTAACG ATACCCAGAT CCTGACCCCG
CGCCTGAAAA CCCCCATGAT CCGTCGCCAG CGTGGCGGCA AACTCGAACC TGTTTCCTGG
GATGAGGCAC TGAATTACGT TGCCGAGCGC CTGAGCGCCA TCAAAGAGAA GTACGGTCCG
GATGCCATCC AGACGACCGG CTCCTCGCGT GGTACGGGTA ACGAAACCAA CTATGTAATG
CAAAAATTTG CGCGCGCCGT TATTGGTACC AATAACGTTG ACTGCTGCGC TCGTGTCTGA
CACGGCCCAT CGGTTGCAGG TCTGCACCAA TCGGTCGGTA ATGGCGCAAT GAGCAATGCT
ATTAACGAAA TTGATAATAC CGATTTAGTG TTCGTTTTCG GGTATAACCC GGCGGATTCC
CACCCAATCG TGGCGAACCA CGTAATTAAC GCTAAACGTA ACGGGGCGAA AATTATCGTC
TGCGATCCGC GCAAAATTGA AACTGCGCGC ATTGCTGACA TGCACATTGC ATTGAAAAAC
GGCTCGAACA TCGCGCTGTT GAATGCGATG GGCCATGTCA TTATTGAAGA AAATCTGTAC
GACAAAGCGT TCGTCGCTTC CCGTACAGAA GGCTTTGAAG AGTATCGTAA AATCGTTGAA
GGCTACACGC CGGAGTCGGT TGAAGATATC ACCGGCGTCA GCGCCAGTGA GATTCGTCAG
GCGGCACGGA TGTATGCCCA GGCGAAAAGC GCCGCCATCC TGTGGGGCAT GGGTGTAACC
CAGTTCTACC AGGGCGTGGA AACCGTGCGT TCTCTGACCA GCCTCGCGAT GCTGACCGGT
AACCTCGGTA AGCCACATGC GGGTGTAAAC CCGGTTCGTG GTCAGAACAA CGTACAGGGT
GCCTGCGATA TGGGCGCGCT GCCGGATACG TATCCGGGAT ACCAGTACGT GAAAGATCCG
GCTAACCGCG AGAAATTCGC CAAAGCCTGG GGCGTGGAAA GCCTGCCTGC TCATACCGGT
TATCGCATCA GCGAGCTGCC GCACCGCGCA GCGCATGGTG AAGTGCGTGC CGCGTACATT
ATGGGCGAAG ATCCGCTACA AACTGACGCG GAGCTGTCGG CAGTACGTAA AGCCTTTGAA
GATCTGGAAC TGGTCATCGT TCAGGACATC TTTATGACCA AAACCGCGTC GGCGGCGGAT
GTCATTTTAC CGTCAACGTC ATGGGGCGAG CATGAAGGCG TGTTTACTGC GGCTGACCGT
GGCTTCCAGC GTTTCTTTAA AGCAGTTGAA CCGAAATGGG ATTTGAAAAC GGACTGGCAA
ATCATCAGTG AAATCGCCAC CCGTATGGGT TATCCGATGC ACTACAACAA CACCCAGGAG
ATCTGGGATG AGTTGCGTCA TCTGTGCCCG GATTTCTACG GTGCGACTTA CGAGAAAATG
GGCGAACTGG GCTTCATTCA GTGGCCTTGT CGCGATACTT CAGATGCCGA TCAGGGGACT
TCTTATCTGT TTAAAGAGAA GTTTGATACT CCGAACGGTC TGGCGCAGTT CTTCACCTGC
GACTGGGTAG CGCCAATCGA CAAACTCACC GACGAGTATC CGATGGTACT GTCAACGGTG
CGTGAAGTCG GTCACTACTC TTGCCGTTCG ATGACCGGTA ACTGTGCGGC ACTGGCGGCG
CTGGCTGATG AACCTGGCTA CGCACAAATC AATACCGAAG ACGCCAAACG TCTGGGGATT
GAAGATGAGG CATTGGTTTG GGTGCACTCG CGTAAAGGCA AAATTATCAC CCGTGCGCAG
GTCAGCGATC GTCCGAACAA AGGGGCGATT TACATGACCT ACCAGTGGTG GATTGGTGCC
TGTAACGAGC TGGTTACCGA AAACTTAAGC CCGATTACGA AAACGCCGGA ATACAAATAC
TGCGCCGTGC GCGTCGAGCC GATCGCCGAT CAGCGCGCCG CCGAGCAGTA TGTGATTGAC
GAGTACAACA AGTTGAAAAC TCGCCTGCGC GAAGCGGCAC TGGCGTAA
 
Protein sequence
MKKVVTVCPY CASGCKINLV VDNGKIVRAE AAQGKTNQGT LCLKGYYGWD FINDTQILTP 
RLKTPMIRRQ RGGKLEPVSW DEALNYVAER LSAIKEKYGP DAIQTTGSSR GTGNETNYVM
QKFARAVIGT NNVDCCARVU HGPSVAGLHQ SVGNGAMSNA INEIDNTDLV FVFGYNPADS
HPIVANHVIN AKRNGAKIIV CDPRKIETAR IADMHIALKN GSNIALLNAM GHVIIEENLY
DKAFVASRTE GFEEYRKIVE GYTPESVEDI TGVSASEIRQ AARMYAQAKS AAILWGMGVT
QFYQGVETVR SLTSLAMLTG NLGKPHAGVN PVRGQNNVQG ACDMGALPDT YPGYQYVKDP
ANREKFAKAW GVESLPAHTG YRISELPHRA AHGEVRAAYI MGEDPLQTDA ELSAVRKAFE
DLELVIVQDI FMTKTASAAD VILPSTSWGE HEGVFTAADR GFQRFFKAVE PKWDLKTDWQ
IISEIATRMG YPMHYNNTQE IWDELRHLCP DFYGATYEKM GELGFIQWPC RDTSDADQGT
SYLFKEKFDT PNGLAQFFTC DWVAPIDKLT DEYPMVLSTV REVGHYSCRS MTGNCAALAA
LADEPGYAQI NTEDAKRLGI EDEALVWVHS RKGKIITRAQ VSDRPNKGAI YMTYQWWIGA
CNELVTENLS PITKTPEYKY CAVRVEPIAD QRAAEQYVID EYNKLKTRLR EAALA