Gene EcSMS35_1700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1700 
SymbolfdnG 
ID6146074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1701344 
End bp1704391 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content54% 
IMG OID641616576 
Productformate dehydrogenase, nitrate inducible, alpha subunit, selenocysteine-containing 
Protein accessionYP_001743754 
Protein GI170684244 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.3444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTCA GTCGCAGACA ATTTTTTAAA ATCTGCGCGG GCGGTATGGC TGGAACAACA 
GTAGCGGCAT TGGGCTTTGC CCCAAAGCAA GCACTGGCTC AGGCGCGAAA CTACAAATTA
TTACGCGCTA AAGAGATCCG TAACACCTGC ACATACTGTT CCGTAGGTTG CGGGCTATTG
ATGTATAGCC TGGGTGATGG CGCGAAAAAC GCCAGAGAAG CGATTTATCA CATTGAAGGT
GACCCGGATC ATCCGGTAAG CCGTGGTGCG CTGTGCCCGA AAGGGGCCGG TTTGCTGGAT
TACGTCAACA GCGAAAACCG TCTGCGCTAC CCGGAATATC GTGCGCCAGG TTCTGATAAA
TGGCAGCGTA TTAGCTGGGA TGAAGCATTC TCCCGTATTG CGAAGCTGAT GAAAGCTGAC
CGTGACGCTA ACTTTATTGA AAAGAACGAG CAGGGCGTAA CGGTAAACCG TTGGCTTTCT
ACCGGTATGC TGTGTGCCTC CGGTGCCAGC AACGAAACCG GGATGCTGAC TCAAAAATTT
GCCCGCTCCC TCGGGATGCT GGCGGTAGAC AACCAGGCGC GCGTCTGACA CGGACCAACG
GTAGCAAGTC TTGCTCCAAC ATTTGGTCGC GGTGCGATGA CCAACCACTG GGTGGATATC
AAAAACGCTA ACGTCGTGAT GGTGATGGGC GGTAACGCTG CTGAAGCACA TCCCGTCGGT
TTCCGCTGGG CGATGGAAGC GAAAAACAAC AACGATGCAA CCTTGATCGT TGTCGATCCC
CGTTTTACGC GTACCGCTTC TGTGGCGGAT ATTTACGCAC CTATTCGTTC CGGTACGGAC
ATTACGTTCC TGTCTGGCGT TTTGCGCTAC CTGATCGAAA ACAACAAAAT CAACGCCGAA
TATGTTAAGC ATTACACCAA CGCCAGCCTG CTGGTGCGTG ATGATTTTGC TTTCGAAGAC
GGTCTGTTCA GCGGCTACGA CGCTGAAAAA CGGCAATACG ATAAATCGTC CTGGAACTAT
CAGTTCGATG AAAACGGCTA TGCGAAACGC GATGAAACAC TGACTCATCC GCGCTGTGTG
TGGAACCTGC TGAAAGCGCA CGTTTCCCGC TACACGCCGG ACGTAGTAGA AAACATTTGC
GGTACGCCAA AAGCCGACTT CCTGAAAGTG TGTGAAGTGC TGGCCTCCAC CAGCGCACCG
GATCGCACAA CCACCTTCCT GTACGCGCTG GGCTGGACGC AGCACACCGT GGGTGCGCAG
AACATCCGTA CGATGGCGAT GATCCAGTTG CTGCTCGGTA ACATGGGTAT GGCCGGTGGC
GGCGTGAACG CATTGCGTGG TCACTCCAAC ATTCAGGGCC TGACTGACTT AGGCCTGCTC
TCTACCAGCC TGCCAGGTTA TCTGACGCTA CCGTCAGAAA AACAGGTTGA TTTGCAGTCG
TATCTGGAAG CGAATACGCC GAAAGCGACG CTGGCTGATC AGGTGAACTA CTGGAGCAAC
TATCCGAAGT TCTTCGTTAG CCTGATGAAA TCTTTCTACG GCGATGCCGC GCAGAAAGAG
AACAACTGGG GCTACGACTG GCTGCCGAAG TGGGACCAGA CCTACGACGT CATCAAGTAT
TTCAACATGA TGGACGAAGG CAAAGTCACC GGTTATTTCT GCCAGGGCTT TAACCCGGTT
GCGTCCTTCC CGGACAAAAA CAAAGTGGTG AGCTGCCTGA GCAAGCTGAA GTACATGGTG
GTAATCGATC CGCTGGTGAC TGAAACTTCT ACCTTCTGGC AGAACCACGG CGAGTCGAAC
GATGTCGATC CGGCGTCTAT TCAGACTGAA GTATTCCGTC TGCCTTCGAC CTGCTTTGCT
GAAGAAGATG GTTCTATCGC TAACTCCGGT CGCTGGTTGC AGTGGCACTG GAAAGGTCAG
GACGCGCCGG GCGAAGCGCG TAACGACGGC GAAATTCTGG CGGGTATCTA CCATCATCTG
CGCGAGCTGT ACCAGGCCGA AGGTGGTAAA GGCGTAGAAC CGCTGATGAA GATGAGCTGG
AACTACAAGC AGCCGCACGA ACCGCAATCT GACGAAGTAG CTAAAGAGAA CAACGGCTAC
GCGCTGGAAG ATCTCTATGA TGCCAATGGC GTGCTGATTG CGAAGAAAGG TCAGTTGCTG
AGTAGCTTTG CGCATCTGCG TGATGACGGT ACAACCGCAT CGTCTTGCTG GATCTACACC
GGTAGCTGGA CAGAGCAGGG CAACCAGATG GCTAACCGCG ATAACTCCGA CCCATCTGGT
CTGGGGAATA CGCTGGGATG GGCCTGGGCG TGGCCGCTCA ACCGTCGCGT ACTCTACAAC
CGTGCTTCGG CGGATATCAA CGGTAAACCG TGGGATCCGA AACGGATGCT GATCCAGTGG
AACGGCAGCA AGTGGACGGG TAACGATATT CCGGACTTCG GCAATGCCGC GCCAGGTACA
CCAACCGGAC CGTTTATCAT GCAGCCGGAA GGGATGGGAC GCCTGTTTGC CATCAACAAA
ATGGCGGAAG GTCCGTTCCC GGAACACTAC GAGCCAATCG AAACGCCGCT GGGCACTAAC
CCGCTGCATC CGAACGTGGT GTCTAACCCG GTCGTTCGTC TGTATGAACA AGACGCGCTG
CGGATGGGTA AAAAAGAGCA GTTCCCGTAT GTGGGTACGA CCTATCGTCT GACCGAGCAC
TTCCACACAT GGACCAAGCA CGCGTTGCTC AACGCAATTG CTCAGCCGGA ACAGTTTGTG
GAAATCAGCG AAACGCTGGC GGCGGCGAAA GGCATTAATA ATGGCGATCG TGTCACTGTC
TCAAGCAAGC GTGGCTTTAT CCGCGCGGTG GCGGTGGTAA CGCGCCGTCT GAAACCGCTG
AATGTAAACG GTCAGCAGGT TGAAACGGTG GGTATTCCGA TCCACTGGGG CTTTGAGGGT
GTCGCGCGTA AAGGTTATAT CGCTAACACT CTGACGCCGA ATGTCGGTGA TGCAAACTCG
CAAACGCCGG AATATAAAGC GTTCTTAGTC AACATCGAGA AGGCGTAA
 
Protein sequence
MDVSRRQFFK ICAGGMAGTT VAALGFAPKQ ALAQARNYKL LRAKEIRNTC TYCSVGCGLL 
MYSLGDGAKN AREAIYHIEG DPDHPVSRGA LCPKGAGLLD YVNSENRLRY PEYRAPGSDK
WQRISWDEAF SRIAKLMKAD RDANFIEKNE QGVTVNRWLS TGMLCASGAS NETGMLTQKF
ARSLGMLAVD NQARVUHGPT VASLAPTFGR GAMTNHWVDI KNANVVMVMG GNAAEAHPVG
FRWAMEAKNN NDATLIVVDP RFTRTASVAD IYAPIRSGTD ITFLSGVLRY LIENNKINAE
YVKHYTNASL LVRDDFAFED GLFSGYDAEK RQYDKSSWNY QFDENGYAKR DETLTHPRCV
WNLLKAHVSR YTPDVVENIC GTPKADFLKV CEVLASTSAP DRTTTFLYAL GWTQHTVGAQ
NIRTMAMIQL LLGNMGMAGG GVNALRGHSN IQGLTDLGLL STSLPGYLTL PSEKQVDLQS
YLEANTPKAT LADQVNYWSN YPKFFVSLMK SFYGDAAQKE NNWGYDWLPK WDQTYDVIKY
FNMMDEGKVT GYFCQGFNPV ASFPDKNKVV SCLSKLKYMV VIDPLVTETS TFWQNHGESN
DVDPASIQTE VFRLPSTCFA EEDGSIANSG RWLQWHWKGQ DAPGEARNDG EILAGIYHHL
RELYQAEGGK GVEPLMKMSW NYKQPHEPQS DEVAKENNGY ALEDLYDANG VLIAKKGQLL
SSFAHLRDDG TTASSCWIYT GSWTEQGNQM ANRDNSDPSG LGNTLGWAWA WPLNRRVLYN
RASADINGKP WDPKRMLIQW NGSKWTGNDI PDFGNAAPGT PTGPFIMQPE GMGRLFAINK
MAEGPFPEHY EPIETPLGTN PLHPNVVSNP VVRLYEQDAL RMGKKEQFPY VGTTYRLTEH
FHTWTKHALL NAIAQPEQFV EISETLAAAK GINNGDRVTV SSKRGFIRAV AVVTRRLKPL
NVNGQQVETV GIPIHWGFEG VARKGYIANT LTPNVGDANS QTPEYKAFLV NIEKA