Gene EcHS_A1557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1557 
SymbolfdnG 
ID5594998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1564213 
End bp1567260 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content54% 
IMG OID640920711 
Productformate dehydrogenase, nitrate inducible, alpha subunit, selenocysteine-containing 
Protein accessionYP_001458267 
Protein GI157160949 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.468579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTCA GTCGCAGACA ATTTTTTAAA ATCTGCGCGG GCGGTATGGC TGGAACAACA 
GTAGCGGCAT TGGGCTTTGC CCCGAAGCAA GCACTGGCTC AGGCGCGAAA CTACAAATTA
TTACGCGCTA AAGAGATCCG TAACACCTGC ACATACTGTT CCGTAGGTTG CGGGCTATTG
ATGTATAGCC TGGGTGATGG CGCGAAAAAC GCCAGAGAAG CGATTTATCA CATTGAAGGT
GATCCGGATC ATCCGGTAAG CCGTGGTGCG CTGTGCCCGA AAGGGGCCGG TTTGCTGGAT
TACGTCAACA GCGAAAACCG TCTGCGCTAC CCGGAATATC GTGCGCCAGG TTCTGACAAA
TGGCAGCGCA TTAGCTGGGA AGAAGCATTC TCCCGTATTG CGAAGCTGAT GAAAGCTGAC
CGTGACGCTA ACTTTATTGA AAAGAACGAG CAGGGCGTAA CGGTAAACCG TTGGCTTTCT
ACCGGTATGC TGTGTGCTTC CGGTGCCAGC AACGAAACCG GGATGCTGAC CCAGAAATTT
GCCCGCTCCC TCGGGATGCT GGCGGTAGAC AACCAGGCGC GCGTCTGACA CGGACCAACG
GTAGCAAGTC TTGCTCCAAC ATTTGGTCGC GGTGCGATGA CCAACCACTG GGTGGATATC
AAAAACGCTA ACGTCGTAAT GGTAATGGGC GGTAACGCTG CTGAAGCGCA TCCCGTCGGT
TTCCGCTGGG CGATGGAAGC GAAAAACAAC AACGATGCAA CCTTGATCGT TGTCGATCCC
CGTTTTACGC GTACCGCTTC TGTAGCGGAT ATTTACGCAC CTATTCGTTC CGGTACGGAC
ATTACGTTCC TGTCTGGCGT TTTGCGCTAC CTGATCGAAA ACAACAAAAT CAACGCCGAA
TACGTTAAGC ATTACACCAA CGCCAGCTTG CTGGTGCGTG ATGATTTTGC TTTCGAAGAC
GGTCTGTTCA GCGGCTACGA CGCTGAAAAA CGGCAATACG ATAAATCGTC CTGGAACTAT
CAGTTCGATG AAAACGGCTA TGCGAAACGC GATGAAACAC TGACTCATCC GCGCTGTGTG
TGGAACCTGC TGAAAGCGCA CGTTTCCCGC TACACGCCGG ACGTTGTTGA AAACATCTGT
GGTACGCCAA AAGCCGACTT CCTGAAAGTG TGTGAAGTGC TGGCCTCCAC CAGCGCACCG
GATCGCACAA CCACCTTCCT GTACGCGCTG GGCTGGACGC AGCACACCGT GGGTGCGCAG
AACATCCGTA CTATGGCGAT GATCCAGTTG CTGCTCGGTA ACATGGGTAT GGCCGGTGGC
GGTGTGAACG CATTGCGTGG TCACTCCAAC ATTCAGGGTC TGACTGACTT AGGTCTGCTC
TCTACCAGCC TGCCAGGTTA TCTGACGCTG CCGTCAGAAA AACAGGTTGA TTTGCAGTCG
TATCTGGAAG CGAACACGCC TAAAGCGACG CTGGCTGATC AGGTGAACTA CTGGAGCAAC
TATCCGAAGT TCTTCGTTAG CCTGATGAAA TCTTTCTATG GCGATGCCGC GCAGAAAGAG
AACAACTGGG GCTATGACTG GCTGCCGAAG TGGGACCAGA CCTACGACGT CATCAAGTAT
TTCAACATGA TGGACGAAGG CAAAGTCACC GGTTATTTCT GCCAGGGCTT TAACCCGGTT
GCGTCCTTCC CGGATAAAAA CAAAGTGGTG AGCTGCCTGA GCAAACTGAA GTACATGGTG
GTTATCGATC CGCTGGTGAC TGAAACCTCT ACCTTCTGGC AGAACCACGG CGAGTCGAAC
GATGTCGATC CGGCGTCTAT TCAGACTGAA GTATTCCGTC TGCCTTCGAC CTGCTTTGCC
GAAGAAGATG GTTCTATCGC TAACTCCGGT CGCTGGTTGC AGTGGCACTG GAAAGGTCAG
GACGCGCCGG GCGAAGCGCG TAACGACGGC GAAATTCTGG CGGGTATCTA CCATCATCTG
CGCGAGCTGT ACCAGGCCGA AGGTGGTAAA GGCGTAGAAC CGCTGATGAA GATGAGCTGG
AACTACAAGC AGCCGCACGA ACCGCAATCT GACGAAGTGG CAAAAGAGAA CAACGGCTAT
GCGCTGGAAG ATCTCTATGA CGCAAATGGC GTGCTGATTG CGAAGAAAGG TCAGTTGCTG
AGTAGCTTTG CGCATCTGCG TGATGACGGT ACAACCGCAT CTTCTTGCTG GATCTACACC
GGTAGCTGGA CAGAGCAGGG CAACCAGATG GCTAACCGCG ATAACTCCGA CCCGTCCGGT
CTGGGGAATA CGCTTGGATG GGCCTGGGCG TGGCCGCTCA ACCGTCGCGT GCTGTACAAC
CGTGCTTCGG CGGATATCAA CGGTAAACCG TGGGATCCGA AACGGATGCT GATCCAGTGG
AACGGCAGCA AGTGGACGGG TAACGATATT CCTGACTTCG GCAATGCCGC ACCGGGTACG
CCAACCGGGC CGTTTATCAT GCAGCCGGAA GGGATGGGAC GCCTGTTTGC CATCAACAAA
ATGGCGGAAG GTCCGTTCCC GGAACACTAC GAGCCGATTG AAACGCCGCT GGGCACTAAC
CCGCTGCATC CGAACGTGGT GTCTAACCCG GTTGTTCGTC TGTATGAACA AGACGCGCTG
CGGATGGGTA AAAAAGAGCA GTTCCCGTAT GTGGGTACGA CCTATCGTCT GACCGAGCAC
TTCCACACCT GGACCAAGCA CGCATTGCTC AACGCAATTG CTCAGCCGGA ACAGTTTGTG
GAAATCAGCG AAACGCTGGC GGCGGCGAAA GGCATTAATA ATGGCGATCG TGTCACTGTC
TCAAGCAAAC GTGGCTTTAT CCGCGCGGTG GCAGTGGTAA CGCGCCGTCT GAAGCCGCTG
AATGTAAACG GTCAGCAGGT TGAAACGGTG GGTATTCCAA TCCACTGGGG CTTTGAGGGT
GTCGCGCGTA AAGGTTATAT CGCTAACACT CTGACGCCGA ATGTCGGTGA TGCAAACTCG
CAAACGCCGG AATATAAAGC GTTCTTAGTC AACATCGAGA AGGCGTAA
 
Protein sequence
MDVSRRQFFK ICAGGMAGTT VAALGFAPKQ ALAQARNYKL LRAKEIRNTC TYCSVGCGLL 
MYSLGDGAKN AREAIYHIEG DPDHPVSRGA LCPKGAGLLD YVNSENRLRY PEYRAPGSDK
WQRISWEEAF SRIAKLMKAD RDANFIEKNE QGVTVNRWLS TGMLCASGAS NETGMLTQKF
ARSLGMLAVD NQARVUHGPT VASLAPTFGR GAMTNHWVDI KNANVVMVMG GNAAEAHPVG
FRWAMEAKNN NDATLIVVDP RFTRTASVAD IYAPIRSGTD ITFLSGVLRY LIENNKINAE
YVKHYTNASL LVRDDFAFED GLFSGYDAEK RQYDKSSWNY QFDENGYAKR DETLTHPRCV
WNLLKAHVSR YTPDVVENIC GTPKADFLKV CEVLASTSAP DRTTTFLYAL GWTQHTVGAQ
NIRTMAMIQL LLGNMGMAGG GVNALRGHSN IQGLTDLGLL STSLPGYLTL PSEKQVDLQS
YLEANTPKAT LADQVNYWSN YPKFFVSLMK SFYGDAAQKE NNWGYDWLPK WDQTYDVIKY
FNMMDEGKVT GYFCQGFNPV ASFPDKNKVV SCLSKLKYMV VIDPLVTETS TFWQNHGESN
DVDPASIQTE VFRLPSTCFA EEDGSIANSG RWLQWHWKGQ DAPGEARNDG EILAGIYHHL
RELYQAEGGK GVEPLMKMSW NYKQPHEPQS DEVAKENNGY ALEDLYDANG VLIAKKGQLL
SSFAHLRDDG TTASSCWIYT GSWTEQGNQM ANRDNSDPSG LGNTLGWAWA WPLNRRVLYN
RASADINGKP WDPKRMLIQW NGSKWTGNDI PDFGNAAPGT PTGPFIMQPE GMGRLFAINK
MAEGPFPEHY EPIETPLGTN PLHPNVVSNP VVRLYEQDAL RMGKKEQFPY VGTTYRLTEH
FHTWTKHALL NAIAQPEQFV EISETLAAAK GINNGDRVTV SSKRGFIRAV AVVTRRLKPL
NVNGQQVETV GIPIHWGFEG VARKGYIANT LTPNVGDANS QTPEYKAFLV NIEKA