Gene EcDH1_3913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3913 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4215215 
End bp4217362 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content54% 
IMG OID 
Productformate dehydrogenase, alpha subunit 
Protein accessionACX41513 
Protein GI260451091 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TCGTCACGGT TTGCCCCTAT TGCGCATCAG GTTGCAAAAT CAACCTGGTC 
GTCGATAACG GCAAAATCGT CCGGGCGGAG GCAGCGCAGG GGAAAACCAA CCAGGGTACC
CTGTGTCTGA AGGGTTATTA TGGCTGGGAC TTCATTAACG ATACCCAGAT CCTGACCCCG
CGCCTGAAAA CCCCCATGAT CCGTCGCCAG CGTGGCGGCA AACTCGAACC TGTTTCCTGG
GATGAGGCAC TGAATTACGT TGCCGAGCGC CTGAGCGCCA TCAAAGAGAA GTACGGTCCG
GATGCCATCC AGACGACCGG CTCCTCGCGT GGTACGGGTA ACGAAACCAA CTATGTAATG
CAAAAATTTG CGCGCGCCGT TATTGGTACC AATAACGTTG ACTGCTGCGC TCGTGTCTGA
CACGGCCCAT CGGTTGCAGG TCTGCACCAA TCGGTCGGTA ATGGCGCAAT GAGCAATGCT
ATTAACGAAA TTGATAATAC CGATTTAGTG TTCGTTTTCG GGTACAACCC GGCGGATTCC
CACCCAATCG TGGCGAATCA CGTAATTAAC GCTAAACGTA ACGGGGCGAA AATTATCGTC
TGCGATCCGC GCAAAATTGA AACCGCGCGC ATTGCTGACA TGCACATTGC ACTGAAAAAC
GGCTCGAACA TCGCGCTGTT GAATGCGATG GGCCATGTCA TTATTGAAGA AAATCTGTAC
GACAAAGCGT TCGTCGCTTC ACGTACAGAA GGCTTTGAAG AGTATCGTAA AATCGTTGAA
GGCTACACGC CGGAGTCGGT TGAAGATATC ACCGGCGTCA GCGCCAGTGA GATTCGTCAG
GCGGCACGGA TGTATGCCCA GGCGAAAAGC GCCGCCATCC TGTGGGGCAT GGGTGTAACC
CAGTTCTACC AGGGCGTGGA AACCGTGCGT TCTCTGACCA GCCTCGCGAT GCTGACCGGT
AACCTCGGTA AGCCGCATGC GGGTGTTAAC CCGGTTCGTG GTCAGAACAA CGTTCAGGGT
GCCTGCGATA TGGGCGCGCT GCCGGATACG TATCCGGGAT ACCAGTACGT GAAAGATCCG
GCTAACCGCG AGAAATTCGC CAAAGCCTGG GGCGTGGAAA GCCTGCCAGC GCATACCGGC
TATCGCATCA GCGAGCTGCC GCACCGCACA GCGCATGGCG AAGTGCGTGC CGCGTACATT
ATGGGCGAAG ATCCGCTACA AACTGACGCG GAGCTGTCGG CAGTACGTAA AGCCTTTGAA
GATCTGGAAC TGGTTATCGT TCAGGACATC TTTATGACCA AAACCGCGTC GGCGGCGGAT
GTTATTTTAC CGTCAACGTC GTGGGGCGAG CATGAAGGCG TGTTTACTGC GGCTGACCGT
GGCTTCCAGC GTTTCTTCAA GGCGGTTGAA CCGAAATGGG ATCTGAAAAC GGACTGGCAA
ATCATCAGTG AAATCGCCAC CCGTATGGGT TATCCGATGC ACTACAACAA CACCCAGGAG
ATCTGGGATG AGTTGCGTCA TCTGTGCCCG GATTTCTACG GTGCGACTTA CGAGAAAATG
GGCGAACTGG GCTTCATTCA GTGGCCTTGC CGCGATACTT CAGATGCCGA TCAGGGGACT
TCTTATCTGT TTAAAGAGAA GTTTGATACC CCGAACGGTC TGGCGCAGTT CTTCACCTGC
GACTGGGTAG CGCCAATCGA CAAACTCACC GACGAGTACC CGATGGTACT GTCAACGGTG
CGTGAAGTTG GTCACTACTC TTGCCGTTCG ATGACCGGTA ACTGTGCGGC ACTGGCGGCG
CTGGCTGATG AACCTGGCTA CGCACAAATC AATACCGAAG ACGCCAAACG TCTGGGTATT
GAAGATGAGG CATTGGTTTG GGTGCACTCG CGTAAAGACA AAATTATCAC CCGTGCGCAG
GTCAGCGATC GTCCGAACAA AGGGGCGATT TACATGACCT ACCAGTGGTG GATTGGTGCC
TGTAACGAGC TGGTTACCGA AAACTTAAGC CCGATTACGA AAACGCCGGA GTACAAATAC
TGCGCCGTTC GCGTCGAGCC GATCGCCGAT CAGCGCGCCG CCGAGCAGTA CGTGATTGAC
GAGTACAACA AGTTGAAAAC TCGCCTGCGC GAAGCGGCAC TGGCGTAA
 
Protein sequence
MKKVVTVCPY CASGCKINLV VDNGKIVRAE AAQGKTNQGT LCLKGYYGWD FINDTQILTP 
RLKTPMIRRQ RGGKLEPVSW DEALNYVAER LSAIKEKYGP DAIQTTGSSR GTGNETNYVM
QKFARAVIGT NNVDCCARVU HGPSVAGLHQ SVGNGAMSNA INEIDNTDLV FVFGYNPADS
HPIVANHVIN AKRNGAKIIV CDPRKIETAR IADMHIALKN GSNIALLNAM GHVIIEENLY
DKAFVASRTE GFEEYRKIVE GYTPESVEDI TGVSASEIRQ AARMYAQAKS AAILWGMGVT
QFYQGVETVR SLTSLAMLTG NLGKPHAGVN PVRGQNNVQG ACDMGALPDT YPGYQYVKDP
ANREKFAKAW GVESLPAHTG YRISELPHRT AHGEVRAAYI MGEDPLQTDA ELSAVRKAFE
DLELVIVQDI FMTKTASAAD VILPSTSWGE HEGVFTAADR GFQRFFKAVE PKWDLKTDWQ
IISEIATRMG YPMHYNNTQE IWDELRHLCP DFYGATYEKM GELGFIQWPC RDTSDADQGT
SYLFKEKFDT PNGLAQFFTC DWVAPIDKLT DEYPMVLSTV REVGHYSCRS MTGNCAALAA
LADEPGYAQI NTEDAKRLGI EDEALVWVHS RKDKIITRAQ VSDRPNKGAI YMTYQWWIGA
CNELVTENLS PITKTPEYKY CAVRVEPIAD QRAAEQYVID EYNKLKTRLR EAALA