Gene EcolC_4125 details

Gene Information

Locus tag: EcolC_4125
Symbol:
ID: 6066050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4552541
End bp: 4553443
Gene Length: 903 bp
Protein Length: 300 aa
Translation table: 11
GC content: 55%
IMG OID: 641603546
Product: formate dehydrogenase, beta subunit
Protein accession: YP_001727049
Protein GI: 170022095
COG category: [C] Energy production and conversion
COG ID: [COG0437] Fe-S-cluster-containing hydrogenase components 1
TIGRFAM ID: [TIGR01582] formate dehydrogenase, beta subunit, Fe-S containing
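
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the CDS ends in a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4552541
end_bp = 4553443
gene_length_bp = 903
protein_length_aa = 300


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Both checks agree with the record: 4553443 - 4552541 + 1 = 903 bp, and 903 / 3 - 1 = 300 aa.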


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTATC AATCTCAAGA TATCATTCGT CGTTCCGCGA CTAACGGTCT GACCCCCGCG 
CCTCAGGCGC GGGACTTCCA GGAAGAAGTG GCGAAACTCA TCGACGTTAC CACCTGTATC
GGCTGTAAAG CCTGTCAGGT GGCGTGTTCA GAGTGGAACG ACATTCGCGA TACCGTCGGC
AATAACATTG GGGTGTACGA CAACCCCAAT GATTTAAGCG CCAAATCGTG GACGGTAATG
CGCTTCTCGG AAGTGGAGCA GAACGACAAA CTGGAATGGC TGATCCGCAA AGACGGCTGT
ATGCACTGTT CCGATCCAGG CTGCCTGAAA GCGTGCCCGG CGGAAGGTGC AATTATTCAG
TATGCCAACG GTATCGTCGA CTTCCAGTCC GAGCAGTGCA TTGGCTGCGG TTATTGCATT
GCGGGCTGTC CGTTCGACAT TCCGCGCCTC AACCCGGAAG ACAACCGCGT CTACAAATGT
ACGCTGTGCG TTGACCGCGT GGTGGTTGGG CAAGAACCAG CCTGCGTGAA GACCTGCCCA
ACGGGCGCGA TTCACTTCGG TACGAAAGAG TCGATGAAAA CGCTGGCGAG CGAGCGCGTT
GCTGAGCTGA AAACCCGCGG TTACGACAAT GCGGGTCTGT ACGATCCGGC AGGCGTCGGT
GGTACACACG TCATGTACGT GCTGCACCAT GCTGACAAGC CAAATCTGTA TCATGGCTTG
CCGGAGAACC CGGAAATCAG CGAAACCGTG AAATTCTGGA AAGGCATCTG GAAACCGCTC
GCGGCTGTTG GCTTTGCGGC TACCTTTGCG GCCAGTATCT TCCACTACGT GGGTGTCGGT
CCGAACCGTG CGGATGAGGA AGAGAATAAT CTGCACGAAG AGAAAGACGA GGAGCGCAAA
TGA
 
Protein sequence
MAYQSQDIIR RSATNGLTPA PQARDFQEEV AKLIDVTTCI GCKACQVACS EWNDIRDTVG 
NNIGVYDNPN DLSAKSWTVM RFSEVEQNDK LEWLIRKDGC MHCSDPGCLK ACPAEGAIIQ
YANGIVDFQS EQCIGCGYCI AGCPFDIPRL NPEDNRVYKC TLCVDRVVVG QEPACVKTCP
TGAIHFGTKE SMKTLASERV AELKTRGYDN AGLYDPAGVG GTHVMYVLHH ADKPNLYHGL
PENPEISETV KFWKGIWKPL AAVGFAATFA ASIFHYVGVG PNRADEEENN LHEEKDEERK
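
The sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage for the gene sequence. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the exact method the database used for its GC content field is not stated in the record, so a small difference from that value would come down to rounding.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCTTATC AATCTCAAGA TATCATTCGT CGTTCCGCGA CTAACGGTCT GACCCCCGCG"

print(len(clean_sequence(first_line)))  # 60 bases on one full line
print(round(gc_percent(first_line), 1))
```

Pasting the full gene sequence into `clean_sequence` should give a string whose length matches the Gene Length field, and the same function applied to the protein sequence should match the Protein Length field.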