Gene YpsIP31758_4118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_4118 
Symbol 
ID5384597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4646611 
End bp4647582 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content54% 
IMG OID640867147 
Productputative aerobic formate dehydrogenase, iron-sulfur subunit 
Protein accessionYP_001403061 
Protein GI153948809 
COG category[C] Energy production and conversion 
COG ID[COG0437] Fe-S-cluster-containing hydrogenase components 1 
TIGRFAM ID[TIGR01582] formate dehydrogenase, beta subunit, Fe-S containing 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACTGC AAACTCAAGA CATTATCCGG CGCTCCGGTA CTAATTCCCT TACGCCGCCA 
CCTCAGGTCC GTGATCATCA GGAGCCGGTG GCTAAACTTA TCGATGTCAC CACCTGTATC
GGCTGTAAAG CCTGTCAGGT GGCGTGTTCA GAGTGGAACG ATATCCGCGA TGAAGTCGGT
CATAACGTCG GGGTGTATGA CAACCCCGCC GATTTGACCG CTAAGTCGTG GACAGTGATG
CGTTTCTCTG AAGTTGAAGA TGAAGAGAGC GGCAAGCTGG AGTGGCTGAT CCGCAAGGAT
GGCTGTATGC ATTGCGCCGA TCCGGGCTGC CTGAAGGCAT GTCCATCGGA AGGGGCGATC
ATTCAGTACG CCAACGGTAT CGTTGATTTC CAATCAGAAC ATTGTATTGG TTGTGGCTAC
TGCATCGCAG GTTGTCCGTT CGATGTCCCG CGCATGAATA AAGATGACAA TCGGGTGTAT
AAATGCACCT TGTGTGTCGA TCGTGTCGGT GTTGGTCAGG AACCTGCTTG TGTGAAAACC
TGCCCGACTG GAGCGATTCA CTTTGGTACC AAAGAGTCGA TGAAAGAAGT GGCGGCTGGC
CGGGTTGCTG AGCTAAAAAC CCGTGGGTTT GATAACGCAG GGTTATATGA CCCTGCGGGC
GTCGGCGGTA CCCATGTGAT GTATGTACTG CATCATGCGG ATAAACCCCA GCTTTATCAT
GGCCTGCCGG AGAATCCGAC CATCAGTCCG ACGGTGACTT TCTGGAAAGG CATCTGGAAA
CCGTTGGCTG CGGTAGGTTT CGCGGCGACC TTCGCTGCCA GTATCTTCCA TTACGTTGGC
GTAGGCCCGA ACCGGGTGGA GGAAGAGGAA GAAGACGATG AGACAACGGA TCCTACCCCT
TCCGAGACGG TAGCAAAGGC ACCAGAGCAG ACAACCTCTG AGCGCTCAGA CGAAGGGGAA
ACGCGGAAAT GA
 
Protein sequence
MSLQTQDIIR RSGTNSLTPP PQVRDHQEPV AKLIDVTTCI GCKACQVACS EWNDIRDEVG 
HNVGVYDNPA DLTAKSWTVM RFSEVEDEES GKLEWLIRKD GCMHCADPGC LKACPSEGAI
IQYANGIVDF QSEHCIGCGY CIAGCPFDVP RMNKDDNRVY KCTLCVDRVG VGQEPACVKT
CPTGAIHFGT KESMKEVAAG RVAELKTRGF DNAGLYDPAG VGGTHVMYVL HHADKPQLYH
GLPENPTISP TVTFWKGIWK PLAAVGFAAT FAASIFHYVG VGPNRVEEEE EDDETTDPTP
SETVAKAPEQ TTSERSDEGE TRK