Gene EcSMS35_2140 details

Gene Information

Locus tag: EcSMS35_2140
Symbol: hyaF
ID: 6144796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2149975
End bp: 2150832
Gene Length: 858 bp
Protein Length: 285 aa
Translation table: 11
GC content: 55%
IMG OID: 641617016
Product: hydrogenase-1 operon protein HyaF
Protein accession: YP_001744191
Protein GI: 170681914
COG category:
COG ID:
TIGRFAM ID:
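
The start, end, and length fields above are related by simple arithmetic. The following minimal Python sketch checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2149975
end_bp = 2150832

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 858 bp
print(protein_length, "aa")  # 285 aa
```

Under these assumptions the computed values agree with the listed Gene Length (858 bp) and Protein Length (285 aa).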


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.634628
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.881946
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAA CTTTTTTCCA TCTGCTGGGG CCAGGAACGC AACCGAACGA TGACAGTTTC 
AGCATGAATC CACTGCCGAT CACCTGTCAG GTGAATGATG AACCGAGTAT GGCGGCCCTG
GAGCAATGTG CTCACAGCCC GCAGGTGATT GCGCTGTTAA ACGAGTTACA ACATCAACTA
AGCGAACGCC AGCCGCCGTT GGGCGAGGTG CTGGCAGTCG ATCTGTTAAA TCTCAACGCC
GACGATCGTC ACTTTATCAA TACGCTTCTC GGGGAAGGGG AAGTGTCAGT GCGCATACAG
CAGGCTGACG ACAGTGAAAG TGAAATACAG GAGGCGATCT TCTGCGGATT ATGGCGGGTG
CGCAGACGTC GCGGCGACAA GTTGCTGGAG GACAAACTGG AGGCTGGCTG CGCGCCGCTG
GCATTGTGGC AGGCGGCAAC GCAAAACGTC TTGCCGACAG ATTCGCTGTT ACCGCCGCCC
ATTGATGGCC TGATGAATGG CCTACCGTTG GCGCATGAGT TACTGGCGCA TGTACGTAAC
CCCGACGCGC AGCCGCACAG CATTAATCTG ACGCAATTAC CCATCAGCGA GGCTGATCGG
CTTTTTCTCT CACGTCTCTG TGGGCCGGGA AATATTCAGA TTCGTACCAT TGGCTATGGC
GAGAGCTATA TCAACGCCAC GGGGTTACGC CATGTCTGGC ATTTACGCTG TACGGACACC
TTAAAAGGCC CGTTACTGGA AAGTTATGAA ATCTGCCCAA TACCGGAAGT GGTGCTGGCA
GCGCCAGAAG ATTTGGTCGA CTCTGCGCAG CGGCTTAGCG AGGTATGTCA GTGGCTGGCG
GAAGGTGCAC CGACATAA
 
Protein sequence
MSETFFHLLG PGTQPNDDSF SMNPLPITCQ VNDEPSMAAL EQCAHSPQVI ALLNELQHQL 
SERQPPLGEV LAVDLLNLNA DDRHFINTLL GEGEVSVRIQ QADDSESEIQ EAIFCGLWRV
RRRRGDKLLE DKLEAGCAPL ALWQAATQNV LPTDSLLPPP IDGLMNGLPL AHELLAHVRN
PDAQPHSINL TQLPISEADR LFLSRLCGPG NIQIRTIGYG ESYINATGLR HVWHLRCTDT
LKGPLLESYE ICPIPEVVLA APEDLVDSAQ RLSEVCQWLA EGAPT
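
As a rough illustration of how the protein sequence relates to the gene sequence, the Python sketch below translates the first 60 and last 18 nucleotides of the gene and compares the results with the matching ends of the protein. It is a sketch under stated assumptions, not the method used to build this record: the codon table is the standard genetic code, which shares its codon assignments with translation table 11 (the two differ only in permitted start codons, which does not matter here because the gene starts with ATG). The `translate` helper and the constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G order.
# Assumption: identical to translation table 11 for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_60_nt = "ATGAGCGAAA CTTTTTTCCA TCTGCTGGGG CCAGGAACGC AACCGAACGA TGACAGTTTC"
last_18_nt = "GAAGGTGCAC CGACATAA"

print(translate(first_60_nt))  # MSETFFHLLGPGTQPNDDSF
print(translate(last_18_nt))   # EGAPT
```

Both fragments are in frame because 840 nucleotides, a multiple of 3, precede the final 18. The outputs match the first 20 and last 5 residues of the listed protein sequence.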