Gene EcSMS35_0627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0627 
SymbolahpF 
ID6146440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp640457 
End bp642052 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content53% 
IMG OID641615519 
Productalkyl hydroperoxide reductase subunit F 
Protein accessionYP_001742725 
Protein GI170682192 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3634] Alkyl hydroperoxide reductase, large subunit 
TIGRFAM ID[TIGR03140] alkyl hydroperoxide reductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGATGT TTAAAGCCCA GGAGATAAAC ATGCTCGACA CAAATATGAA AACTCAACTC 
AAGGCTTACC TTGAGAAATT GACCAAGCCT GTTGAGTTAA TTGCCACGCT GGATGACAGC
GCTAAATCGG CAGAAATCAA GGAACTGTTG GCTGAAATCG CAGAACTGTC AGACAAAGTC
ACCTTTAAAG AAGATAACAG CTTGCCAGTG CGTAAGCCGT CTTTCCTGAT CACCAACCCA
GGTTCCAACC AGGGACCACG TTTTGCAGGC TCCCCGCTGG GCCACGAGTT CACCTCGCTG
GTACTGGCGT TGCTGTGGAC CGGTGGTCAT CCGTCGAAAG AAGCGCAGTC TCTGCTGGAG
CAGATTCGCC ATATTGACGG TGATTTTGAA TTCGAAACCT ATTACTCGCT CTCTTGCCAC
AACTGCCCGG ACGTGGTGCA GGCGCTGAAC CTGATGAGCG TACTGAACCC GCGCATCAAG
CACACTGCAA TTGACGGCGG CACCTTCCAG AACGAAATCA CCGATCGCAA CGTGATGGGC
GTTCCGGCAG TGTTCGTAAA CGGGAAAGAG TTTGGTCAGG GCCGCATGAC GTTGACTGAA
ATCGTTGCCA AAATTGATAC GGGTGCGGAA AAACGTGCGG CAGAAGAGCT GAACAAGCGT
GATGCTTATG ACGTATTAAT CGTTGGTTCC GGCCCGGCGG GTGCAGCGGC AGCAATTTAC
TCCGCACGTA AAGGCATCCG TACCGGTCTG ATGGGCGAAC GTTTTGGTGG TCAGATCCTC
GATACCGTTG ATATCGAAAA CTACATTTCT GTACCGAAGA CCGAAGGCCA GAAACTGGCA
GGTGCGCTGA AAGTTCATGT TGATGAATAC GACGTTGATG TGATCGACAG CCAGAGCGCG
AGCAAACTGA TCCCGGCAGC GGTTGAAGGC GGCCTGCATC AGATTGAAAC AGCTTCTGGC
GCGGTACTGA AAGCACGCAG CATTATCGTG GCGACCGGTG CAAAATGGCG CAACATGAAC
GTTCCTGGCG AAGATCAGTA TCGCACCAAA GGCGTGACCT ACTGCCCGCA CTGCGACGGC
CCGCTGTTTA AAGGCAAACG CGTAGCGGTT ATCGGCGGCG GTAACTCCGG CGTGGAAGCG
GCAATTGACC TGGCGGGTAT CGTTGAGCAC GTAACGCTGC TGGAATTTGC GCCAGAAATG
AAAGCCGACC AGGTTCTGCA GGACAAACTG CGCAGCCTGA AAAACGTCGA CATTATTCTG
AATGCGCAAA CCACGGAAGT GAAAGGCGAC GGTAGCAAAG TCGTAGGTCT GGAATATCGC
GATCGTGTCA GCGGCGATAT TCACAACATC GAACTGGCCG GTATTTTCGT CCAGATTGGT
CTGCTGCCGA ACACCAACTG GCTGGAAGGC GCAGTCGAAC GTAACCGCAT GGGCGAGATT
ATCATTGATG CGAAATGCGA AACCAACGTC AAAGGCGTGT TCGCAGCGGG TGACTGTACG
ACGGTTCCGT ACAAGCAGAT CATCATCGCT ACTGGCGAAG GTGCCAAAGC CTCTCTGAGT
GCTTTTGACT ACCTGATTCG CACCAAAACT GCATAA
 
Protein sequence
MMMFKAQEIN MLDTNMKTQL KAYLEKLTKP VELIATLDDS AKSAEIKELL AEIAELSDKV 
TFKEDNSLPV RKPSFLITNP GSNQGPRFAG SPLGHEFTSL VLALLWTGGH PSKEAQSLLE
QIRHIDGDFE FETYYSLSCH NCPDVVQALN LMSVLNPRIK HTAIDGGTFQ NEITDRNVMG
VPAVFVNGKE FGQGRMTLTE IVAKIDTGAE KRAAEELNKR DAYDVLIVGS GPAGAAAAIY
SARKGIRTGL MGERFGGQIL DTVDIENYIS VPKTEGQKLA GALKVHVDEY DVDVIDSQSA
SKLIPAAVEG GLHQIETASG AVLKARSIIV ATGAKWRNMN VPGEDQYRTK GVTYCPHCDG
PLFKGKRVAV IGGGNSGVEA AIDLAGIVEH VTLLEFAPEM KADQVLQDKL RSLKNVDIIL
NAQTTEVKGD GSKVVGLEYR DRVSGDIHNI ELAGIFVQIG LLPNTNWLEG AVERNRMGEI
IIDAKCETNV KGVFAAGDCT TVPYKQIIIA TGEGAKASLS AFDYLIRTKT A