Gene ECH74115_4993 details


Gene Information

Locus tag: ECH74115_4993
Symbol: rfaF
ID: 6971739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4644866
End bp: 4645912
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 54%
IMG OID: 643388674
Product: ADP-heptose:LPS heptosyltransferase II
Protein accession: YP_002273101
Protein GI: 209398299
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02195] lipopolysaccharide heptosyltransferase II
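
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4644866
end_bp = 4645912

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein length.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1047 0 348
```

Under these assumptions the computed values agree with the listed gene length (1047 bp) and protein length (348 aa).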


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000211589
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAC TGGTGATCGG CCCGTCTTGG GTTGGCGACA TGATGATGTC GCAAAGTCTC 
TATCGCACGC TCCAGGCGCG CTATCCCCAG GCGATAATCG ATGTGATGGC ACCGGCATGG
TGCCGTCCAT TATTATCGCG GATGCCGGAA GTTAACGAAG CTATTCCTAT GCCTCTCGGT
CACGGAGCGC TGGAAATCGG CGAACGCCGC AAACTGGGTC ATAGTCTGCG TGAAAAGCGC
TACGACCGCG CCTACGTCTT ACCTAACTCC TTCAAATCCG CATTAGTGCC TTTCTTCGCG
GGTATTCCTC ATCGCACCGG CTGGCGCGGC GAGATGCGCT ACGGTTTACT CAACGATGTA
CGCGTGCTCG ATAAAGAAGC CTGGCCGCTA ATGGTGGAAC GCTATGTCGC GCTGGCCTAT
GACAAAGGCA TTATGCGCAC AGCACAAGAT CTGCCGCAGC CATTGTTATG GCCGCAGTTG
CAGGTGAGCG AAGGTGAAAA ATCATATACC TGTAATCAAT TTTCGCTTTC ATCAGAACGT
CCGATGATTG GTTTTTGCCC GGGTGCGGAG TTTGGTCCGG CAAAACGCTG GCCACACTAC
CACTATGCGG AGCTGGCAAA GCAGCTGATT GATGAAGGTT ATCAGGTGGT TCTGTTTGGC
TCTGCGAAAG ATCATGAAGC AGGCAATGAG ATTCTTGCCG CTTTGAATAC CGAGCAGCAG
GCATGGTGTC GGAACCTGGC GGGGGAAACA CAGCTTGATC AAGCGGTTAT CCTGATTGCA
GCCTGTAAAG CCATTGTCAC TAACGATTCT GGCCTGATGC ACGTTGCGGC GGCGCTCAAT
CGTCCGCTGG TTGCCCTGTA TGGTCCGAGT AGCCCGGACT TCACACCGCC GCTATCCCAT
AAAGCGCGCG TGATCCGCCT GATTACCGGC TATCACAAAG TGCGTAAAGG TGACGCTGCG
GAGGGTTATC ACCAGAGCTT GATCGACATT ACTCCCCAGC GCGTACTGAA AGAACTCAAC
GCGCTATTGT TACAAGAGGA AGCCTGA
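
The GC content field can be recomputed from a gene sequence printed in 10-base blocks like the one above. This is a minimal sketch, not the database's own method: the helper name `gc_percent` is hypothetical, and it assumes the percentage is simply the share of G and C among all bases, which the record may round differently.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on the first 60 bp row only; paste all rows to check the whole gene.
first_row = "ATGAAAATAC TGGTGATCGG CCCGTCTTGG GTTGGCGACA TGATGATGTC GCAAAGTCTC"
print(f"{gc_percent(first_row):.1f}% GC in the first row")
```

Because the function strips all whitespace first, the sequence block can be pasted in as printed, without removing the spaces between 10-base groups.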
 
Protein sequence
MKILVIGPSW VGDMMMSQSL YRTLQARYPQ AIIDVMAPAW CRPLLSRMPE VNEAIPMPLG 
HGALEIGERR KLGHSLREKR YDRAYVLPNS FKSALVPFFA GIPHRTGWRG EMRYGLLNDV
RVLDKEAWPL MVERYVALAY DKGIMRTAQD LPQPLLWPQL QVSEGEKSYT CNQFSLSSER
PMIGFCPGAE FGPAKRWPHY HYAELAKQLI DEGYQVVLFG SAKDHEAGNE ILAALNTEQQ
AWCRNLAGET QLDQAVILIA ACKAIVTNDS GLMHVAAALN RPLVALYGPS SPDFTPPLSH
KARVIRLITG YHKVRKGDAA EGYHQSLIDI TPQRVLKELN ALLLQEEA
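
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact TCAG-ordered amino acid string. Translation table 11 uses the same amino acid assignments as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, that difference is assumed not to matter here. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bp and last 27 bp of the gene sequence above.
# The last row is in frame because the 1020 bp before it are a multiple of 3.
print(translate("ATGAAAATAC TGGTGATCGG CCCGTCTTGG"))  # MKILVIGPSW
print(translate("GCGCTATTGT TACAAGAGGA AGCCTGA"))     # ALLLQEEA
```

Both fragments match the corresponding ends of the protein sequence listed above. Passing the full gene sequence to `translate` allows the same comparison over all 348 residues.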