Gene ECH74115_4994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4994 
SymbolrfaC 
ID6969335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4645916 
End bp4646908 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content50% 
IMG OID643388675 
ProductADP-heptose:LPS heptosyl transferase I 
Protein accessionYP_002273102 
Protein GI209399933 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02193] lipopolysaccharide heptosyltransferase I 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.041745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTTT TGATCGTTAA AACATCGTCG ATGGGCGATG TTCTCCATAC GTTGCCCGCA 
CTCACTGATG CCCAGCAGGC AATCCCAGAA ATTAAGTTTG ACTGGGTGGT GGAAGAAGGG
TTCGCACAGA TTCCTTCCTG GCACGCTGCC GTTGAGCGAG TTATTCCTGT GGCAATACGT
CGCTGGCGTA AAGCCTGGTT CTCGGCCCCT ATAAAAGCTG AACGCAAAGC GTTTCGTGAA
GCGCTACAAG CAGAGAACTA TGACGCAGTT ATCGACGCTC AGGGGCTGGT AAAAAGCGCG
GCGCTGGTTA CGCGTCTGGC ACATGGCGTA AAGCATGGCA TGGACTGGCA AACCGCTCGC
GAACCTTTAG CCAGCCTGTT TTACAATCGT AAGCATCATA TTGCAAAACA GCAGCACGCC
GTAGAACGCA CCCGCGAACT GTTTGCCAAA AGTTTGGGCT ATAGCAAACC GCAAACCCAG
GGCGATTATG CTATCGCACA GCATTTTCTG ACGAACCTGC CTACAGATGC TGGCGAATAT
GCCGTATTTC TTCATGCGAC GACCCGTGAT GATAAACACT GGCCGGAAGA ACACTGGCGA
GAATTGATTG GTTTACTGGC TGATTCAGGA ATACGGATTA AACTTCCGTG GGGCGCGCCG
CATGAGGAAG AACGGGCGAA ACGACTGGCG GAAGGATTTG CTTATGTTGA AGTATTGCCG
AAGATGAGTC TGGAAGGCGT TGCCCGCGTG CTGGCCGGGG CTAAATTTGT AGTGTCGGTG
GATACGGGGT TAAGCCATTT AACGGCGGCA CTGGATAGAC CCAATATCAC GGTTTATGGA
CCAACCGATC CGGGATTAAT TGGTGGGTAT GGGGAGAATC AAGTAGAGTG TCGTTCTACA
AGTATGTCTC TTGCAGATTT GCCAGCTCAA ACCGTTTTTC AAAATTTAAA TCTTGAAATA
ATAACCAATA AGTTGACATC GGAGATAAGA TGA
 
Protein sequence
MRVLIVKTSS MGDVLHTLPA LTDAQQAIPE IKFDWVVEEG FAQIPSWHAA VERVIPVAIR 
RWRKAWFSAP IKAERKAFRE ALQAENYDAV IDAQGLVKSA ALVTRLAHGV KHGMDWQTAR
EPLASLFYNR KHHIAKQQHA VERTRELFAK SLGYSKPQTQ GDYAIAQHFL TNLPTDAGEY
AVFLHATTRD DKHWPEEHWR ELIGLLADSG IRIKLPWGAP HEEERAKRLA EGFAYVEVLP
KMSLEGVARV LAGAKFVVSV DTGLSHLTAA LDRPNITVYG PTDPGLIGGY GENQVECRST
SMSLADLPAQ TVFQNLNLEI ITNKLTSEIR