Gene ECH74115_4997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4997 
Symbol 
ID6972029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4649300 
End bp4650313 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content31% 
IMG OID643388678 
Productlipopolysaccharide 1,2-glucosyltransferase 
Protein accessionYP_002273105 
Protein GI209398034 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0250725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGATTTTA AACATCTTAC TCAATTTAAA GATATAATTG AACTGGACAA GCGCCCCGTT 
AAACTTGATG AACGGGAAAC GTTTAATGTC TCATGGGGTA TTGATGAGAA CTACCAGGTT
GGGGCTGCGA TTTCAATTGC TTCAATTCTT GAAAATAATA AACAAAACAA ATTTACCTTT
CACATAATCG CTGATTACTT AGACAAAGAG TATATTGAAT TATTATCACA ATTAGCAACG
AAGTATCAAA CAGTAATTAA ATTATATCAT ATTGATTCTG AGCCATTGAA GGCGCTACCT
CAATCAAATA TCTGGCCAGT ATCTATTTAT TATCGTTTGC TTTCATTTGA TTATTTTTCT
GCGCGATTGG ATTCATTATT ATATCTTGAT GCTGATATCG TCTGTAAGGG TTCATTGAAC
GAGTTAATAG CATTAGAGTT TAAAGATGAA TATGGGGCAG TGGTAATTGA TGTAGATGCT
ATGCAAAGTA AAAGCGCTGA GCGTTTGTGT AATGAGGATT TTAACGGTAG CTATTTTAAC
TCTGGTGTAA TGTATATTAA TTTACGGGAA TGGTTAAAAC AAAGACTAAC GGAAAAATTC
TTTGATCTAT TATCAGATGA GTCAATTATA AAAAAATTAA AGTACCCGGA TCAAGATATT
TTAAACTTAA TGTTTCTACA TCATGCTAAA ATATTACCGA GAAAATATAA TTGTATTTAT
ACTATAAAGT CAGAATTTGA AGAAAAAAAT AGTGAATATT ACACCCGGTT TATTAATGAT
GACACTGTCT TCATACATTA TACTGGTATA ACTAAGCCAT GGCATGATTG GGCGAACTAC
GCCTCTGCAG ATTATTTTCG TAATATTTAT AATATATCAC CATGGAGAAA TATACCTTAT
AAAAAAGCTG TTAAAAAACA TGAGTACAAA GAAAAATATA AACACTTGCT TTACCAGAAA
AAATTTCTCG ATGGTGTTTT TACAGCAATT AAATATAATG TTATGAAAGG TTAA
 
Protein sequence
MDFKHLTQFK DIIELDKRPV KLDERETFNV SWGIDENYQV GAAISIASIL ENNKQNKFTF 
HIIADYLDKE YIELLSQLAT KYQTVIKLYH IDSEPLKALP QSNIWPVSIY YRLLSFDYFS
ARLDSLLYLD ADIVCKGSLN ELIALEFKDE YGAVVIDVDA MQSKSAERLC NEDFNGSYFN
SGVMYINLRE WLKQRLTEKF FDLLSDESII KKLKYPDQDI LNLMFLHHAK ILPRKYNCIY
TIKSEFEEKN SEYYTRFIND DTVFIHYTGI TKPWHDWANY ASADYFRNIY NISPWRNIPY
KKAVKKHEYK EKYKHLLYQK KFLDGVFTAI KYNVMKG