Gene ECH74115_1652 details


Gene Information

Locus tag: ECH74115_1652
Symbol:
ID: 6967251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1596177
End bp: 1597661
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 42%
IMG OID: 643385612
Product: hypothetical protein
Protein accession: YP_002270106
Protein GI: 209400030
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID:
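
The start, end, gene length, and protein length values in the table above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1596177
END_BP = 1597661
REPORTED_GENE_LENGTH = 1485
REPORTED_PROTEIN_LENGTH = 494


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(n_bases):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


length = gene_length(START_BP, END_BP)
residues = protein_length(length)
print(length, residues)  # 1485 494
```

Under these assumptions the computed values agree with the reported gene length (1485 bp) and protein length (494 aa).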


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.916188
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAAAA TAATTACTCA TTTCAAAGTT GTTTTAACGT TACTTCTACC AGTAACCGTA 
TCTGCCCAGC AGATACAGTG GCAATCCTGT ATGGCCAGTC AATTCAACCA CTGGTTTGGT
GAGGAAAAAC CGTCTCCTGA CTTACTATGT GGTTATTTGT CTGTTCCATT AAAATATACA
GACACAGGCG GAGATGCTTC TTATGAAAAA AAATCACAAG TCAAACTAGC GTTGACAAAA
TTGCCGGCAA AAAGCAAGCA TAAAGGAAGT ATCCTGATAA TAAGTGGTGG TCCCGGGTTA
CCAGGCATAA ATCCTTATAT TAACTTTGAC TGGCCAGTCA CAAATCTTCG TGAGTCATGG
GATATTATTG GATTTGATCC TCGAGGCGTC GGGCAGTCCA CTCCGACAAT AAACTGCCGG
CAATCAGATA CAGAGACTCA GGAAAACATA ACCGAAAAGC AACAAGTATT AAATAAAATT
AATGCCTGTA TCCATAATAC CGGAGCCGAA GTCATTCGCC ATATCGGCTC TAACGAGGCT
GTATACGATA TTGATCGTAT TAGGCAAGCC TTGGGGGATA AACAACTGAC AGCCGTGGCG
TATTCGTATG GAACTCAAAT TGCAGCCTTA TATGCAGAAC GTTTTCCCTA CAACGTAAGA
TCTATCGTTC TTGATGGAGT CGTCGATATC GATGACCTGG AGGACAACTT CACATGGCAA
CTCAAACAGG CACAGAGTTA TCAGGAAACG TTTGATCGCT TTGCATCCTG GTGTGCGCGT
ACAAAAAGTT GCCCGCTTTC TTCAGACAGA GATAAGGCAA TAACTCAGTT CCATGAGCTA
TTATCAAAAT TACATCACAA ACCTTTATTA GACAGTAAGG GAGAAAATAT ATCTTCAGAT
GAACTCATAT CATTAACAAC AGACCTTCTG CTATGGCGTT CATCATGGCC AACCCTTGCA
ACTGCCATAC GCCAGTTCTC TCAGGGGATT GTCAGTAATG AAATTGAAAC TGCGCTCAGT
GCTCCGATAG CCTCAGAAGA GTCAAGTGAT GCTTCGGGGG TAATCCTTTG TGTAGATCAG
GGGGATGAGC AATTAACACC AGAAGAGCGA AAATTCCGAA AAGAGGCTCT TGCGAATGCC
TTCCCGGCTA TTAACTTTGA CAATGGACGT TCCGATTCAC CTGATTTTTG TGAATTATGG
CCAATACATA GCGACCTGAA CAAAACTCGC CTGAAAAATA CTGTTCTGCC CTCTGGTTTA
CTGTTTGTAG CACACAAATA CGACCCAACA ACGCCCTGGA TTAATGCCCG TAAGATGGCA
GAGAAATTTT CCAGCCCGTT ACTAACAATA AATGGTGATG GGCATACATT AGCTCTCACC
GGAGTTAATT TATGTGTAGA TAAAGCAGTT GTACATCACC TGATCACTCC ACAAAAAATA
GAAAATATAT ACTGCCCAGG AAATTCTGAA GCAGAAATAC AATAA
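
The GC content reported for this gene is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as shown here, with whitespace between 10-base groups. The `gc_content` helper is a hypothetical name, and only the first line of the sequence is used as sample input, so the printed value describes that fragment and not the whole gene.

```python
def gc_content(seq):
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First 60 bases of the gene sequence, copied from the record.
first_line = "ATGCGAAAAA TAATTACTCA TTTCAAAGTT GTTTTAACGT TACTTCTACC AGTAACCGTA"
print(round(gc_content(first_line), 1))
```

Passing the complete gene sequence to the same function gives the figure to compare against the reported GC content.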
 
Protein sequence
MRKIITHFKV VLTLLLPVTV SAQQIQWQSC MASQFNHWFG EEKPSPDLLC GYLSVPLKYT 
DTGGDASYEK KSQVKLALTK LPAKSKHKGS ILIISGGPGL PGINPYINFD WPVTNLRESW
DIIGFDPRGV GQSTPTINCR QSDTETQENI TEKQQVLNKI NACIHNTGAE VIRHIGSNEA
VYDIDRIRQA LGDKQLTAVA YSYGTQIAAL YAERFPYNVR SIVLDGVVDI DDLEDNFTWQ
LKQAQSYQET FDRFASWCAR TKSCPLSSDR DKAITQFHEL LSKLHHKPLL DSKGENISSD
ELISLTTDLL LWRSSWPTLA TAIRQFSQGI VSNEIETALS APIASEESSD ASGVILCVDQ
GDEQLTPEER KFRKEALANA FPAINFDNGR SDSPDFCELW PIHSDLNKTR LKNTVLPSGL
LFVAHKYDPT TPWINARKMA EKFSSPLLTI NGDGHTLALT GVNLCVDKAV VHHLITPQKI
ENIYCPGNSE AEIQ
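
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this on the first and last lines of the gene sequence. It assumes the sequence is shown on the coding strand starting at the first codon, and it builds the codon table from the standard compact layout (bases in TCAG order), which table 11 shares with the standard code for codon-to-amino-acid assignments. The `translate` helper is illustrative and is not the database's own tooling.

```python
from itertools import product

# Codon table in the standard compact layout: bases in TCAG order,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCGAAAAA TAATTACTCA TTTCAAAGTT GTTTTAACGT TACTTCTACC AGTAACCGTA"
last_line = "GAAAATATAT ACTGCCCAGG AAATTCTGAA GCAGAAATAC AATAA"

print(translate(first_line))  # MRKIITHFKVVLTLLLPVTV
print(translate(last_line))   # ENIYCPGNSEAEIQ
```

The two outputs match the first 20 and last 14 residues of the protein sequence. Each full line of the gene sequence holds 60 bases, so every line starts in frame and can be translated on its own.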