Gene ECH74115_2849 details

Gene Information

Locus tag: ECH74115_2849
Symbol:
ID: 6970027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2652487
End bp: 2653545
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 55%
IMG OID: 643386697
Product: phospholipase, patatin family
Protein accession: YP_002271168
Protein GI: 209396477
COG category: [R] General function prediction only
COG ID: [COG1752] Predicted esterase of the alpha-beta hydrolase superfamily
TIGRFAM ID:
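
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record; coordinates assumed to be 1-based and inclusive.
start_bp = 2652487
end_bp = 2653545

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1059 352
```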


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.41074
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACGG GGCTGGTGCT GTCCGGTGGC GGGGCGGTGG GCGCTTATCA GGCGGGAGTG 
GTTAAGGCAC TGGCAGAGTG TGGTACACAG ATCAGCATGG TTTCAGGGGC CAGCATTGGC
GCATTCAATG GTGCCATTAT CGCGGCCTCT ACCGATCTGT CAGAAGCTGC CGTACGCCTG
GAGGCGCTCT GGGATCATCT GGGGAATAAT CAGGTGCTGT CGGTAAACAG ATTGGTTTAC
TTTTCATTGC TGAAAAAATT GTTCCAGGCA ATGAACCTCT GCCAGATCCC CGGACGTGCA
GGAGCACTGC TTACGACGCT TCTTCGCCAT ATATCGACAA TCAACGGGTT TGACAATCTG
ATGGCTCAGC CGTTGTTGTC AGATGAGCCC CTGACAGCGC TGATGGATCA TTATCTTGAT
ACTGATGCTC TGGCAGACGG GCTACCGCTG TATGTGTCGC TGTACCCCAC AGAAGGGGGC
ATGCAGGATA TTATTGACTG CATTCGTGCT GAACTGGGTG TCGGAACCAC GAAAAACGCC
GTTTTTCAGC ATATCCAGAG CCTGCCCCGC GGACAGCAGA AAGAGGCTCT GCTTGCGTCA
GCCGCGCTGC CCCTGCTGTT CCGTCCCCGT GAGGTTCAGG GGACAATGTT CGGTGATGGT
GGTATGGGAG GATGGCGAAA TATGCAGGGA AATACCCCTG TGACGCCTCT GGTCGATGCC
GGATGCAATA TGGTGATTGT GACGCATCTG AGTGACGGTT CTTTATGGGA TCGCCAGGCT
TTTCCGGACA CCACAATCCT TGAGATCCGT CCCCGGAAAA GGCTGAAATA TGCAGGTGAT
GGTGGCAACA GCGGCGGTCT GCTCAGTTTT ACATCGGCAC ATACCGACGC CTGGCGTCAG
CAGGGCTATG AAGACACGAT GCTGGCGATG GAGCATATCC GGAAACCGCT GGCAGCACGT
CAGGCACTGA CCCGGTCAGA GGCGGTATTG CAGAAAAGCC TGGATATAAC GGAAGAGGCA
GATTTGGCAC TGAGAAACGC GATGGCCCGG ATTAAATAA
 
Protein sequence
MKTGLVLSGG GAVGAYQAGV VKALAECGTQ ISMVSGASIG AFNGAIIAAS TDLSEAAVRL 
EALWDHLGNN QVLSVNRLVY FSLLKKLFQA MNLCQIPGRA GALLTTLLRH ISTINGFDNL
MAQPLLSDEP LTALMDHYLD TDALADGLPL YVSLYPTEGG MQDIIDCIRA ELGVGTTKNA
VFQHIQSLPR GQQKEALLAS AALPLLFRPR EVQGTMFGDG GMGGWRNMQG NTPVTPLVDA
GCNMVIVTHL SDGSLWDRQA FPDTTILEIR PRKRLKYAGD GGNSGGLLSF TSAHTDAWRQ
QGYEDTMLAM EHIRKPLAAR QALTRSEAVL QKSLDITEEA DLALRNAMAR IK
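
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the tool used to generate the record: `translate_cds` is a hypothetical helper, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks.

    A single terminal stop codon, if present, is dropped from the result.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate_cds("ATGAAAACGG GGCTGGTGCT GTCCGGTGGC"))  # MKTGLVLSGG
print(translate_cds("ATTAAATAA"))                          # IK
```

Passing the full gene sequence (with its spaces and line breaks left in) should be handled the same way, since whitespace is stripped before translation.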