Gene ECH74115_1398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1398 
Symbol 
ID6969206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1393991 
End bp1395049 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content55% 
IMG OID643385372 
Productphospholipase, patatin family 
Protein accessionYP_002269867 
Protein GI209398448 
COG category[R] General function prediction only 
COG ID[COG1752] Predicted esterase of the alpha-beta hydrolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.326586 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGG GGCTGGTGCT GTCCGGTGGC GGGGCGGTGG GCGCTTATCA GGCGGGAGTG 
GTTAAGGCAC TGGCAGAGTG TGGTACACAG ATCAGCATGG TTTCAGGGAC CAGCATTGGC
GCATTCAATG GTGCCATTAT CGCGGCCTCT CCCGATCTGT CAGAAGCTGC CGTACGCCTG
GAGGCGCTCT GGGAGCATCT GGGGAATAAT CAGGTGCTGT CGGTAAACAG ATTGGTTTAC
TTTTCATTGC TGAAAAAATT GTTCCAGGCA ATGAACCTCT GCCAGATCCC CGGACGTGCA
GGAGCACTGC TTACGACGCT TCTTCGCCAT ATATCGACAA TCAACGGGTT TGACAATCTG
ATGGCTCAGC CGTTGTTGTC AGATGAGCCC CTGACAGCGC TGATGGATCA TTATCTTGAT
ACTGATGCTC TGGCAGACGG GCTACCGCTG TATGTGTCGC TGTACCCCAC AGAAGGGGGC
ATGCAGGATA TTATTGACTG CATTCGTGCT GAACTGGGTG TCGGAACCAC GAAAAACGCC
GTTTTTCAGC ATATCCAGAG CCTGCCCCGC GGACAGCAGA AAGAGGCTCT GCTTGCGTCA
GCCGCGCTGC CCCTGCTGTT CCGTCCCCGT GAGGTTCAGG GGACAATGTT CGGTGATGGT
GGTATGGGAG GATGGCGAAA TATGCAGGGA AATACCCCTG TGACGCCTCT GGTCGATGCC
GGATGCAATA TGGTGATTGT GACGCATCTG AGTGACGGTT CTTTATGGGA TCGCCAGGCT
TTTCCGGACA CCACAATCCT TGAGATCCGT CCCCGGAAAA GGCTGAAATA TGCAGGTGAT
GGTGGCAACA GCGGCGGTCT GCTCAGTTTT ACATCGGCAC ATACCGACGC CTGGCGTCAG
CAGGGCTATG AAGACACGAT GCTGGCGATG GAGCATATCC GGAAACCGCT GGCAGCACGT
CAGGCACTGA CCCGGTCAGA GGCGGTATTG CAGAAAAGCC TGGATATAAC GGAAGAGGCA
GATTTGGCAC TGAGAAACGC GATGGCCCGG ATTAAATAA
 
Protein sequence
MKTGLVLSGG GAVGAYQAGV VKALAECGTQ ISMVSGTSIG AFNGAIIAAS PDLSEAAVRL 
EALWEHLGNN QVLSVNRLVY FSLLKKLFQA MNLCQIPGRA GALLTTLLRH ISTINGFDNL
MAQPLLSDEP LTALMDHYLD TDALADGLPL YVSLYPTEGG MQDIIDCIRA ELGVGTTKNA
VFQHIQSLPR GQQKEALLAS AALPLLFRPR EVQGTMFGDG GMGGWRNMQG NTPVTPLVDA
GCNMVIVTHL SDGSLWDRQA FPDTTILEIR PRKRLKYAGD GGNSGGLLSF TSAHTDAWRQ
QGYEDTMLAM EHIRKPLAAR QALTRSEAVL QKSLDITEEA DLALRNAMAR IK