Gene ECH74115_2014 details


Gene Information

Locus tag: ECH74115_2014
Symbol:
ID: 6967980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1911332
End bp: 1912624
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 50%
IMG OID: 643385931
Product: PAP2 family protein
Protein accession: YP_002270420
Protein GI: 209398896
COG category: [T] Signal transduction mechanisms
COG ID: [COG2453] Predicted protein-tyrosine phosphatase
TIGRFAM ID:
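
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1911332
end_bp = 1912624

# Inclusive coordinates span (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1293 bp
print(protein_length, "aa")   # 430 aa
```

Both values agree with the Gene Length and Protein Length fields reported for this locus.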


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.106325
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTACAAG GCGCTGGCTG GTTATTGTTG CTGGCCCCGT TTTTCTTCTT CACCTATGGA 
TCTCTAAATC AGTTCACCGC GGTTCAGGAC TTTAACAGCC ATGATATCCC CAGTCAGGTA
TTCGGCTGGG AAACGGCGAT CCCTTTTCTT CCCTGGACTA TTGTCCCTTA CTGGAGTCTG
GATCTTTTAT ATGGATTTTC GCTGTTCGTT TGTAGCTCGA CATTCGAACA GCGCCGACTT
GTCCACCGGC TTATTCTGGC AACGGTAATG GCCTGCTGCG GTTTTTTTCT CTACCCGCTG
AAGTTTAGTT TTATCCGTCC TGAAGTGAGT GGGGTGACAG GATGGCTATT TTCGCAACTT
GAACTGTTTG ATCTGCCTTA TAACCAGTCT CCTTCGCTGC ATATTGTTCT CTGCTGGCTA
CTTTGGCGTC ACTTTCGTCA GCATCTGGCT GTGAGGTGGC GTAAAGTCTG CGGCGGATGG
TTTTTACTCA TCGCCATTTC GATGCTGACA ACCTGGCAGC ATCATTTTAT TGATGTCATC
ACAGGGCTGG CGGTAGGTAT GTTGATTGAC TGGATGATAC CCGTCGACCG TCGTTGGAAT
TATCAGAAAC CTGATCAACG TCGAATCAAA ATAGCACTGC CATATGTCGT AGGCGCGTGC
GCGTGCATTG TGTTGATGGA GCTAATGATG ATGGTTCAGT TATGGTGGTC AGTCTGGTTA
TGTTGGCCAG TATTATCGCT ACTCATTATT GGCCGTGGGT ACGGTGGGCT TGGCGCGATA
ACAACAGGGA AAGATAGTCA GGGGAAACTC CCGCCCGCCG TTTACTGGCT GACATTGCCC
TGGCGCATCG GGATGTGGCT ATCTATGCGT TGGTTTTGTC GTCGCCTGGA GCCGGTGAGC
AAAATGACTG CTGGTGTTTA TTTAGGGGCG TTTCCACGAC ATATTCCGGC ACAGAATGCG
GTTCTGGATG TCACCTTTGA ATTCCCTCGC GGACGAGCGA CAAAAGATCG ACTCTATTTC
TGTGTACCGA TGCTGGATCT GGTGGTTCCG GAAGAGGGGG AGCTCCGACA GGCCGTGGCG
ATGCTGGAAA CATTACGCGA AGAGCAAGGC AGCGTTCTGG TCCATTGCGC ATTGGGATTA
TCGCGCAGTG CGCTGGTAGT GGCGGCATGG CTGTTATGTT ACGGACACTG TAAAACAGTT
GATGAAGCGA TTAGCTTTAT TCGAGCCAGA CGCTCGCATA TTGTGCTTAA GGAAGAGCAC
AAAGCGATGT TGAAATTATG GGAAAACAGG TAA
 
Protein sequence
MLQGAGWLLL LAPFFFFTYG SLNQFTAVQD FNSHDIPSQV FGWETAIPFL PWTIVPYWSL 
DLLYGFSLFV CSSTFEQRRL VHRLILATVM ACCGFFLYPL KFSFIRPEVS GVTGWLFSQL
ELFDLPYNQS PSLHIVLCWL LWRHFRQHLA VRWRKVCGGW FLLIAISMLT TWQHHFIDVI
TGLAVGMLID WMIPVDRRWN YQKPDQRRIK IALPYVVGAC ACIVLMELMM MVQLWWSVWL
CWPVLSLLII GRGYGGLGAI TTGKDSQGKL PPAVYWLTLP WRIGMWLSMR WFCRRLEPVS
KMTAGVYLGA FPRHIPAQNA VLDVTFEFPR GRATKDRLYF CVPMLDLVVP EEGELRQAVA
MLETLREEQG SVLVHCALGL SRSALVVAAW LLCYGHCKTV DEAISFIRAR RSHIVLKEEH
KAMLKLWENR
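
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of such a translation, not the tool the database used: the names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical, and only the first 60 bases of the gene sequence are translated here.

```python
# Amino acids for table 11 in the usual NCBI layout (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS given as text; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "TTGCTACAAG GCGCTGGCTG GTTATTGTTG CTGGCCCCGT TTTTCTTCTT CACCTATGGA"
print(translate_cds(first_line))  # MLQGAGWLLLLAPFFFFTYG
```

The output matches the first 20 residues of the protein sequence listed above.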