Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2014 |
Symbol | |
ID | 6967980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1911332 |
End bp | 1912624 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643385931 |
Product | PAP2 family protein |
Protein accession | YP_002270420 |
Protein GI | 209398896 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2453] Predicted protein-tyrosine phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.106325 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTACAAG GCGCTGGCTG GTTATTGTTG CTGGCCCCGT TTTTCTTCTT CACCTATGGA TCTCTAAATC AGTTCACCGC GGTTCAGGAC TTTAACAGCC ATGATATCCC CAGTCAGGTA TTCGGCTGGG AAACGGCGAT CCCTTTTCTT CCCTGGACTA TTGTCCCTTA CTGGAGTCTG GATCTTTTAT ATGGATTTTC GCTGTTCGTT TGTAGCTCGA CATTCGAACA GCGCCGACTT GTCCACCGGC TTATTCTGGC AACGGTAATG GCCTGCTGCG GTTTTTTTCT CTACCCGCTG AAGTTTAGTT TTATCCGTCC TGAAGTGAGT GGGGTGACAG GATGGCTATT TTCGCAACTT GAACTGTTTG ATCTGCCTTA TAACCAGTCT CCTTCGCTGC ATATTGTTCT CTGCTGGCTA CTTTGGCGTC ACTTTCGTCA GCATCTGGCT GTGAGGTGGC GTAAAGTCTG CGGCGGATGG TTTTTACTCA TCGCCATTTC GATGCTGACA ACCTGGCAGC ATCATTTTAT TGATGTCATC ACAGGGCTGG CGGTAGGTAT GTTGATTGAC TGGATGATAC CCGTCGACCG TCGTTGGAAT TATCAGAAAC CTGATCAACG TCGAATCAAA ATAGCACTGC CATATGTCGT AGGCGCGTGC GCGTGCATTG TGTTGATGGA GCTAATGATG ATGGTTCAGT TATGGTGGTC AGTCTGGTTA TGTTGGCCAG TATTATCGCT ACTCATTATT GGCCGTGGGT ACGGTGGGCT TGGCGCGATA ACAACAGGGA AAGATAGTCA GGGGAAACTC CCGCCCGCCG TTTACTGGCT GACATTGCCC TGGCGCATCG GGATGTGGCT ATCTATGCGT TGGTTTTGTC GTCGCCTGGA GCCGGTGAGC AAAATGACTG CTGGTGTTTA TTTAGGGGCG TTTCCACGAC ATATTCCGGC ACAGAATGCG GTTCTGGATG TCACCTTTGA ATTCCCTCGC GGACGAGCGA CAAAAGATCG ACTCTATTTC TGTGTACCGA TGCTGGATCT GGTGGTTCCG GAAGAGGGGG AGCTCCGACA GGCCGTGGCG ATGCTGGAAA CATTACGCGA AGAGCAAGGC AGCGTTCTGG TCCATTGCGC ATTGGGATTA TCGCGCAGTG CGCTGGTAGT GGCGGCATGG CTGTTATGTT ACGGACACTG TAAAACAGTT GATGAAGCGA TTAGCTTTAT TCGAGCCAGA CGCTCGCATA TTGTGCTTAA GGAAGAGCAC AAAGCGATGT TGAAATTATG GGAAAACAGG TAA
|
Protein sequence | MLQGAGWLLL LAPFFFFTYG SLNQFTAVQD FNSHDIPSQV FGWETAIPFL PWTIVPYWSL DLLYGFSLFV CSSTFEQRRL VHRLILATVM ACCGFFLYPL KFSFIRPEVS GVTGWLFSQL ELFDLPYNQS PSLHIVLCWL LWRHFRQHLA VRWRKVCGGW FLLIAISMLT TWQHHFIDVI TGLAVGMLID WMIPVDRRWN YQKPDQRRIK IALPYVVGAC ACIVLMELMM MVQLWWSVWL CWPVLSLLII GRGYGGLGAI TTGKDSQGKL PPAVYWLTLP WRIGMWLSMR WFCRRLEPVS KMTAGVYLGA FPRHIPAQNA VLDVTFEFPR GRATKDRLYF CVPMLDLVVP EEGELRQAVA MLETLREEQG SVLVHCALGL SRSALVVAAW LLCYGHCKTV DEAISFIRAR RSHIVLKEEH KAMLKLWENR
|
| |