Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1761 |
Symbol | |
ID | 6146547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1768727 |
End bp | 1770019 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641616637 |
Product | PAP2 family protein |
Protein accession | YP_001743815 |
Protein GI | 170680460 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2453] Predicted protein-tyrosine phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.000113055 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCTACAAG GAGCTGGCTG GTTATTGTTG CTGGCCCCGT TTTTCTTCTT CACATATGGA TTTCTTAATC AGTTCACCGC GACTCAGGAT CTTAACAACC ATGATATCCC CAGTCAGGTA TTCGGTTGGG AAACGGCGAT CCCTTTTCTT CCCTGGACTA TTTTGCCTTA CTGGAGTCTG GATCTTTTAT ATGGATTTTC GCTGTTCGTT TGTAGCTCGA CATTTGAGCA GCGCCGTCTG GTCCACCGGC TTATTCTGGC AACGGTAATG GCCTGCTGCG GTTTTTTTCT CTATCCGCTG AAGTTTAGTT TTATCCGTCC TGAAGTGAGT GGGGTGACAG GATGGTTATT TTCGCAACTT GAATTGTTTG ATCTGCCTTA TAACCAGTCT CCTTCGCTGC ATATTATTCT CTGCTGGCTA CTTTGGCGTC ACTTTCGTCA GCATCTGGCT GTGAGGTGGC GTAAAGTCTG TGGCGGATGG TTTTTACTCA TCGCCATTTC GACGCTAACG ACCTGGCAGC ATCATTTTAT TGATGTCATC ACAGGGCTGG CGGTAGGTAT GTTAATTGAC TGGATGGTGC CCGTCGACCG TCGTTGGAAT TATCAGAAAC CTGATCAACG TCGAATCAAA ATAGCACTGC CATATGTCGT AGGCGCGTGC TCGTGCATTG TGTTGATGGA GCTAATGATA ATGCTTCAGT TATGGTGGTC AGTCTGGTTA TGTTGGCCAG TATTATCGCT ACTCATTATT GGCCGTGGGT ACGGTGGGCT TGGCGCGATA ACAACAGGTA AAGATAGTCA AGGGAAACTC CCGCCCGCCG TTTACTGGCT GACATTGCCA TGGCGCATCG GGATGTGGCT GTCTATGCGT TGGTTTTGCC TTCGCCTGGA GCCGGTGAGC AAAATTACTG CTGGTGTTTA TTTAGGGGCG TTTCCACGAC ATATTCCGGC ACAGAATGCG GTTCTGGACG TCACCTTTGA ATTCCCTCGC GGACGAGCGA CAAAAGATCG ACTCTATTTT TGTGTACCGA TGTTGGATCT GGTAGTTCCG GAAGAGGGGG AGCTCCGACA GGCCGTGGCG ATGCTGGAAA CATTACGCGA AGAGCAAGGC GGCGTTCTGG TCCATTGCGC GTTGGGATTA TCGCGCAGTG CGCTGGTGGT GGCGGCATGG CTGTTATGTT ACGGACATTG TAAAACCGTT GATGAAGCGA TTAGTTATAT TCGAGCCAGA CGCTCGCGGA TTGTGCTTAA GGAAGAGCAC AAAGCGATGC TGAAATTATG GGAAAACAGG TAA
|
Protein sequence | MLQGAGWLLL LAPFFFFTYG FLNQFTATQD LNNHDIPSQV FGWETAIPFL PWTILPYWSL DLLYGFSLFV CSSTFEQRRL VHRLILATVM ACCGFFLYPL KFSFIRPEVS GVTGWLFSQL ELFDLPYNQS PSLHIILCWL LWRHFRQHLA VRWRKVCGGW FLLIAISTLT TWQHHFIDVI TGLAVGMLID WMVPVDRRWN YQKPDQRRIK IALPYVVGAC SCIVLMELMI MLQLWWSVWL CWPVLSLLII GRGYGGLGAI TTGKDSQGKL PPAVYWLTLP WRIGMWLSMR WFCLRLEPVS KITAGVYLGA FPRHIPAQNA VLDVTFEFPR GRATKDRLYF CVPMLDLVVP EEGELRQAVA MLETLREEQG GVLVHCALGL SRSALVVAAW LLCYGHCKTV DEAISYIRAR RSRIVLKEEH KAMLKLWENR
|
| |