Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1494 |
Symbol | |
ID | 5592328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1501337 |
End bp | 1502629 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640920651 |
Product | PAP2 family protein |
Protein accession | YP_001458207 |
Protein GI | 157160889 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2453] Predicted protein-tyrosine phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTACAAG GCGCTGGCTG GTTATTGTTG CTGGCCCCGT TTTTCTTCTT CACCTATGGA TCTCTTAATC AGTTCACCGC GGTTCAGGAC CTTAACAGCC ATGATATTCC CAGTCAGGTA TTCGGTTGGG AAACGGCGAT CCCTTTTCTT CCCTGGACTA TTGTTCCTTA CTGGAGTCTG GATCTTTTAT ATGGATTTTC GCTGTTCGTT TGTAGCACGA CATTCGAACA GCGCCGACTT GTCCACCGGC TTATTCTGGC AACGGTAATG GCCTGCTGCG GTTTTTTTCT CTACCCGCTG AAGTTTAGTT TTATCCGTCC TGAAGTGAGT GGGGTGACAG GATGGCTATT TTCGCAACTT GAACTGTTTG ATCTGCCTTA TAACCAGTCT CCTTCGCTGC ATATTATTCT CTGCTGGCTA CTTTGGCGTC ACTTTCGTCA GCATCTGGCT GTGAGGTGGC GTAAAGTCTG CGGCGGATGG TTTTTACTCA TCGCCATTTC GACGCTGACG ACCTGGCAGC ATCATTTTAT TGATGTCATC ACGGGGCTGG CGGTAGGTAT GTTAATTGAC TGGATGGTGC CCGTCGACCG TCGTTGGAAT TATCAGAAAC CTGATCAACG TCGAATCAAA ATAGCACTGC CATATGTCGT AGGCGCGGGC TCGTGCATTG TGTTGATGGA GCTAATGATA ATGCTTCAGT TATGGTGGTC AGTCTGGTTA TGTTGGCCAG TATTATCGCT ATTCATCATT GGCCGTGGGT ACGGTGGGCT TGGCGCGATA ACAACAGGGA AAGATAGTCA GGGGAAACTC CCGCCCGCCG TTTACTGGCT GACATTGCCC TGGCGTATCG GGATGTGGCT GTCTATGCGT TGGTCTTGTC TTCGCCTGGA GCCGGTGAGC AAAATTACTG CTGGTGTTTA TTTAGGGGCG TTTCCACGAC ATATTCCGGC ACAGAATGCG GTTCTGGACG TCACCTTTGA ATTCCCTCGC GGACGAGCCA CAAAAGATCG ACTCTATTTT TGTGTACCGA TGCTGGATCT GGTGGTTCCG GAAGAGGGGG AGCTCCGACA GGCCGTGGCG ATGCTGGAAA CATTACGCGA AGAGCAAGGC AGCGTTCTGG TCCATTGTGC ATTGGGATTA TCGCGCAGTG CGCTGGTGGT GGCGGCATGG TTGTTATGTT ACGGACACTG TAAAACCGTT AATGAAGCGA TTAGCTATAT TCGAGCCAGA CGCCCGCAGA TTGTGCTGAC AGACGAGCAC AAAGCGATGC TGAGATTATG GGAAAACAGG TAA
|
Protein sequence | MLQGAGWLLL LAPFFFFTYG SLNQFTAVQD LNSHDIPSQV FGWETAIPFL PWTIVPYWSL DLLYGFSLFV CSTTFEQRRL VHRLILATVM ACCGFFLYPL KFSFIRPEVS GVTGWLFSQL ELFDLPYNQS PSLHIILCWL LWRHFRQHLA VRWRKVCGGW FLLIAISTLT TWQHHFIDVI TGLAVGMLID WMVPVDRRWN YQKPDQRRIK IALPYVVGAG SCIVLMELMI MLQLWWSVWL CWPVLSLFII GRGYGGLGAI TTGKDSQGKL PPAVYWLTLP WRIGMWLSMR WSCLRLEPVS KITAGVYLGA FPRHIPAQNA VLDVTFEFPR GRATKDRLYF CVPMLDLVVP EEGELRQAVA MLETLREEQG SVLVHCALGL SRSALVVAAW LLCYGHCKTV NEAISYIRAR RPQIVLTDEH KAMLRLWENR
|
| |