Gene EcHS_A1494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1494 
Symbol 
ID5592328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1501337 
End bp1502629 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content50% 
IMG OID640920651 
ProductPAP2 family protein 
Protein accessionYP_001458207 
Protein GI157160889 
COG category[T] Signal transduction mechanisms 
COG ID[COG2453] Predicted protein-tyrosine phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTACAAG GCGCTGGCTG GTTATTGTTG CTGGCCCCGT TTTTCTTCTT CACCTATGGA 
TCTCTTAATC AGTTCACCGC GGTTCAGGAC CTTAACAGCC ATGATATTCC CAGTCAGGTA
TTCGGTTGGG AAACGGCGAT CCCTTTTCTT CCCTGGACTA TTGTTCCTTA CTGGAGTCTG
GATCTTTTAT ATGGATTTTC GCTGTTCGTT TGTAGCACGA CATTCGAACA GCGCCGACTT
GTCCACCGGC TTATTCTGGC AACGGTAATG GCCTGCTGCG GTTTTTTTCT CTACCCGCTG
AAGTTTAGTT TTATCCGTCC TGAAGTGAGT GGGGTGACAG GATGGCTATT TTCGCAACTT
GAACTGTTTG ATCTGCCTTA TAACCAGTCT CCTTCGCTGC ATATTATTCT CTGCTGGCTA
CTTTGGCGTC ACTTTCGTCA GCATCTGGCT GTGAGGTGGC GTAAAGTCTG CGGCGGATGG
TTTTTACTCA TCGCCATTTC GACGCTGACG ACCTGGCAGC ATCATTTTAT TGATGTCATC
ACGGGGCTGG CGGTAGGTAT GTTAATTGAC TGGATGGTGC CCGTCGACCG TCGTTGGAAT
TATCAGAAAC CTGATCAACG TCGAATCAAA ATAGCACTGC CATATGTCGT AGGCGCGGGC
TCGTGCATTG TGTTGATGGA GCTAATGATA ATGCTTCAGT TATGGTGGTC AGTCTGGTTA
TGTTGGCCAG TATTATCGCT ATTCATCATT GGCCGTGGGT ACGGTGGGCT TGGCGCGATA
ACAACAGGGA AAGATAGTCA GGGGAAACTC CCGCCCGCCG TTTACTGGCT GACATTGCCC
TGGCGTATCG GGATGTGGCT GTCTATGCGT TGGTCTTGTC TTCGCCTGGA GCCGGTGAGC
AAAATTACTG CTGGTGTTTA TTTAGGGGCG TTTCCACGAC ATATTCCGGC ACAGAATGCG
GTTCTGGACG TCACCTTTGA ATTCCCTCGC GGACGAGCCA CAAAAGATCG ACTCTATTTT
TGTGTACCGA TGCTGGATCT GGTGGTTCCG GAAGAGGGGG AGCTCCGACA GGCCGTGGCG
ATGCTGGAAA CATTACGCGA AGAGCAAGGC AGCGTTCTGG TCCATTGTGC ATTGGGATTA
TCGCGCAGTG CGCTGGTGGT GGCGGCATGG TTGTTATGTT ACGGACACTG TAAAACCGTT
AATGAAGCGA TTAGCTATAT TCGAGCCAGA CGCCCGCAGA TTGTGCTGAC AGACGAGCAC
AAAGCGATGC TGAGATTATG GGAAAACAGG TAA
 
Protein sequence
MLQGAGWLLL LAPFFFFTYG SLNQFTAVQD LNSHDIPSQV FGWETAIPFL PWTIVPYWSL 
DLLYGFSLFV CSTTFEQRRL VHRLILATVM ACCGFFLYPL KFSFIRPEVS GVTGWLFSQL
ELFDLPYNQS PSLHIILCWL LWRHFRQHLA VRWRKVCGGW FLLIAISTLT TWQHHFIDVI
TGLAVGMLID WMVPVDRRWN YQKPDQRRIK IALPYVVGAG SCIVLMELMI MLQLWWSVWL
CWPVLSLFII GRGYGGLGAI TTGKDSQGKL PPAVYWLTLP WRIGMWLSMR WSCLRLEPVS
KITAGVYLGA FPRHIPAQNA VLDVTFEFPR GRATKDRLYF CVPMLDLVVP EEGELRQAVA
MLETLREEQG SVLVHCALGL SRSALVVAAW LLCYGHCKTV NEAISYIRAR RPQIVLTDEH
KAMLRLWENR