Gene EcSMS35_1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1761 
Symbol 
ID6146547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1768727 
End bp1770019 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content49% 
IMG OID641616637 
ProductPAP2 family protein 
Protein accessionYP_001743815 
Protein GI170680460 
COG category[T] Signal transduction mechanisms 
COG ID[COG2453] Predicted protein-tyrosine phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.000113055 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCTACAAG GAGCTGGCTG GTTATTGTTG CTGGCCCCGT TTTTCTTCTT CACATATGGA 
TTTCTTAATC AGTTCACCGC GACTCAGGAT CTTAACAACC ATGATATCCC CAGTCAGGTA
TTCGGTTGGG AAACGGCGAT CCCTTTTCTT CCCTGGACTA TTTTGCCTTA CTGGAGTCTG
GATCTTTTAT ATGGATTTTC GCTGTTCGTT TGTAGCTCGA CATTTGAGCA GCGCCGTCTG
GTCCACCGGC TTATTCTGGC AACGGTAATG GCCTGCTGCG GTTTTTTTCT CTATCCGCTG
AAGTTTAGTT TTATCCGTCC TGAAGTGAGT GGGGTGACAG GATGGTTATT TTCGCAACTT
GAATTGTTTG ATCTGCCTTA TAACCAGTCT CCTTCGCTGC ATATTATTCT CTGCTGGCTA
CTTTGGCGTC ACTTTCGTCA GCATCTGGCT GTGAGGTGGC GTAAAGTCTG TGGCGGATGG
TTTTTACTCA TCGCCATTTC GACGCTAACG ACCTGGCAGC ATCATTTTAT TGATGTCATC
ACAGGGCTGG CGGTAGGTAT GTTAATTGAC TGGATGGTGC CCGTCGACCG TCGTTGGAAT
TATCAGAAAC CTGATCAACG TCGAATCAAA ATAGCACTGC CATATGTCGT AGGCGCGTGC
TCGTGCATTG TGTTGATGGA GCTAATGATA ATGCTTCAGT TATGGTGGTC AGTCTGGTTA
TGTTGGCCAG TATTATCGCT ACTCATTATT GGCCGTGGGT ACGGTGGGCT TGGCGCGATA
ACAACAGGTA AAGATAGTCA AGGGAAACTC CCGCCCGCCG TTTACTGGCT GACATTGCCA
TGGCGCATCG GGATGTGGCT GTCTATGCGT TGGTTTTGCC TTCGCCTGGA GCCGGTGAGC
AAAATTACTG CTGGTGTTTA TTTAGGGGCG TTTCCACGAC ATATTCCGGC ACAGAATGCG
GTTCTGGACG TCACCTTTGA ATTCCCTCGC GGACGAGCGA CAAAAGATCG ACTCTATTTT
TGTGTACCGA TGTTGGATCT GGTAGTTCCG GAAGAGGGGG AGCTCCGACA GGCCGTGGCG
ATGCTGGAAA CATTACGCGA AGAGCAAGGC GGCGTTCTGG TCCATTGCGC GTTGGGATTA
TCGCGCAGTG CGCTGGTGGT GGCGGCATGG CTGTTATGTT ACGGACATTG TAAAACCGTT
GATGAAGCGA TTAGTTATAT TCGAGCCAGA CGCTCGCGGA TTGTGCTTAA GGAAGAGCAC
AAAGCGATGC TGAAATTATG GGAAAACAGG TAA
 
Protein sequence
MLQGAGWLLL LAPFFFFTYG FLNQFTATQD LNNHDIPSQV FGWETAIPFL PWTILPYWSL 
DLLYGFSLFV CSSTFEQRRL VHRLILATVM ACCGFFLYPL KFSFIRPEVS GVTGWLFSQL
ELFDLPYNQS PSLHIILCWL LWRHFRQHLA VRWRKVCGGW FLLIAISTLT TWQHHFIDVI
TGLAVGMLID WMVPVDRRWN YQKPDQRRIK IALPYVVGAC SCIVLMELMI MLQLWWSVWL
CWPVLSLLII GRGYGGLGAI TTGKDSQGKL PPAVYWLTLP WRIGMWLSMR WFCLRLEPVS
KITAGVYLGA FPRHIPAQNA VLDVTFEFPR GRATKDRLYF CVPMLDLVVP EEGELRQAVA
MLETLREEQG GVLVHCALGL SRSALVVAAW LLCYGHCKTV DEAISYIRAR RSRIVLKEEH
KAMLKLWENR