Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4388 |
Symbol | |
ID | 6142929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4476866 |
End bp | 4478419 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641619209 |
Product | serine/threonine protein phosphatase family protein |
Protein accession | YP_001746333 |
Protein GI | 170681676 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.693263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTA AAATGCTTGC TGCGGGTATC GCATTAACAC TGCCGTTTTG GGCTTGCGCC AAAGATGTCA CCATCATTTA TACCAACGAC CTCCATGCAC ATGTAGAGCC TTATAAAGTG CCGTGGATTG CTGACGGTAA ACGGGATATT GGCGGCTGGG CAAATATCAC TACGCTTGTT AAACAAGAAA AAGCTAAAAA CAAAGCGACC TGGTTTTTTG ATGCGGGTGA CTATTTTACC GGACCGTATA TCAGCAGCCT GACTAAAGGC AAGGCGATTA TCGATATTAT GAATACCATG CCATTCGATG CGGTCACTAT AGGTAATCAT GAATTTGATC ACGGCTGGGA CAATACGTTA TTACAGTTGA GTCAGGCAAA ATTCCCGATT GTGCAGGGTA ATGTTTTTTA TCAGAACAGC AGTAAATCAT TCTGGGATAA GCCCTATACC ATCATTGAAA AAGACGGCGT GAAAATTGGC GTGATTGGTT TGCACGGTGT ATTTGCCTTT AATGATACGG TATCTGCGGC AACGCGAGTG GGTATTGAGG CGCGTGATGA AATTAAATGG CTACAACGTT ATATCGATGA ACTCAAAGGC AAGGTTGATC TAACCGTCGC CCTGATCCAC GAAGGTGTTC CGGCCCGCCA GTCCAGTATG GGAGGCACGG ATGTGCGTCG CGCACTGGAT AAAGATATTC AGACGGCAAG TCAGGTGAAA GGGTTGGATA TTTTGATCAC CGGGCATGCA CATGTGGGTA CGCCGGAACC GATTAAAGTC GGCAATACGT TAATCCTCTC CACTGACAGC GGCGGGATTG ATGTCGGTAA ACTGGTTCTC GACTACAAAG AGAAGCCGCA CACTTTTACG GTGAAAAACT TCGAGCTTAA AACCCTTTAC GCCGATGAGT GGAAGCCCGA TCCGCAAACG AAACAGGTGA TTGATAGTTG GAACAAAAAG CTGGATGAAG TCGTGCAACA AACGGTGGCG CAATCGCCGG TTGAACTAAA ACGTGCCTAT GGTGAATCAG CTTCTCTCGG GAACCTGGCG GCAGACGCTT TGCTGGCAGC GGCGGGTAAA AATACCCAGT TGGCGTTAAC CAACTCTGGC GGGATTCGCA ATGAGATCCC GGCGGGTGCA ATTACGATGG GTGGCGTCAT CAGTACCTTC CCGTTCCCCA ACGAACTGGT GACGATGGAT CTCACGGGTA AACAATTACG CAGTTTGATG GAACACGGCG CAAGTTTGAG TAATGGCGTT TTACAGGTAT CGAAAGGCCT GGAAATGAAG TACGACAGCA GTAAGCCGGT TGGTCAGCGG GTAATCACGC TGACTCTGAA TGGCAAACCC ATTGAAGATG CGACGATTTA CCACATTGCT ACTCAGAGTT TCCTTGCTGA TGGTGGAGAT GGTTTTACCG CCTTTACCGA AGGGAAAGCG CGTAACACAA CGGGCGGTTA TTACGTTTAT CACGCCGTGG TTGATTACTT CAAAGCGGGT AACACCATCA CGGATGAACA GATCAACGGT ATGCGCGTGA AAGATATCAA GTAA
|
Protein sequence | MKIKMLAAGI ALTLPFWACA KDVTIIYTND LHAHVEPYKV PWIADGKRDI GGWANITTLV KQEKAKNKAT WFFDAGDYFT GPYISSLTKG KAIIDIMNTM PFDAVTIGNH EFDHGWDNTL LQLSQAKFPI VQGNVFYQNS SKSFWDKPYT IIEKDGVKIG VIGLHGVFAF NDTVSAATRV GIEARDEIKW LQRYIDELKG KVDLTVALIH EGVPARQSSM GGTDVRRALD KDIQTASQVK GLDILITGHA HVGTPEPIKV GNTLILSTDS GGIDVGKLVL DYKEKPHTFT VKNFELKTLY ADEWKPDPQT KQVIDSWNKK LDEVVQQTVA QSPVELKRAY GESASLGNLA ADALLAAAGK NTQLALTNSG GIRNEIPAGA ITMGGVISTF PFPNELVTMD LTGKQLRSLM EHGASLSNGV LQVSKGLEMK YDSSKPVGQR VITLTLNGKP IEDATIYHIA TQSFLADGGD GFTAFTEGKA RNTTGGYYVY HAVVDYFKAG NTITDEQING MRVKDIK
|
| |