Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_0936 |
Symbol | |
ID | 9137621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | - |
Start bp | 1200741 |
End bp | 1202180 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003628979 |
Protein GI | 296121201 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTAG GAAGTCACCC GGCTATCGCA CTTTGGTTGG CTCTTGTGGC ATTCTGTTCA CAAGCCCTCT TGGCTGCCGA GGATGTGAAT CAAACCAGCA AATCCGGTCG TCCCAACATT CTGGTCATTA TGGCAGATGA CCTGGGATAC GCTGACCTGG GGGTTCAGGG TGGCTGCGAA ATCCCCACAC CTCACCTGGA TCAATTAGCC GCCAGTGGCA TTCGCTGCAC CAACGCATAC GTCTCGGCAC CCTATTGCAG CCCCTCTCGC GCCGGCTTTC TCACCGGGAA GTACCAGACT CGGTTTGGAC ACGAGTTCAA TCCCCACGTA GGCGAAGAAG CGAAACTCGG CTTGCCGCTC GAAGAAGTGA CGATCGCCAA TTTATTACAA ACAGAAGGCT ATCGCACGGC CCTGATCGGC AAATGGCACC AAGGTTTCAG CAAAGATCAT CATCCGCAAA GCCGGGGGTT CGATGAGTTT TTCGGCTTCC TCGTCGGTGG CCACAACTAC CTCTTACATA AAGAAGTCAA AGCCAGGTTT GGAACTGCTC ACTCTCACGA CATGATCTAC CGGGGCCGCG AAGTCGAACC ACAGGAAGGC TACGCGACAG ATCTCTTCAC CAACGAAGCT CTTCGCTGGA TGAGTGGCCC GCCCAACAAA CCCTGGTTCT TGTATCTTTC GTACAATGCC GTCCACACAC CACTCGAAAT CGCCCCGCAT CTCCAGAAGA GAATCCCAGA GTCTGTAAAA CTCCCAGCTC GCCGCGGCTA CCTCTCCTTA CTGGCCGGCC TCGACGATTC CATCGGCCGG ATTACGCAGC ACCTGAGTCA GCATGGGCTC CGCGAAAAGA CACTCATTAT ATTTCTCAGT GATAATGGCG GGTCGGGACG CGCACCCATA CTCGCGTACA ACTCCGGGCT CAACCATCCC CTGCGCGGAG ATAAAGGGCA AACTCTCGAA GGCGGCATCC GCGTCCCCTT CTTTGTCTCA TGGCCCGGGC AACTTCCCGC CCGCACAATC TACGAGCAGC CCATCATTTC TCTCGATCTT CTCCCCACGG TTTGCCAGTT GGCTGCCAAT AATCCCGCAA AACCACAGCC ACTTCCTCAA GGAATCGATG GGGTGAATCT CATGCCCTAT TGGCTCGGCC AGCGTTCGGG AGCTCCTCAC GAATCGCTCT TCTGGAGATT TGGCCCCCAA AAAGCCGTCC GCGCAGGAAA CTGGAAGCTC GTGGATTGGC GAGATTTTCC CGCATCAAAA AATAGTGGCT GGGAACTTTA CGACCTCTCC ACAGACATCA GCGAAAAAAA CAACCTGGCA GAAACTCATC CCGAAATTGT CGCCCGTCTG AAAACTTCCT GGGAAAAGTG GAATCAATCC AATATCGAGC CCCTCTGGAG AGGCTCCAAG ATGGAAGACG CCACTCCAGT GACGAAATAA
|
Protein sequence | MSLGSHPAIA LWLALVAFCS QALLAAEDVN QTSKSGRPNI LVIMADDLGY ADLGVQGGCE IPTPHLDQLA ASGIRCTNAY VSAPYCSPSR AGFLTGKYQT RFGHEFNPHV GEEAKLGLPL EEVTIANLLQ TEGYRTALIG KWHQGFSKDH HPQSRGFDEF FGFLVGGHNY LLHKEVKARF GTAHSHDMIY RGREVEPQEG YATDLFTNEA LRWMSGPPNK PWFLYLSYNA VHTPLEIAPH LQKRIPESVK LPARRGYLSL LAGLDDSIGR ITQHLSQHGL REKTLIIFLS DNGGSGRAPI LAYNSGLNHP LRGDKGQTLE GGIRVPFFVS WPGQLPARTI YEQPIISLDL LPTVCQLAAN NPAKPQPLPQ GIDGVNLMPY WLGQRSGAPH ESLFWRFGPQ KAVRAGNWKL VDWRDFPASK NSGWELYDLS TDISEKNNLA ETHPEIVARL KTSWEKWNQS NIEPLWRGSK MEDATPVTK
|
| |