Gene Information |
Locus tag | Plim_3501 |
Symbol | |
ID | 9140219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 4525460 |
End bp | 4526980 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003631513 |
Protein GI | 296123735 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.486843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCATG AATTGATGAC CCGCCAGCCC GTAAATCATC GAGAAGACCG GGGTGGGCAT CGAATTTCCT GGTTCCTGTG TCACCTGGCG ATCAGTCTAT CGCTGTGTCT CTGGCAGGTT GACTCGATCA CAAAAGTGAT GGCAGCCGAG GCTCGGCCTG AAAAACCGAA TGTCGTCATT ATTAACTGCG ATGACCTGGG TTATGCCGAT GTGGGGGCGT TTGGTGCTAC GATCTGCAAG ACGCCTGAGA TTGACAGGAT GGCGAGAGAA GGGGTGAAAG CCACTTCGTT TTACGTGGCG CAAGCGGTTT GCTCGGCTTC TCGCACAGCC CTGCTCACAG GCTGCCTTCC GAATCGTATC GGAATTCTCG GGGCGCTGAG TCATGTTTCG AAGAACGGCA TCGCCGACAG CGAAGTGACT CTGGGCGAAC TCTTTCAGTC ACAAGGCTAT TCCACCGCCA TGTATGGCAA ATGGCACCTG GGCTATCAGG CTCAGTTCCT GCCGGGTCAC CATGGATTTG GTGAAGCCTT GGGGATTCCT TACTCGAACG ATATGTGGTC GAAGAACCCT TATGGCAAGT TCCCGCCCTT GCCTCTTTTC CGTCAAAAAG GGGATTCACC CGCTGAGATC ATTGGCCATG ACACCGATCA ATCCCGCTTC ACGACTGACT TCACGATGGC AGCCGTTTCC TTTATCGACC GGCACGCCGA CAAACCCTTC TTCATCTATC TGGCTCACCC CATGCCCCAT ACGCCGATCT TTGTCAGCGA AGAGCGCAAC AGCGGTGAGA GAGCCCAACT TTACCGAGAT GTCATCGGCG AGATTGACTG GTCGGTCGGA ACCATTCGGC AGACACTCGA AAAGCATCAA CTCACTCGCA AAACACTGGT GATTTTTACT TCTGATAATG GCCCGTGGCT GGTTTTTGGA AATCATGCCG GCAGCACAGG GCCGCTCCGC GAAGGGAAAG GAACAATGTG GGATGGTGGT GCCCGCGTTC CGTTTGTGGC CTGCTGGCCC GGTGTCATTC CGCCCGATAC GACGGTCGAT CTTCCCATGG CCACCTACGA CCTCTTTCCC ACCTTTGCCA AAATGCTCGG TGCTAAACTT CCTGATCATC CCATTGATGG CGTCGATATC TGGCCTCAGT TAACCAGTGC CTCGAAAGCT CAACCCCACC AGGCGCTATG GTTCTACTAT GGCCGCGATC TGATTGCTGT GCGTTCGGGC CCATGGAAGC TCGTCTTCCC GCACACTTAC GTCCATCCCG TCGAACGGGG AAACGACGGC CAGCGCGGTA AGCTCGTCAA CCGGAAGTTC ACGGAGCTGG CACTTTACAA TCTGGATTCT GATATTGGCG AAACGACGAA TCTGGCCAGC CAGCACCCTG AAATCGTCAA GCAGTTGGAG GCCTATGCTG AGGTCGCCCG TAATGAACTG GGTGACGCTT TGACCAATCG CAAAGGTAGC GGAGTCCGCC CTCCCGGCAC AGTCAACGAT TCCCCAGCAC TCGGAAACTA A
|
Protein sequence | MRHELMTRQP VNHREDRGGH RISWFLCHLA ISLSLCLWQV DSITKVMAAE ARPEKPNVVI INCDDLGYAD VGAFGATICK TPEIDRMARE GVKATSFYVA QAVCSASRTA LLTGCLPNRI GILGALSHVS KNGIADSEVT LGELFQSQGY STAMYGKWHL GYQAQFLPGH HGFGEALGIP YSNDMWSKNP YGKFPPLPLF RQKGDSPAEI IGHDTDQSRF TTDFTMAAVS FIDRHADKPF FIYLAHPMPH TPIFVSEERN SGERAQLYRD VIGEIDWSVG TIRQTLEKHQ LTRKTLVIFT SDNGPWLVFG NHAGSTGPLR EGKGTMWDGG ARVPFVACWP GVIPPDTTVD LPMATYDLFP TFAKMLGAKL PDHPIDGVDI WPQLTSASKA QPHQALWFYY GRDLIAVRSG PWKLVFPHTY VHPVERGNDG QRGKLVNRKF TELALYNLDS DIGETTNLAS QHPEIVKQLE AYAEVARNEL GDALTNRKGS GVRPPGTVND SPALGN
|
| |
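The length fields in this record can be cross-checked against its coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which matches the reported 1521 bp) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


def gc_percent(sequence: str) -> float:
    """Percent G+C, ignoring the spaces that group bases into blocks of 10."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


length = gene_length(4525460, 4526980)  # Start bp and End bp from the record
print(length)                  # 1521
print(protein_length(length))  # 506
```

Passing the Gene sequence value to `gc_percent` gives a figure that can be compared with the reported GC content; the record appears to round to a whole percent.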