Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_3206 |
Symbol | |
ID | 9139920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | - |
Start bp | 4144214 |
End bp | 4145974 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003631219 |
Protein GI | 296123441 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.338438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCACA ACCATTGGCT CGACCACCTT CTCGCGCTGC GATCGATCCT GTTGCTGCTG GCCATCGTCA GTCCGTCGAT GGGCCTCAAG CTCGTCGCGG CAAGCGATAA GCCGAATGTA CTGCTGATCT GCGTCGACGA TCTCAAGCCC ACGCTGGGTT GCTATGGCGA TCCTCTGGCA AAAACGCCGA ATATTGATCG ACTGGCTTCC CGAGCCGTCC TGTTTGAAAG TGCCTATTGC AATCAGGCCG TCTGCTCGCC CTCACGCAAT GCACTCCTCA CAGGCCTGCG CCCGCAGACG CTGGGCATTT ACGATCTGGG TACCAACTTC CGGAAATCCC GGCCCAACGC CATCACCATG CCCCAGTTCT TCAAACAGCA GGGCTATCAC ACCGCAGCAC TCGGAAAAAT CTTTCATGTC GGTCACGGCA ACGGCGAAGA CCAGGCCTCC TGGAGTGTCC CGCACTTCAA AGCCAACGTC GTCGGTTATG CCTTGCCAGA AAGCAAAGCC CCACAAGGTT TGACTCGCGA AGAAGCTCTA TTTGATAACA AATCAGCCGC CGGCCTCCCG CGCGGTGCAG TGATCGAAGC TGCCAACGTT TCCGATGAAA CCTATGCCGA CGGAAAAATC GCTCAGGAGG CAATGCTCCG TCTGCAGCAG GCCAGCCAAA AGCCCAATGA ACCCTTCTTT CTCGCCGTCG GCTTTGTGAA GCCGCACCTC CCCTTTGTCG CTCCCCAGAA GTATTGGGAT CTTTACCAGA GAGATCAGTT CTCCCTCCCC AGCATCACTA AAGCCCCGGC CGGCGCACCG GCTTATGCCC CCACCAACTG GGGCGAACTC CGCCAATATC AAGGCGTGCC ACAGGAAGGC CCACTCTCCG CAGACCTGCA GAAGGAGTTG ATCCATGGCT ACTACGCCGC CACCAGCTAC ATGGATGCTC AAGTCGGTCG TGTCCTCGAT GAACTCGATC GCCTCCAATT GACCGATCGC ACGATCGTCG TCCTCTGGGG CGATCATGGC TGGCATCTGG GTGATCACGG CATGTGGTGC AAACATTCCA ACTATGAGCA GGCCACTCGA ATTCCTGTCC TGTTTTCCAT TCCCGGCGGT CAGAAAAATG TCAAAACCAA GGCTCTGCTT GAAACCGTCG ATATCTATCC CACACTCTGC CAGCTCGCTG GCTTGCCCTC ACCTGCCGAT ATCGATGGAA GCAGCCAGAT TCAGGTTCTA AGCCATCCCC AGTCTCACTT GAAGGATCAC GTGATTCATG TCTATCCCCG TTCGCCTCAG GGAAAAGGGC CGATTCTGGG CCGGGCGATT CGGACGGAGC GTTACCGCCT CGTCGAGTGG AAGGGCTATG GTGCCTCGCC GGAGACCGCA GAGTATGAAC TCTATGATTA TCAGGCCGAC CCACTGGAGA GAGAAAACCT GGCGAGCCAG CAGCCCCAGG TTGTCGAGCA GCTCAAATCA CTCCTGGCTC GCCATCCAGA GGCGAAACCT CAGATTAAAG CCAACGCCAC GCCAGCCAAA CCTGCTGAGG CCACGTCTCC ACAAAAACCC ACAGCAAAAA TCGATCGCAA CAAACTCTTC GCCACGAAAG ACAAAAACGC GGATGGCAAA TTGACCCACG AAGAGTTCAT GAGCAACCAA CCCGACGGCC CCGCCGCCGC CCAAAGGTTT ATTAAATTCG ATATCAATAA AGACGGAACC CTGAGCCAGG AAGAATTTGT CGGCAGCGGT TCAACGCGTA AAACCCAGTA A
|
Protein sequence | MIHNHWLDHL LALRSILLLL AIVSPSMGLK LVAASDKPNV LLICVDDLKP TLGCYGDPLA KTPNIDRLAS RAVLFESAYC NQAVCSPSRN ALLTGLRPQT LGIYDLGTNF RKSRPNAITM PQFFKQQGYH TAALGKIFHV GHGNGEDQAS WSVPHFKANV VGYALPESKA PQGLTREEAL FDNKSAAGLP RGAVIEAANV SDETYADGKI AQEAMLRLQQ ASQKPNEPFF LAVGFVKPHL PFVAPQKYWD LYQRDQFSLP SITKAPAGAP AYAPTNWGEL RQYQGVPQEG PLSADLQKEL IHGYYAATSY MDAQVGRVLD ELDRLQLTDR TIVVLWGDHG WHLGDHGMWC KHSNYEQATR IPVLFSIPGG QKNVKTKALL ETVDIYPTLC QLAGLPSPAD IDGSSQIQVL SHPQSHLKDH VIHVYPRSPQ GKGPILGRAI RTERYRLVEW KGYGASPETA EYELYDYQAD PLERENLASQ QPQVVEQLKS LLARHPEAKP QIKANATPAK PAEATSPQKP TAKIDRNKLF ATKDKNADGK LTHEEFMSNQ PDGPAAAQRF IKFDINKDGT LSQEEFVGSG STRKTQ
|
| |