Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_3662 |
Symbol | |
ID | 9140380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 4709524 |
End bp | 4711437 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003631673 |
Protein GI | 296123895 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.33998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGCCGG ATCGCCAAGC GTACTTGAAT CAGCGTCGCC TCACTCGCAT TGTCTCGGGG CTTTGCCTGC TCTTTGCCTG CCTGATCTGG ATCCTCACAA CCAGCCCGAC ACCTGTCGTA CTCGCACAAA TCACTCCGGA AACGAGTGTT GACACCACCT CGGCAGCCCT CGCCGACCGC CCCAACATCA TCGTCATCAT GGTCGATGAC CTCGGCTGGC GAGACACTTC CATCTACGGC AGCAAGTCGT CCCGCACTCC TCACATCGAT GCCCTCGCCG CTCGCGGCGT CATCTTCACT CAAGCCTACT CGAGCAGTTC GCTCGATGAA CCCAGCCGGG CCAGCCTCCT CACTGGAAAA TGGCCCGCAC GTCTGAAGCT GACTCAATCG CGGGAACTCA ACCCGGGCGA AATTCTCGAA CCCTCTTTGC CACAAACCGC CCTTTCCCAT ATCTCGATGA TCACCCCGAC ATCACGCACT CAACTCCCTG GCGATGAACT CACTGCAGCA GAAATTCTCC AAAACGCAGG CTATGCCACC GCTTTCATGG GCGAATGGAA CCTGGGCGAA AACGCTTCTC AGCCAGAAAA TCAGGGCTTC TCTCACGTTG TCTGCAGTTC GCCGCTTACC AGTCAGCCGC AGTTCGCCGG TCAACATGCT GATGATCTGC TGACACAACA GGCCATCAAC TGGATGGAAA CCAATTCAAA GGAACCGTTC TTTCTGAATC TGTGGTATCA ATCAGTCGGT GCACCTTTTC AAGCTCCATC TGGGGATATA CAGCAGGCAC GCACATTGGC AGACCCCTCT CAAGATCCAC AGCAGGCCCC GGTCATGGCC GCCATGATCG CGGCACTCGA CCAGCGCGTC GGCCTCATTG TCGCCGCACT GGAGCGACTC CAGCTCACGC AGCGCACCAT CATCGTTTTC ACCTCCGATA ACGGTGGCAA CATGACCGAC ACGATCGAAG GCGACCTCCT CACCAGCAAT CGACCGCTCC GTGGTGGCAA AGGCTCGATG TATGAAGGGG GCAGTCGGGT CCCTCTCATC GTCGTCTGGC CTGGCGTCGC CACTCCTGCT CGCAGCTGCG ACGACGCCGT CAGTGCTGTG GATCTCCTCC CGACCCTGGT CGATATGGCG CGCGGCACCA TCCCCGCAGG TCATCAGATT GACGGCGTCA GTCTGAAACC CGCACTCACA GGGGCCACAG GTTTTGATCG AGGTGCCATC TTTCATCACT ACCCGCACTA CAACCCAACG ACAGGAACCA CACCTGCGAT CTCTGTTCGC AGTGAAAACA TGAAGCTCAT TCGCTTCTTC GGTGGTCATG TCACCCAGAC AGACCGCATT GAAGTCTACG ACCTGCAGAA TGATCCCGGC GAGCGCATCA ACCTCGCACG TTCACGCCGG GATGAAATCG TACGCCTGAC GAACCTCATC CAGAACTTTC TCATAGAAAC CCGCGCATTG GTTCCACAGA AAAATCCGAA CTTTGAAAGA CCGCAACAAG GCTGGGCGAC TGGTGCCGAT GCCGAAGTCG AAGATGGCGA GCTAGTCCTC AAACGTTCCG GAGATCGACC CATTCTCTTT GAAACCTACG ATGTACCGCA TGTGAATCGG CAACTGCGCC TTCGCCTGCC ACTCAAGACC TCGTCCCGCA TGGAAGGGCG TGTCATGTGG TCAACTGCCA GCGAGCCAAG CTTTACGCAA AACCGGCAGG CCAAATTTGC AGCCACACAG ACAGGTGAAT GGGAAACTCA CGAGATCACC TTGCCAGTTC AAGACCTGCT CACCGGGATC CGCATCGAGT TCGGTCGAGG AAATAGCGAC CTCTCTCTCG GCTGGATCCG GGCCGAACTG ATCGATGGCA ACCTCGTTAA AGAATGGCAA TTCGGCGAAG TCGAATCTGA GTAA
|
Protein sequence | MLPDRQAYLN QRRLTRIVSG LCLLFACLIW ILTTSPTPVV LAQITPETSV DTTSAALADR PNIIVIMVDD LGWRDTSIYG SKSSRTPHID ALAARGVIFT QAYSSSSLDE PSRASLLTGK WPARLKLTQS RELNPGEILE PSLPQTALSH ISMITPTSRT QLPGDELTAA EILQNAGYAT AFMGEWNLGE NASQPENQGF SHVVCSSPLT SQPQFAGQHA DDLLTQQAIN WMETNSKEPF FLNLWYQSVG APFQAPSGDI QQARTLADPS QDPQQAPVMA AMIAALDQRV GLIVAALERL QLTQRTIIVF TSDNGGNMTD TIEGDLLTSN RPLRGGKGSM YEGGSRVPLI VVWPGVATPA RSCDDAVSAV DLLPTLVDMA RGTIPAGHQI DGVSLKPALT GATGFDRGAI FHHYPHYNPT TGTTPAISVR SENMKLIRFF GGHVTQTDRI EVYDLQNDPG ERINLARSRR DEIVRLTNLI QNFLIETRAL VPQKNPNFER PQQGWATGAD AEVEDGELVL KRSGDRPILF ETYDVPHVNR QLRLRLPLKT SSRMEGRVMW STASEPSFTQ NRQAKFAATQ TGEWETHEIT LPVQDLLTGI RIEFGRGNSD LSLGWIRAEL IDGNLVKEWQ FGEVESE
|
| |