Gene Plim_3662 details


Gene Information

Locus tag: Plim_3662
Symbol:
ID: 9140380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 4709524
End bp: 4711437
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: sulfatase
Protein accession: YP_003631673
Protein GI: 296123895
COG category:
COG ID:
TIGRFAM ID:
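
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the constant and function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4709524
END_BP = 4711437
GENE_LENGTH_BP = 1914
PROTEIN_LENGTH_AA = 637


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, 4711437 - 4709524 + 1 gives 1914 bp, and 1914 / 3 - 1 gives 637 aa, matching the Gene Length and Protein Length fields.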


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.33998
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGCCGG ATCGCCAAGC GTACTTGAAT CAGCGTCGCC TCACTCGCAT TGTCTCGGGG 
CTTTGCCTGC TCTTTGCCTG CCTGATCTGG ATCCTCACAA CCAGCCCGAC ACCTGTCGTA
CTCGCACAAA TCACTCCGGA AACGAGTGTT GACACCACCT CGGCAGCCCT CGCCGACCGC
CCCAACATCA TCGTCATCAT GGTCGATGAC CTCGGCTGGC GAGACACTTC CATCTACGGC
AGCAAGTCGT CCCGCACTCC TCACATCGAT GCCCTCGCCG CTCGCGGCGT CATCTTCACT
CAAGCCTACT CGAGCAGTTC GCTCGATGAA CCCAGCCGGG CCAGCCTCCT CACTGGAAAA
TGGCCCGCAC GTCTGAAGCT GACTCAATCG CGGGAACTCA ACCCGGGCGA AATTCTCGAA
CCCTCTTTGC CACAAACCGC CCTTTCCCAT ATCTCGATGA TCACCCCGAC ATCACGCACT
CAACTCCCTG GCGATGAACT CACTGCAGCA GAAATTCTCC AAAACGCAGG CTATGCCACC
GCTTTCATGG GCGAATGGAA CCTGGGCGAA AACGCTTCTC AGCCAGAAAA TCAGGGCTTC
TCTCACGTTG TCTGCAGTTC GCCGCTTACC AGTCAGCCGC AGTTCGCCGG TCAACATGCT
GATGATCTGC TGACACAACA GGCCATCAAC TGGATGGAAA CCAATTCAAA GGAACCGTTC
TTTCTGAATC TGTGGTATCA ATCAGTCGGT GCACCTTTTC AAGCTCCATC TGGGGATATA
CAGCAGGCAC GCACATTGGC AGACCCCTCT CAAGATCCAC AGCAGGCCCC GGTCATGGCC
GCCATGATCG CGGCACTCGA CCAGCGCGTC GGCCTCATTG TCGCCGCACT GGAGCGACTC
CAGCTCACGC AGCGCACCAT CATCGTTTTC ACCTCCGATA ACGGTGGCAA CATGACCGAC
ACGATCGAAG GCGACCTCCT CACCAGCAAT CGACCGCTCC GTGGTGGCAA AGGCTCGATG
TATGAAGGGG GCAGTCGGGT CCCTCTCATC GTCGTCTGGC CTGGCGTCGC CACTCCTGCT
CGCAGCTGCG ACGACGCCGT CAGTGCTGTG GATCTCCTCC CGACCCTGGT CGATATGGCG
CGCGGCACCA TCCCCGCAGG TCATCAGATT GACGGCGTCA GTCTGAAACC CGCACTCACA
GGGGCCACAG GTTTTGATCG AGGTGCCATC TTTCATCACT ACCCGCACTA CAACCCAACG
ACAGGAACCA CACCTGCGAT CTCTGTTCGC AGTGAAAACA TGAAGCTCAT TCGCTTCTTC
GGTGGTCATG TCACCCAGAC AGACCGCATT GAAGTCTACG ACCTGCAGAA TGATCCCGGC
GAGCGCATCA ACCTCGCACG TTCACGCCGG GATGAAATCG TACGCCTGAC GAACCTCATC
CAGAACTTTC TCATAGAAAC CCGCGCATTG GTTCCACAGA AAAATCCGAA CTTTGAAAGA
CCGCAACAAG GCTGGGCGAC TGGTGCCGAT GCCGAAGTCG AAGATGGCGA GCTAGTCCTC
AAACGTTCCG GAGATCGACC CATTCTCTTT GAAACCTACG ATGTACCGCA TGTGAATCGG
CAACTGCGCC TTCGCCTGCC ACTCAAGACC TCGTCCCGCA TGGAAGGGCG TGTCATGTGG
TCAACTGCCA GCGAGCCAAG CTTTACGCAA AACCGGCAGG CCAAATTTGC AGCCACACAG
ACAGGTGAAT GGGAAACTCA CGAGATCACC TTGCCAGTTC AAGACCTGCT CACCGGGATC
CGCATCGAGT TCGGTCGAGG AAATAGCGAC CTCTCTCTCG GCTGGATCCG GGCCGAACTG
ATCGATGGCA ACCTCGTTAA AGAATGGCAA TTCGGCGAAG TCGAATCTGA GTAA
 
Protein sequence
MLPDRQAYLN QRRLTRIVSG LCLLFACLIW ILTTSPTPVV LAQITPETSV DTTSAALADR 
PNIIVIMVDD LGWRDTSIYG SKSSRTPHID ALAARGVIFT QAYSSSSLDE PSRASLLTGK
WPARLKLTQS RELNPGEILE PSLPQTALSH ISMITPTSRT QLPGDELTAA EILQNAGYAT
AFMGEWNLGE NASQPENQGF SHVVCSSPLT SQPQFAGQHA DDLLTQQAIN WMETNSKEPF
FLNLWYQSVG APFQAPSGDI QQARTLADPS QDPQQAPVMA AMIAALDQRV GLIVAALERL
QLTQRTIIVF TSDNGGNMTD TIEGDLLTSN RPLRGGKGSM YEGGSRVPLI VVWPGVATPA
RSCDDAVSAV DLLPTLVDMA RGTIPAGHQI DGVSLKPALT GATGFDRGAI FHHYPHYNPT
TGTTPAISVR SENMKLIRFF GGHVTQTDRI EVYDLQNDPG ERINLARSRR DEIVRLTNLI
QNFLIETRAL VPQKNPNFER PQQGWATGAD AEVEDGELVL KRSGDRPILF ETYDVPHVNR
QLRLRLPLKT SSRMEGRVMW STASEPSFTQ NRQAKFAATQ TGEWETHEIT LPVQDLLTGI
RIEFGRGNSD LSLGWIRAEL IDGNLVKEWQ FGEVESE
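
The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before use. The following is a minimal sketch of how the gene sequence could be cleaned, GC-counted, and translated. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (it differs only in permitted start codons) and that translation begins at an ATG, as it does here. The helper names `clean`, `gc_count`, and `translate` are illustrative, and only the first line of the gene sequence is used as input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
# Assumption: table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_text.split()).upper()


def gc_count(dna: str) -> int:
    """Count G and C bases."""
    return sum(base in "GC" for base in dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
FIRST_LINE = "ATGTTGCCGG ATCGCCAAGC GTACTTGAAT CAGCGTCGCC TCACTCGCAT TGTCTCGGGG"
dna = clean(FIRST_LINE)

if __name__ == "__main__":
    print(len(dna), gc_count(dna), translate(dna))
```

For this first line, the translation reproduces the first 20 residues of the protein sequence (MLPDRQAYLN QRRLTRIVSG). Passing the full gene sequence through the same functions would allow the 57% GC content and the 637 aa protein to be compared against the record.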