Gene Plim_1568 details


Gene Information

Locus tag: Plim_1568
Symbol:
ID: 9138268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 2020556
End bp: 2022472
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: sulfatase
Protein accession: YP_003629600
Protein GI: 296121822
COG category:
COG ID:
TIGRFAM ID:
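
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions agree with the values listed here but are not stated by the record. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2020556, 2022472)
print(length)                           # 1917
print(expected_protein_length(length))  # 638
```

Both results match the Gene Length (1917 bp) and Protein Length (638 aa) fields of this record.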


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0365787
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGATGGG CGAGATTCGT GTGGTTGAGA CGATGGCATG GGCTGATGGG CTTCGGAATA 
TCGTCATACG TGATGCTGCT GGCCTGCTGT GTGGTGCTGA TCAGTCCCAC CTCGTCAGAG
GCTCAACAAT CCGCCAGGCC ACAGCCTCCC AACATCGTGG TGCTGCTCGC GGATGATGCG
GGTTGGGGTG ATTACTCAGT CAGTGGAAAT CCCTATGTAA AGACTCCGCA TATCGACTCC
ATCGCTCAAC AGGGAGTCTC GCTGACAAAT TTTTATGTCT GCCCGGTCTG CTCACCAACT
CGTGCCGAAT TTCTTACCGG TCGATATGCC CTCCGCACGG GTGTTCGCGG AGTCTCCCTG
GGTGAGGAGC GACTCAACCT CGATGAGCAG TCGATTGCGG AGCACTTCCG AAAAGCGGGT
TATCGAACCG GGATCTTTGG CAAATGGCAC AATGGGTCAC AAGGCCCCTA TCACCCATTG
GCTCGGGGAT TCGATGTCCA GTTGGGTTAC ACCGCCGGCC ATTGGAGCGA ATACATCGAT
GCTCCACTCG AGTCGCAAGG CCGCCCTGTC ACTTCCGAGG GTTACATTGT CGATACCTGC
ATGAATGCGG CTATCGATTT TATCTCCAGC AGCCAGCCAC CATTCTTCTG CTATGTTCCT
CTCACGACAC CTCATTCCCC CTGGTGCGTC CCTCAAACGT ATTGGAATCG CTGGAAAGAT
CGCGATGTCG GTTTGACTGG CAAAGAGGCC GACGCGGTCC GATGTGTCTA TGCCATGATG
GAACAGCAGG ATGATGCCGT CGGCCGGTTA CTCGCACGCC TCGATTCGCT CCACCTTTCC
TCCAACACGA TTGTCCTCTA TTTCTCTGAT AACGGGCCCA ACACAGTCCG CTGGAATGGC
GACATGCGCG GCCGCAAAGG GACTGTTGAT GAAGGTGGAG TTCGTTCCGT CGGATTTCTG
CGGTGGCCGG GGCATATCCC TCCAGGTTCC ACACAAACCG GACTCATCGG AGCAATTGAT
CTGCTCCCCA CATTGGCGGG TCTGGCGAAT ATCCCGCTCG ATTCCGCCAA GCCACTCGAT
GGTCTTGATG TCTCGACGGC ACTCCTGAAA AATGTCCCCT TTGAGCGTCA ACAACCTCTA
CTCTCACATT GGGCAGGCAA GTTCAGCTTG CGTTCCCCCA CTCATCGGCT CGATTTCCAG
AGTCGCTTGT ATGACATGCA GCATGACCGC AGTCAGAAAG TCGACCTCAG CACCACTCAT
CCTGAAATTG CCAGCGTCAT GAGGCAGCAA CTTGAACGCC TGAAAACTGA GTTGTTGCAG
CCTCCCGCAA CCAGCCAACA GTCCGCGCAA CTCGATACGG CGGACATACT CAAATCCCCC
TGGTTGCTGC CTGCGCACAA AGAGTTTCTC GCCCATCCAG CGCAAGATTC CCGACTCTAT
CCTGTAGGAT ATGTTTCTCT GCCATACACC TGGCTCCCCG CACGCGATGG CCAACCTCTC
GGCAAGATTC AGAGAAGTTC GAATGCTCCC AACAGTTCTT ATTTTACGAA CTGGAATGAT
GACCAAAGTG CGGTCGTCTG GCCGGTGGAT ATCCTCACGA GTGGGGCATA TCGCATTGAA
CTGGAATCGA CATCACCGGC AACCGCAATA GGCTCTGAAT TGGAAATCAG TTTTCAGGAC
AGCAAACTCT CCGTGCTGAT TTCCATCTCT CACGATCCAC CCACGGACAC CCGGCAGGAC
ACGATTCCCC GCCCCAAAGC CGAGACACTC TCGAAGCCAT TTCGACGCTG GCAGGCCGGG
TCGATTCAAC TTCCTGCAGG CCCGGGCTTA CTGACGATTC GTGCGACTCA GCTCCAGGGT
TCCTCCATCA TCGATCTGAA ATCCATCAAC CTGATTCTGG AGGAGCAGCG ACCGTGA
 
Protein sequence
MRWARFVWLR RWHGLMGFGI SSYVMLLACC VVLISPTSSE AQQSARPQPP NIVVLLADDA 
GWGDYSVSGN PYVKTPHIDS IAQQGVSLTN FYVCPVCSPT RAEFLTGRYA LRTGVRGVSL
GEERLNLDEQ SIAEHFRKAG YRTGIFGKWH NGSQGPYHPL ARGFDVQLGY TAGHWSEYID
APLESQGRPV TSEGYIVDTC MNAAIDFISS SQPPFFCYVP LTTPHSPWCV PQTYWNRWKD
RDVGLTGKEA DAVRCVYAMM EQQDDAVGRL LARLDSLHLS SNTIVLYFSD NGPNTVRWNG
DMRGRKGTVD EGGVRSVGFL RWPGHIPPGS TQTGLIGAID LLPTLAGLAN IPLDSAKPLD
GLDVSTALLK NVPFERQQPL LSHWAGKFSL RSPTHRLDFQ SRLYDMQHDR SQKVDLSTTH
PEIASVMRQQ LERLKTELLQ PPATSQQSAQ LDTADILKSP WLLPAHKEFL AHPAQDSRLY
PVGYVSLPYT WLPARDGQPL GKIQRSSNAP NSSYFTNWND DQSAVVWPVD ILTSGAYRIE
LESTSPATAI GSELEISFQD SKLSVLISIS HDPPTDTRQD TIPRPKAETL SKPFRRWQAG
SIQLPAGPGL LTIRATQLQG SSIIDLKSIN LILEEQRP
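
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon table by hand rather than using a library, and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). It does not special-case alternative start codons such as GTG or TTG, which is sufficient here because the gene begins with ATG. The name `translate` is an illustrative choice.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGCGATGGG CGAGATTCGT GTGGTTGAGA CGATGGCATG GGCTGATGGG CTTCGGAATA"
print(translate(first_line))  # MRWARFVWLRRWHGLMGFGI
```

The printed residues correspond to the first two 10-residue groups of the protein sequence above (MRWARFVWLR RWHGLMGFGI).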