Gene Plim_3949 details

Gene Information

Locus tag: Plim_3949
Symbol:
ID: 9140668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 5072123
End bp: 5073535
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: sulfatase
Protein accession: YP_003631959
Protein GI: 296124181
COG category:
COG ID:
TIGRFAM ID:
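
The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino-acid count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5072123, 5073535)
print(length)                     # 1413
print(protein_length_aa(length))  # 470
```

Under these assumptions both values agree with the Gene Length and Protein Length fields above.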


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.184377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCTGC GTAACGGCTT CCATTTCTCT TCGATCTGTC TCGTGGGGAT CTGCCTGGCA 
GGAATTTCTT CGATCTGCGA TCTGGCACAA GGGGCTGAGC CAACTCAGAC CTCCCGGAAG
CCGAATGTCA TCATCTTCTA CGCGGATGAC CTCGGATGGG GAGAAACCGG GATTCAAGGA
AATCCACAGA TTCCCACGCC TCACATCGAT TCAATCGCCA AAAATGGGGT GCGATGCACT
CAAGGCTTTG TCGCAGCGAC CTACTGCAGT CCTTCGCGAG CCGGCTTGTT GACCGGCCGC
TATCCCACAC GCTTCGGCCA TGAGTTTAAT CGGATTGCCA ATGTCTCTGG CCTCGATCTT
CAGGAAACAA CTCTGGCTGA TCGCCTGCAT GGCTTAGGCT ACAAAACTGC CTGTGTCGGC
AAATGGCACC TGGGAGACGG CCCGGAATAT CGACCAACAA AACGAGGTTT TGACGAGTTC
TTCGGCACAC TCGCGAACAC CCCGTTTTTT CATCCCACCA AGTTTGTCGA TTCCCGAGTC
TCGAATGATG TCGCAGAAGT CTCCGACGAA AACTTTTACA CCACAGACGA ATACGCCAAA
CGCTCAGTGG AGTGGATTGG ACAGCAACAG CAGTCTCCCT GGTTTTTGTA TCTTCCGTTC
AATGCACAGC ATGCTCCACT GCAGGCTCCA CAGAAGTATC TGGATCGCTT TGAATCGATC
GCAGATCCCA AGCGTAAGCT CTTCGCAGCC ATGATGTCCG CCATGGATGA CGCCATTGGT
CAGGTGCTGG GCAAGGTGCG AGAACTCGGG CAGGAAGAAA ACACACTGGT CTTCTTCATT
TCCGACAATG GGGGCCCGAC CCAAGGCACG ACATCTCAGA ATGGCCCCTT GCGCGGCTTC
AAAATGACCA CCTTCGAAGG GGGAACACGC GTGCCGTTCC TCGTTCAATG GAAAGGTAAG
CTCCCCGCTG GAAAAACTTA CGACAATCCT GTCATCAACC TGGATGTTCT GCCGACCGTG
CTCACCGCAG CAGGGAGCAA AATCGATCCC GCCTGGAAGC TGGATGGTGT TGATCTGGTG
CCTTATTTTA CAAGTTCCAT CGCAAACAAG CCCCACGAAA CCTTGTACTG GCGATTTGGT
GAGCAATGGG CTGTTCGCCA GGGCGATTGG AAGCTGGTTG TCGCCCGCGG AGGGAGTGGA
CAGCCCGAAC TCTACGATCT GGCGAGTGAT ATTGCCGAGT CGAAAAATCT CGCTTCAGAA
AACCCCGCCA AGGTCAAAGA ATTGCAGGCA CTATGGGATC AATGGAGTCA CGAACAGGCT
GCTCCCAAAG TTGTTGACCA GCCCAATAAC GCCAAGAAGG CAGGAAACAA AAAAGGCGCC
AAGAAGAAAG CCGCAGCCGG TTCCGCCACT TAG
 
Protein sequence
MVLRNGFHFS SICLVGICLA GISSICDLAQ GAEPTQTSRK PNVIIFYADD LGWGETGIQG 
NPQIPTPHID SIAKNGVRCT QGFVAATYCS PSRAGLLTGR YPTRFGHEFN RIANVSGLDL
QETTLADRLH GLGYKTACVG KWHLGDGPEY RPTKRGFDEF FGTLANTPFF HPTKFVDSRV
SNDVAEVSDE NFYTTDEYAK RSVEWIGQQQ QSPWFLYLPF NAQHAPLQAP QKYLDRFESI
ADPKRKLFAA MMSAMDDAIG QVLGKVRELG QEENTLVFFI SDNGGPTQGT TSQNGPLRGF
KMTTFEGGTR VPFLVQWKGK LPAGKTYDNP VINLDVLPTV LTAAGSKIDP AWKLDGVDLV
PYFTSSIANK PHETLYWRFG EQWAVRQGDW KLVVARGGSG QPELYDLASD IAESKNLASE
NPAKVKELQA LWDQWSHEQA APKVVDQPNN AKKAGNKKGA KKKAAAGSAT
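
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: `translate` and `gc_percent` are hypothetical helper names, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
print(translate("ATGGTTCTGC GTAACGGCTT CCATTTCTCT"))  # MVLRNGFHFS
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence above, and `gc_percent` on the same input can be compared with the GC content field.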