Gene Plim_1519 details

Gene Information

Locus tag: Plim_1519
Symbol:
ID: 9138219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1958804
End bp: 1960417
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: sulfatase
Protein accession: YP_003629551
Protein GI: 296121773
COG category:
COG ID:
TIGRFAM ID:
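
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1958804
end_bp = 1960417

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which is not translated into an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1614 0 537
```

Under these assumptions the result matches the listed gene length of 1614 bp and protein length of 537 aa.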


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0408631
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTTCC TTCATCCTGT CTCTCGTGTG TTGAGCGGTG GCCAGTATCT GCTGCTGGCT 
GGTCTTTGCT TGATGGAGGT TCTGTTTGCC AAAGGCTTAT TGGCGAATCA AAACTCTTCT
ACTCCCCAAC CAACTGCGGA AGACCGTCGG CCGCATCTTG TACTTATCCT TGCTGACGAC
ATGACGTACG ATGCGATGAG TCTCTTCGAT CGGCGCGAGG TGCAAACACC TCATCTCGAT
GCGCTCGCTC GTCGCGGAAC CTTGTTCACG CATGCCGCCA ATATGGGTTC ATGGTCTCCG
GCAGTCTGCA TTGCCAGTCG ATCAATGCTG CTCACCGGCC GCACACTCTG GAAAGCTGAA
AAGTCCCACA AACAATTGTC TCATTTGGCC TCGCAGGAAC TTCTCTGGCC GCAACGACTC
AAAAAAGCTG GGTATCGCAC CTTCATTTCT GGCAAATGGC ATTTACCAGT CCCACCCGAG
AAAGTCTTTA ATCAAGCGGT TCATATTCGC CCCGGCATGC CCGCAGACAA ACCCGCTGGA
TACAATCGCC CGTTGGAAGA CACGAAAGAC GTCTGGTCGG CCTCAGACAA AACGTTCGGA
GGCTACTGGG AAGGTGGCAC CCATTGGTCG GAAGTTCTCG CCAACGATGC AGAAACTTTT
CTCGCCTCCC ACGCGAGAGC CAACTCACCT ATTCTCATGT ATCTGGCCTT CAATGCGCCG
CACGACCCGA GACAGGCCCC ACAGGAGTTT CTCGATCAGT ACACGGCCAG CCATCTCAAA
CTTCCGGCAC CATTCCTGCC CAGATATCCT TTCGCAGAAG CCATGGGTTG CCCTCCGAGT
CTACGCGACG AGCGGCTGGC CCCCTTCCCC CGAACCGAAG CAGCCATCAG ACAGCACCTC
AAAGAGTATT ATGCTCTCAT TACTCACCTC GATGCCCAGG TCGGGAAGAT CCTCCGCGCC
ATCGAATCCA GCGGCACTCA GCGAGAGACC ATTATCGCCT TCACCGCAGA TCATGGTCTC
GCTATCGGCC AGCACGGACT CATGGGGAAA CAAAACCTCT ATGATCACAG CACCCGGGTC
CCCCTGTTTT TTGTCAGCAT TCCCGCAGGA TCATCCGCTA AATCACCTCA TCGGGAGACC
ACCATTCCCG AGGGGAAGAA AGTGACCCGT CCGGTCTACC TGCAAAGCGT AGCCGCCACG
TTTCTTGACC TGGCTGGGAT GCCTCTTTTG CCAGAGGAAG AATACCCGTC ACTGCTCCCG
CTGATTCAGG AAACGACCAA CATTGATGGG GCCACTTCTC AGAATTTTCA GAACCTGCGT
CTCGCCAAAG AGACTGATTC GTCGTCGGCA AAGGACTGGG ACGACGATCT GATTGGCAGC
TATCTCAATC GGCAACGTTC CATCACCAGC GACGGAATGA AGCTGATCCT CTACCCGGAG
GCAAACGCTG AACGTCTTTA CAACCTCAAA ACAGACCCTT TAGAGCAGAC CGACCTCTCG
GAAGATGAAT CCCATTGCGC AATCCGAAAC CAACTCCGGG AACGACTGAT TCAGTGGCAG
AAAACATTGG AAGATCCGCT CCATCTCAAC GGAAAAGATA ACGCCTCGCG ATGA
 
Protein sequence
MNFLHPVSRV LSGGQYLLLA GLCLMEVLFA KGLLANQNSS TPQPTAEDRR PHLVLILADD 
MTYDAMSLFD RREVQTPHLD ALARRGTLFT HAANMGSWSP AVCIASRSML LTGRTLWKAE
KSHKQLSHLA SQELLWPQRL KKAGYRTFIS GKWHLPVPPE KVFNQAVHIR PGMPADKPAG
YNRPLEDTKD VWSASDKTFG GYWEGGTHWS EVLANDAETF LASHARANSP ILMYLAFNAP
HDPRQAPQEF LDQYTASHLK LPAPFLPRYP FAEAMGCPPS LRDERLAPFP RTEAAIRQHL
KEYYALITHL DAQVGKILRA IESSGTQRET IIAFTADHGL AIGQHGLMGK QNLYDHSTRV
PLFFVSIPAG SSAKSPHRET TIPEGKKVTR PVYLQSVAAT FLDLAGMPLL PEEEYPSLLP
LIQETTNIDG ATSQNFQNLR LAKETDSSSA KDWDDDLIGS YLNRQRSITS DGMKLILYPE
ANAERLYNLK TDPLEQTDLS EDESHCAIRN QLRERLIQWQ KTLEDPLHLN GKDNASR
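
Both sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below is an illustration only and not part of the original record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo input is just the first display line of the gene sequence, so its GC value is not expected to equal the figure reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Demo input: only the first display line of the gene sequence above.
first_line = clean_sequence(
    "ATGAACTTCC TTCATCCTGT CTCTCGTGTG TTGAGCGGTG GCCAGTATCT GCTGCTGGCT"
)

print(len(first_line))
print(round(gc_percent(first_line), 1))
print(first_line.startswith("ATG"))  # checks for an ATG start codon
```

Passing the complete gene sequence through `clean_sequence` and `gc_percent` would be one way to compare against the listed gene length and GC content.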