Gene Plim_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3501 
Symbol 
ID9140219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4525460 
End bp4526980 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content55% 
IMG OID 
Productsulfatase 
Protein accessionYP_003631513 
Protein GI296123735 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.486843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCATG AATTGATGAC CCGCCAGCCC GTAAATCATC GAGAAGACCG GGGTGGGCAT 
CGAATTTCCT GGTTCCTGTG TCACCTGGCG ATCAGTCTAT CGCTGTGTCT CTGGCAGGTT
GACTCGATCA CAAAAGTGAT GGCAGCCGAG GCTCGGCCTG AAAAACCGAA TGTCGTCATT
ATTAACTGCG ATGACCTGGG TTATGCCGAT GTGGGGGCGT TTGGTGCTAC GATCTGCAAG
ACGCCTGAGA TTGACAGGAT GGCGAGAGAA GGGGTGAAAG CCACTTCGTT TTACGTGGCG
CAAGCGGTTT GCTCGGCTTC TCGCACAGCC CTGCTCACAG GCTGCCTTCC GAATCGTATC
GGAATTCTCG GGGCGCTGAG TCATGTTTCG AAGAACGGCA TCGCCGACAG CGAAGTGACT
CTGGGCGAAC TCTTTCAGTC ACAAGGCTAT TCCACCGCCA TGTATGGCAA ATGGCACCTG
GGCTATCAGG CTCAGTTCCT GCCGGGTCAC CATGGATTTG GTGAAGCCTT GGGGATTCCT
TACTCGAACG ATATGTGGTC GAAGAACCCT TATGGCAAGT TCCCGCCCTT GCCTCTTTTC
CGTCAAAAAG GGGATTCACC CGCTGAGATC ATTGGCCATG ACACCGATCA ATCCCGCTTC
ACGACTGACT TCACGATGGC AGCCGTTTCC TTTATCGACC GGCACGCCGA CAAACCCTTC
TTCATCTATC TGGCTCACCC CATGCCCCAT ACGCCGATCT TTGTCAGCGA AGAGCGCAAC
AGCGGTGAGA GAGCCCAACT TTACCGAGAT GTCATCGGCG AGATTGACTG GTCGGTCGGA
ACCATTCGGC AGACACTCGA AAAGCATCAA CTCACTCGCA AAACACTGGT GATTTTTACT
TCTGATAATG GCCCGTGGCT GGTTTTTGGA AATCATGCCG GCAGCACAGG GCCGCTCCGC
GAAGGGAAAG GAACAATGTG GGATGGTGGT GCCCGCGTTC CGTTTGTGGC CTGCTGGCCC
GGTGTCATTC CGCCCGATAC GACGGTCGAT CTTCCCATGG CCACCTACGA CCTCTTTCCC
ACCTTTGCCA AAATGCTCGG TGCTAAACTT CCTGATCATC CCATTGATGG CGTCGATATC
TGGCCTCAGT TAACCAGTGC CTCGAAAGCT CAACCCCACC AGGCGCTATG GTTCTACTAT
GGCCGCGATC TGATTGCTGT GCGTTCGGGC CCATGGAAGC TCGTCTTCCC GCACACTTAC
GTCCATCCCG TCGAACGGGG AAACGACGGC CAGCGCGGTA AGCTCGTCAA CCGGAAGTTC
ACGGAGCTGG CACTTTACAA TCTGGATTCT GATATTGGCG AAACGACGAA TCTGGCCAGC
CAGCACCCTG AAATCGTCAA GCAGTTGGAG GCCTATGCTG AGGTCGCCCG TAATGAACTG
GGTGACGCTT TGACCAATCG CAAAGGTAGC GGAGTCCGCC CTCCCGGCAC AGTCAACGAT
TCCCCAGCAC TCGGAAACTA A
 
Protein sequence
MRHELMTRQP VNHREDRGGH RISWFLCHLA ISLSLCLWQV DSITKVMAAE ARPEKPNVVI 
INCDDLGYAD VGAFGATICK TPEIDRMARE GVKATSFYVA QAVCSASRTA LLTGCLPNRI
GILGALSHVS KNGIADSEVT LGELFQSQGY STAMYGKWHL GYQAQFLPGH HGFGEALGIP
YSNDMWSKNP YGKFPPLPLF RQKGDSPAEI IGHDTDQSRF TTDFTMAAVS FIDRHADKPF
FIYLAHPMPH TPIFVSEERN SGERAQLYRD VIGEIDWSVG TIRQTLEKHQ LTRKTLVIFT
SDNGPWLVFG NHAGSTGPLR EGKGTMWDGG ARVPFVACWP GVIPPDTTVD LPMATYDLFP
TFAKMLGAKL PDHPIDGVDI WPQLTSASKA QPHQALWFYY GRDLIAVRSG PWKLVFPHTY
VHPVERGNDG QRGKLVNRKF TELALYNLDS DIGETTNLAS QHPEIVKQLE AYAEVARNEL
GDALTNRKGS GVRPPGTVND SPALGN