Gene Plim_1567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_1567 
Symbol 
ID9138267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2018988 
End bp2020559 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content54% 
IMG OID 
Productsulfatase 
Protein accessionYP_003629599 
Protein GI296121821 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0183451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACAACA ATGTTTCTCT CTCAAGTTTC ACACGATGGA TTCATAACCA ATCGGCATGG 
GCCCTGCTGC TGGCAGTCAG TTTGATCCAT TCGACGATGG TGGTGGCTCA CGAGAATTCT
CCTCAATCTC CACCCAAGCG CCCCAATGTC CTGATGATTG CCATTGATGA CCAGAATGAC
TGGATCGAAC CTCTGGGTGG GCATCCACTG GTCAAAACGC CGCAACTTAA ATCACTTGCT
GAGCGAGGTA CGGTCTTTTT GAATGCTCAT TGCCAGGCCC CTTTATGCAA TCCTTCGCGA
ACGAGTCTTC TCTTAGGCTT GCGGAGCACG ACAACGGGCA TCTATGGATT GTCTCCCTGG
TTTCGAGATG TCCCGGCGCT CTCGGGACGA CTGACGTTGC CGCAGGCCTT TGGCAAGGCA
GGCTATACCA CTCTCAGTAC AGGAAAGATC TTTCATGGAG GTGGCGGTAA GCCCAAAGAT
CGCCTGAAAG AGTTCGACGA ATGGGGCCCA GCGGGAGGTG TCGGAAAACG TCCTGAAAAG
CGGCTTATCC AGCCTCCGCC TCACTCCAAT CCACTGGTCG ATTGGGGTGC CTTTCCTCAT
CTCGACAGTG AGAAAGGCGA TACTCAGATC ACCGATTGGG CCATTGAAAA ACTCAAACAG
CGGCAAGTCC AACAGTCGTC ATCAACAGGT GAATCCAAAC CTTTTCTGAT GTGTGTGGGG
TACTTCCTGC CACATGTTCC CTGCTACGTC ACGCCCGAAT GGCTGGCCAT GTATCCTGAT
GACGATTCGA TTTTGCCGTT CATCGAAAAA GATGATCGAA AGGATACCCC CCGCTTCTCC
TGGTATCTGC ATTGGCGGCT TCCCGAACCA CGACTCAAAT GGCTGCAGCA GCATGAGCAC
TGGAGATCTC TGGTGCGTTC CTACCTGGCG TCGACTTCGT ATGTCGATGC CCAGATCGGG
CGACTGTTGG CCGCGCTGGA AGCGACAGGC GAGGCAAACA ATACGTTGAT CGTCCTCTGG
TCGGACCATG GCTGGCATCT GGGTGAGAAA GGGATCACGG GTAAGAACAC GCTCTGGGAA
CGCTCCACCC GTGTGCCTCT CCTCTTCGCC GGCCCGGGAG TTCTCGCAGG TGGAAAATGT
GTAGAACCCG TCGAACTGCT CGATATCTAC CCCACTCTGG CACAGCTTTG CCAGCTTGAG
GCCCCGACTG ATCTGGAAGG GGTCTCACTG GTTCCGCAAT TGACAAACCC ACTCGCTGTT
CGCCAGCGAC CGGCAATCAC TTCCCACAAT CAAGGCAACC ATGCGATCCG TACGCGAGAT
CATCGCTACA TTCGCTATGC CGATGGATCG GAAGAGTTGT ACGATCACCT CGTCGATCCT
CATGAACTCA AGAATCTTGC CGATGATCCT GCACATTCAG GCCTCAAGAA ACAGCTCAAT
TCATGGCTCC CATCGATCGA TCAACCACCT GTGACGGGAA GTAAAGACCG CGTTCTCACC
TTTGACCGGC AGACGAACCG CGCGATCTGG GAAGGCGAGA TCATTGAGCG TTCGTCACCC
ATCCCGGAGT AG
 
Protein sequence
MNNNVSLSSF TRWIHNQSAW ALLLAVSLIH STMVVAHENS PQSPPKRPNV LMIAIDDQND 
WIEPLGGHPL VKTPQLKSLA ERGTVFLNAH CQAPLCNPSR TSLLLGLRST TTGIYGLSPW
FRDVPALSGR LTLPQAFGKA GYTTLSTGKI FHGGGGKPKD RLKEFDEWGP AGGVGKRPEK
RLIQPPPHSN PLVDWGAFPH LDSEKGDTQI TDWAIEKLKQ RQVQQSSSTG ESKPFLMCVG
YFLPHVPCYV TPEWLAMYPD DDSILPFIEK DDRKDTPRFS WYLHWRLPEP RLKWLQQHEH
WRSLVRSYLA STSYVDAQIG RLLAALEATG EANNTLIVLW SDHGWHLGEK GITGKNTLWE
RSTRVPLLFA GPGVLAGGKC VEPVELLDIY PTLAQLCQLE APTDLEGVSL VPQLTNPLAV
RQRPAITSHN QGNHAIRTRD HRYIRYADGS EELYDHLVDP HELKNLADDP AHSGLKKQLN
SWLPSIDQPP VTGSKDRVLT FDRQTNRAIW EGEIIERSSP IPE