Gene Plim_0354 details


Gene Information

Locus tag: Plim_0354
Symbol:
ID: 9137013
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 439671
End bp: 441152
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: sulfatase
Protein accession: YP_003628404
Protein GI: 296120626
COG category:
COG ID:
TIGRFAM ID:
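
The coordinates and lengths listed above are mutually consistent, which a short arithmetic check makes explicit. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 439671
end_bp = 441152

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1482 493
```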


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCTCT ATCTAGCGAT GACGCTATTG ATTCTGCTGA TTGGAAGTCC ATGGGTGCAG 
GCGGCACCTC CGAATATCGT CATTCTGTAC GCGGATGATA TGGGCTATGG CGATCTTCAC
ATTCAGAATC CAGAGTCCCG GATCCCCACA CCACATCTTG ACCGACTCGC CCGACAAGGG
ACTCGCTTTA CCGATGCCCA CAGTTCGTCT GGCATCTGTA CGCCCAGTCG ATATGCACTA
TTGCAGGGGC GATATCACTG GCGCAAGTTT CACGGCATCG TGAACTCGTT CGACCCTCCC
GTGCTTGACG ATGAAAAGCT GACGATTGCC GAACTTTTGA AAACCAAGGG ATACCGAACG
GCCTGTATCG GTAAATGGCA CCTGGGCTGG GATTGGAATG CCATTAAGAA ACAAGGTGTA
AAACCGACTG ACAAAGCCGG TTTTGCCGCT GATGCTTTTG ACTGGAGCCA ACCCATTCCG
GGTGGGCCAC GATCACACGG GTTTGACTAC TACTTTGGCG ATGACGTCCC GAACTTTCCG
CCCTATGCCT GGTTTGAGAA TGATCGTGTC ATCACGACAC CGACCGTCAC TTTAAAAACG
ACAGCACCCA CTGCAGAAGG AAGCTGGGAG GCTCGGCCGG GCCCGGCTGT TCAAGACTGG
GATTTCTGGG CTGTCATGCC CACACTCACG CAAAAAGCCG AGCAGTGGAT CAGCGAGCAG
AAAGCCGATC AACCGTTCTT TCTCTACTTC CCTTTCACTT CGCCGCATGC CCCGATTGTG
CCGACATCGG ATTTCACAGG TAAATCACAG GCTGGTGGCT ATGGCGACTT TATGTTCCAG
ACAGACGACA CGGTCGGCCG CGTGCTGGCG GCTCTTGAGA AGCATGGGTT TTCGGAAAAT
ACACTCGTGA TTTTCACGGC TGATAATGGC CCTGAGCGCT ACGCTTACGA TCGGATTCGA
AACTTTGGTC ATCGCAGCAT GGGCCCACTG CGCGGGCTGA AACGCGACAT CTGGGAAGGT
GGACATCGTG TGCCGATGAT TGTCCGCTGG CCGGGTGTAG TCCCCGCTGA AAAGGTGTGT
GATGAACTCA TCAGTCAGAT CGATCTCTTC GCAACCATTG CGGCTGTTGT TGATGCAGAA
ATCGCTCCAG GCTCCGCAGA AGACAGCTAC AATCAACTGG AATTGCTCAA AGGGACTGGT
TCCAGTGCTC GCCAGACTCT GGTTCACAAC ACGAATCCCA AAGGCTATGC CCTCCGGCAT
GGTGACTGGG TACTGATCGA CGCCAAAACT GGTGCGGTCA GTCAGGTTCC CAAGTGGTTC
GATGAAGCCA ATGGTTACAC CAGCCACTCA TTGCCGGGTG AACTCTATAA CTTGAAGGAC
GACCTCGCTC AGCGTCAGAA TCTGTATGCT GAGAATCCTG AAAAAGTTGC TGAGTTGAAA
GCTCTACTCG GAAAAATTCA GGCCCAGGGC CAGGTTCGCT GA
 
Protein sequence
MPLYLAMTLL ILLIGSPWVQ AAPPNIVILY ADDMGYGDLH IQNPESRIPT PHLDRLARQG 
TRFTDAHSSS GICTPSRYAL LQGRYHWRKF HGIVNSFDPP VLDDEKLTIA ELLKTKGYRT
ACIGKWHLGW DWNAIKKQGV KPTDKAGFAA DAFDWSQPIP GGPRSHGFDY YFGDDVPNFP
PYAWFENDRV ITTPTVTLKT TAPTAEGSWE ARPGPAVQDW DFWAVMPTLT QKAEQWISEQ
KADQPFFLYF PFTSPHAPIV PTSDFTGKSQ AGGYGDFMFQ TDDTVGRVLA ALEKHGFSEN
TLVIFTADNG PERYAYDRIR NFGHRSMGPL RGLKRDIWEG GHRVPMIVRW PGVVPAEKVC
DELISQIDLF ATIAAVVDAE IAPGSAEDSY NQLELLKGTG SSARQTLVHN TNPKGYALRH
GDWVLIDAKT GAVSQVPKWF DEANGYTSHS LPGELYNLKD DLAQRQNLYA ENPEKVAELK
ALLGKIQAQG QVR
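
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last blocks of the gene sequence. It assumes the amino acid assignments of NCBI translation table 11, which are the same as the standard code for elongation codons. The `translate` helper and the `CODON_TABLE` name are hypothetical and not part of the source record.

```python
from itertools import product

# Codon table in the NCBI ordering (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence shown above.
first_block = "ATGCCGCTCT ATCTAGCGAT GACGCTATTG"
last_block = "GCTCTACTCG GAAAAATTCA GGCCCAGGGC CAGGTTCGCT GA"

print(translate(first_block))  # MPLYLAMTLL
print(translate(last_block))   # ALLGKIQAQGQVR
```

Both outputs match the start and end of the protein sequence listed above. The last block can be translated on its own because the 1440 bases that precede it are a multiple of three, so it begins in frame.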