Gene Plim_2371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2371 
Symbol 
ID9139082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp3090700 
End bp3092232 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content58% 
IMG OID 
Productsulfatase 
Protein accessionYP_003630396 
Protein GI296122618 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.852311 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGCTT CTCGATTTTG GCTGACATGC CTCTTCATTG CCGGATTCTG CTTATCCGCG 
TCCGCCGCAG ACAAGCCAAA TATTCTCGTG ATCATGGCGG ACGACGTCGG ATGGATGAAC
GTCTCTTCCT ACGGCGGCGA CATCATGGGT ATTCGCACGC CGAACATAGA CCGCATCGGC
CAGGAGGGCA TTCGCTTCAC GTCGTTCTAT GCGCAACCAA GTTGTACAGC CGGACGAGCG
GCATTCCTGA CCGGCCAGCT TCCTGTGCGG ACTGGATTGA CAACCGTCGG CACACCCGGT
TCCCCGGCCG GATTGCAAAA GGAAGACATC ACGCTCGCGG AGATTCTCAA GACGAAGGGC
TATTCGACTG CCCAGTTCGG CAAGAATCAC CTCGGTGACC TCGAAGAGCA TCTGCCACAC
CGGCACGGCT TCGACGAGTA CTTCGGCAAC CTTTACCATC TCAACGGCAA TGAGGATCTG
GAAGATCCGG ACCGGCCGAC CGATCCCGAG TTCCGCAAGA AGTTCGATCC GCGCGGAGTC
GTTTCCGGCA CGGCCGATGG CCCGACGAAG GATGAGGGTC CATTGACGAC CAAGCGGATG
GAAACCTTTG ACGATGAGAT CGTCGCCAAA TCACTCGATT TTCTCGACCG CAAGGCCAAG
GATCAGAAAC CATTTTTCCT CTGGCACTGC TCCGCCCGCC TGCATGTGTT CTTTCACTTC
AAAGAAGGCG TTCGCGGGAA GTCGCGAGCC GGTCGCGAAG ATGTTTACGG CGATGCTCTT
GCCGAACACG ACGGCCATAT CGGTCAACTT CTGGCGAAGC TGGAGGCCAC AGGCCTCGAT
AAGAACACCA TCGTCGTTTA TGTGACCGAT AACGGGGCCT ATCAATACAT GTGGCCCGAA
GGCGGCACCA GTCCTTTCCG CGGCGATAAG GGGACGACTT GGGAGGGCGG CGTTCGCGCC
CCGTGCATGG TCCGCTGGCC CGGAGCGGTC GGTGGTCGCG TTAGCAGCGA GATCGTGGAC
ATGACGGATC TCTTGCCCAC ACTGGCATCT GCCGCTGGCG AGACTGACGC CGTCGAAAAG
CTGAAAAAGG GTGCCGACTA CGGCGGCAAG AACTACAAGG TGCATCTCGA TGGCTACGAT
CAGACGGCCC TCTTTACCGG CAAGAGCGAC AAGTCCGCTC GCAAATTCGT CTTCTACTAC
GACGAGACCG TGCTCACAGC CATCCGCTAC GAATCATTCA AGGTTACCTT TTCCATCAAG
GAGGGCGGAC ATTGGGATGA CCCGCTGGTC GGCCTCGGCC GACCGATGAT CACCAACCTG
CGGATGGACC CCTTTGAGCG ACAAACCGGA GACGTGAACC GCCAGTATGC GGAACACAAG
ACCTGGGTGC TCACGCCGAT CGTTGGCATC GCGGAGAAAC ACTTGACGAC CTTTCGTGAC
TTTCCCGTCC GCCAGCTTGG GCTGAGTGCC CAGATGGCGA AGACGCTCGA AGGTATCCAG
TCGCAGATCT TGAAACTCAA GCCGAATAAC TAA
 
Protein sequence
MMASRFWLTC LFIAGFCLSA SAADKPNILV IMADDVGWMN VSSYGGDIMG IRTPNIDRIG 
QEGIRFTSFY AQPSCTAGRA AFLTGQLPVR TGLTTVGTPG SPAGLQKEDI TLAEILKTKG
YSTAQFGKNH LGDLEEHLPH RHGFDEYFGN LYHLNGNEDL EDPDRPTDPE FRKKFDPRGV
VSGTADGPTK DEGPLTTKRM ETFDDEIVAK SLDFLDRKAK DQKPFFLWHC SARLHVFFHF
KEGVRGKSRA GREDVYGDAL AEHDGHIGQL LAKLEATGLD KNTIVVYVTD NGAYQYMWPE
GGTSPFRGDK GTTWEGGVRA PCMVRWPGAV GGRVSSEIVD MTDLLPTLAS AAGETDAVEK
LKKGADYGGK NYKVHLDGYD QTALFTGKSD KSARKFVFYY DETVLTAIRY ESFKVTFSIK
EGGHWDDPLV GLGRPMITNL RMDPFERQTG DVNRQYAEHK TWVLTPIVGI AEKHLTTFRD
FPVRQLGLSA QMAKTLEGIQ SQILKLKPNN