Gene Plim_1879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_1879 
Symbol 
ID9138581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2452948 
End bp2454336 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content56% 
IMG OID 
Productsulfatase 
Protein accessionYP_003629908 
Protein GI296122130 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTTC TATTCGCGAC CTGTCTGCTA TTGCTGCCGA CCATTGGCGT CGCAGCCGAA 
CGTCCCAACA TTGTCTTCAT GCTTTCGGAT GATCAAGCCT GGAACGGTCT ATCGGTGGCG
ATGCACCCCC AGCTAGCGGG CTCTAAGGGA GATATTTTTC ATACGCCAAA TCTGGAGAAA
CTTGCGGTTC AAGGGATGCG GTTCTCAGCT GGATATGCGC CAGCTTCGGT CTGTTCTCCA
ACCCGCATCA GTTTGATGAC CGGAAAAAGT CCAGCCGCGC TGCACTGGAC AAAGGCCGCT
CCACCGGAGA CCGGGCACAA ATTGATAGAG CCACGAAACA TCCGCAGCAT CCCGGCGAAT
GAAACTACGA TTGGTGATGT CCTGCGCCAG GCAGGTTACG CGACCGCTCA CTACGGCAAA
TGGCATATTG GCGGCGGCGG TCCAGAACAG CACGGTTTTG ACAAATCGGA TGGTGACACC
GGCAACGAGA ACGCCTATCA GTTCAAAGAC CCGAACCCCG TCGATATCTT CGGCATGGCC
GATCGCGCGG CCGCGTTTAT GGATAGAAGT TCCAAGGCCA AGAAGCCGTT CTTCATTCAA
CTCTCCTGGA ACGCATTGCA CGCCTCGGAG AATGCGAATC AGGCCACACT TGCCAAATAC
GAGCGGCAAC TCAAAGGCGA GAACCGAAAA CGCATCACCA CAGCGGCGAT TACGGAAGAC
CTTGATACGG GGGTGGGCCG TGTCCTGGAG GCTATTGACC AACTCGGCCT GACCGAAACA
ACCTACGTGA TCTACATGGC TGATAACGGT GCTGGCGGTG GCAAAAAAGT TCTGGCCGGC
GGTAAAGGGG GAGTGTGGGA AGGAGGGATT CGTGTTCCCT TCATCGTGCG TGGCCCAGGC
GTAAAGCCGA ACTCGTGGTG TCACACTCGA GTGGTCGGTT ACGACCTCTT TCCCACCTTC
TGCGAGTGGG CGGGGATCGC TCCCGGCAAG CTGCCGAAGG GAATCGAAGG AGGCAGTATC
GCTTCACTGC TCAAGACCGA AGGTCGGGGA GACGTCAAGC GTTCGCGAGA GGAACTTGTC
TTTCACTTTC CACACTATCA GGGGGATGCA CCGCACTCGG CGATCTTCCT TGGTGACCTG
AAACTGTTGC ACTTCTACGA AGACAACCGC GACGAGTTGT ACGACCTCTC CAAAGACATC
GGCGAGCGAG ATGACCTCGC AGGACAGCGC CCTGCCGAGA CGAAAAAGCT CCGTGAGCGT
CTCGACAAAT ACCTTGCCCA AGTCGATGCG CAGTTCCCGA CACTGAACCC GAACTTCGAC
CCCAATCAGC CAGTTGAACC GAAAAAACGT GGTGGGAAGA ACAAACCCGG GAAACCCGCA
ACGAAATGA
 
Protein sequence
MRVLFATCLL LLPTIGVAAE RPNIVFMLSD DQAWNGLSVA MHPQLAGSKG DIFHTPNLEK 
LAVQGMRFSA GYAPASVCSP TRISLMTGKS PAALHWTKAA PPETGHKLIE PRNIRSIPAN
ETTIGDVLRQ AGYATAHYGK WHIGGGGPEQ HGFDKSDGDT GNENAYQFKD PNPVDIFGMA
DRAAAFMDRS SKAKKPFFIQ LSWNALHASE NANQATLAKY ERQLKGENRK RITTAAITED
LDTGVGRVLE AIDQLGLTET TYVIYMADNG AGGGKKVLAG GKGGVWEGGI RVPFIVRGPG
VKPNSWCHTR VVGYDLFPTF CEWAGIAPGK LPKGIEGGSI ASLLKTEGRG DVKRSREELV
FHFPHYQGDA PHSAIFLGDL KLLHFYEDNR DELYDLSKDI GERDDLAGQR PAETKKLRER
LDKYLAQVDA QFPTLNPNFD PNQPVEPKKR GGKNKPGKPA TK