Gene Plim_1647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_1647 
Symbol 
ID9138348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2125845 
End bp2127617 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content51% 
IMG OID 
Productsulfatase 
Protein accessionYP_003629678 
Protein GI296121900 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.425588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTTTGC AGGGCCTGCT TGCCGGTAAC GACCGGTTCG CAGCACGGTC CTCTGGCCAG 
GGCCATGTCC AGCAAATCGT TAACCAGCGA GACCTGCTGC ATGGAATCTC GAACCTTTTC
ATGGAAAGTC TTTGCACGAT GAAACAACTT CACATGAGCC TGCTGACGTT GGCCCTGGGA
TGGATGCTGT CTGGAATGGC GATGGCGGCG GACAAGCCCA ATATTCTGCT GATCATTTCG
GACGATACGG GATATGGCGA TCTCGGGCCT TATGGGGGAG GAGAAGGCCG CGGGATGCCA
ACACCGAATA TCGATCGGCT GGCAGCAAAT GGTATGACCT TTTTCTCGTT TTATGCTCAG
CCCAGTTGCA CGCCAGGGCG TGCGGCCATT GTTACAGGTC GCATTCCTAA TCGGAGTGGA
ATGACGACAG TGGCTTTTCA GGGTGAAGGT GGCGGACTTC CTAAAGCCGA GTGGACGCTG
GGTTCTGTAC TCAAACAGGG TGGCTACAAA ACATACTTCA CGGGCAAATG GCATCTGGGG
GAAGCCGATT ACGCTCTGCC GAATGCTCAT GGCTATGACG TGATGAAGCA TTGCTTTCTT
TACCATTGCA ATGCCTACAC CTATGGTGAT CCCACCTGGT TCCCGGATAT GGATCCATCG
CTGAGGGAAC TCTTCGGAAA GATCACCAAA GGTTCGCTTT CAGGGAATGC TGGCGAAAAA
GCTCGCGAAG ACTGGAAGGT CAATGGCCAG TATGTCAACA CGCCGGACAA AGGCGTCGTC
GGGATTCCAT TTCTGGATCA GTACATTGAG CAGTCGGGCA TCAGTTTTCT GGAAGATGCC
GCCAAAACTC CCAATCAGCC CTTCTTTGCA CATATCAACT TCATGAAGGT GCATCAGCCG
AATCTTCCAG CACCGGAGTT TGTGCATAAG TCACCTTCGA AGTCGAAGTA TGCCGATTCT
GTTGTTGAGC TCGATACGCG CATTGGCCGT GTGGTCGATA AGTTGAAAGA ATTAAAGCTC
GATCAGAATA CGTTGATTTT CTACACGACT GATAACGGCG CCTGGCAGGA TGTCTACCCT
GATGCCGGGT ACACACCATT TCGCGGAACC AAGGGGACTG TTCGCGAGGG TGGGAATCGT
GTGCCTGCGA TTGCCGTCTG GCCTGGAAAA ATTAAGCCTG CAGTACGAAA TCACGATATC
GTGGGTGGTC TGGACTTGAT GGCGACATTT GCCTCAGTTG CCGGGGTACA GCTACCAACG
AAAGATCGTG AAGGCCAACC TATGATTTTT GATAGCTATG ACCTGACTCC GGTCCTCACT
GGGAGTGGTA AATGTCCAAG AAACTCGTGG TTCTACTTCA CCGAAACGGA ACTGAGCCCC
GGCGCTATTC GAATTGGGAA CTATAAGGTC GTATTCAACC TGAGGGGAGA TGATGGCCAA
CCAACGGGAG GACTGAGTGT TGATTCAAAC CTGGGCTGGA AAGGGCCCGA AAAATATGTT
GCCACAGTTC CACAGGTATT TGACCTCTGG CAGGATCCTC AAGAACGCTA CGATATTTTT
ATGAATAACT ACACAGAACG TACATGGATT CTGGTGTCGT TCAATGTGGC CATCAAGGAT
CTGATGAAAA CTTATCTCAC CTATCCACCT CGAAAAATGC AAAGCGAGGC CTATACAGGC
CCCATTACGA TTCCTGAATT TGAGCGACTG AAACATGTTC GCGAACTGCT CGAAAAAGAA
GGCATTCGAA TTCCTTTGCC CACAGGCAAC TGA
 
Protein sequence
MCLQGLLAGN DRFAARSSGQ GHVQQIVNQR DLLHGISNLF MESLCTMKQL HMSLLTLALG 
WMLSGMAMAA DKPNILLIIS DDTGYGDLGP YGGGEGRGMP TPNIDRLAAN GMTFFSFYAQ
PSCTPGRAAI VTGRIPNRSG MTTVAFQGEG GGLPKAEWTL GSVLKQGGYK TYFTGKWHLG
EADYALPNAH GYDVMKHCFL YHCNAYTYGD PTWFPDMDPS LRELFGKITK GSLSGNAGEK
AREDWKVNGQ YVNTPDKGVV GIPFLDQYIE QSGISFLEDA AKTPNQPFFA HINFMKVHQP
NLPAPEFVHK SPSKSKYADS VVELDTRIGR VVDKLKELKL DQNTLIFYTT DNGAWQDVYP
DAGYTPFRGT KGTVREGGNR VPAIAVWPGK IKPAVRNHDI VGGLDLMATF ASVAGVQLPT
KDREGQPMIF DSYDLTPVLT GSGKCPRNSW FYFTETELSP GAIRIGNYKV VFNLRGDDGQ
PTGGLSVDSN LGWKGPEKYV ATVPQVFDLW QDPQERYDIF MNNYTERTWI LVSFNVAIKD
LMKTYLTYPP RKMQSEAYTG PITIPEFERL KHVRELLEKE GIRIPLPTGN