Gene Plim_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_0789 
Symbol 
ID9137472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp1024044 
End bp1025621 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content56% 
IMG OID 
Productsulfatase 
Protein accessionYP_003628833 
Protein GI296121055 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0797845 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCTCA AAGGAAGCCG GTGGCTTTCC ATGCTCGCTG TTGCGCTTGT TCTCGTGGCT 
CACACGGGAG TAGCGCTGGC CCAAACCAGG AAGCCCAACA TTCTGGTCAT CTGGGGCGAC
GATATCGGCA CCTGGAATAT CAGCCACAAC AGCCGCGGCA TGATGGGCTA CCAGACGCCG
AATATCGATC GACTGGGCAA AGAAGGACTG GCCTTCACCG ATTATTACGG TCAGCAAAGT
TGTACGGCGG GCCGCGCTGC TTTTCTGGGA GGCAACGTCC CTGTCCGCAC TGGCATGACC
AAGGTGGGTC TGCCCGGAGC GAAAGAAGGC TGGCAGAAGA CCGACGTGAC CATCGCGACC
GTCCTGAAGT CACAAGGCTA CGCGACAGGT CAGTTTGGTA AGAATCACCA GGGCGATCGC
GACGAGCATC TTCCCACAAT GCATGGCTTC GACGAGTTCT TCGGAAACCT CTACCACCTG
AACGCTCAGG AGGAACCGGA GAATGAGGAT TATCCCACCA ACATGAGGAT GGCCAATGGC
AAGACTTTCA TCGAGAACTA CGGGCCACGC GGCATTATTC GCAGCAAAGC CGATGGCAAG
GGTGGTCAGA CAATCGAGGA CACTGGCCCA TTGACCAAAA AGCGCATGGA GACCATCGAC
GAAGAAACCG TGGCGGCCGC TAAGGACTTT ATCACCCGCC AGAAAAATGC CGACCAGCCC
TTCTTCTGCT GGTGGAACGG CACACGAATG CACTTCCGCA CACATGTCAA GAAAGAGAAC
CGTCATCCGG GAAATGACGA ATACACCGAT GGCATGATCG AGCACGATGG TCACGTGGGC
GAATTGCTCA AACTGCTCGA TGAACTGGGG CTCGCCAAAG ACACGATCGT CATGTATTCC
ACGGATAACG GCCCGCACTA CAACACCTGG CCGGATGCGG GCACGACGCC TTTCCGCAGC
GAGAAAAACT CGAACTGGGA AGGTGCTTAC CGTGTCCCCT GTTTCGTGCG CTGGCCAGGC
CGGTTTCCGG CAGGCAAAAC ACTGAATGGC ATCGTCTCTC ACGAAGACTG GCTCCCCACA
CTGGCTGCCG CCGCTGGTGC CAGCGATATC AAGCAAAAGC TCGCACAAGG GGTCGAACTC
AACGGCCGCA AATACCGCAA CTATGTGGAT GGCTACAATC AGCTCGATTA CTTCGGCGGC
AAGACGGATC AATCGCCCCG GAACGAATTT ATCTATGTGA ATGACGACGG CCAGATTGTC
GCCCTGCGAT ACGATGCATG GAAGGCTGTG TTTCTTGAGA ACCGGGGGGA GGCATTTGGC
GTGTGGCGAG AACCCTTTAC CGAGCTGCGT GTGCCGCTGT TGTTCAATCT GCGCCGCGAT
CCCTTCGAGC GCTCACAGCA CAATTCGAAC ACCTATAACG ACTGGTTCCT CGACCGCGTT
TTTGTGATCA CGCCGATGCA GCAGATGGCG GGCAAGTTTC TGATGACGAT GAAGGAGTAT
CCACCCAGCC AGACACCCGG CTCATTCAAC CTGGAAAAAA TCCAGAAGAT GATCGAGGCC
GGTGCCAGCG GGAAGTAA
 
Protein sequence
MFLKGSRWLS MLAVALVLVA HTGVALAQTR KPNILVIWGD DIGTWNISHN SRGMMGYQTP 
NIDRLGKEGL AFTDYYGQQS CTAGRAAFLG GNVPVRTGMT KVGLPGAKEG WQKTDVTIAT
VLKSQGYATG QFGKNHQGDR DEHLPTMHGF DEFFGNLYHL NAQEEPENED YPTNMRMANG
KTFIENYGPR GIIRSKADGK GGQTIEDTGP LTKKRMETID EETVAAAKDF ITRQKNADQP
FFCWWNGTRM HFRTHVKKEN RHPGNDEYTD GMIEHDGHVG ELLKLLDELG LAKDTIVMYS
TDNGPHYNTW PDAGTTPFRS EKNSNWEGAY RVPCFVRWPG RFPAGKTLNG IVSHEDWLPT
LAAAAGASDI KQKLAQGVEL NGRKYRNYVD GYNQLDYFGG KTDQSPRNEF IYVNDDGQIV
ALRYDAWKAV FLENRGEAFG VWREPFTELR VPLLFNLRRD PFERSQHNSN TYNDWFLDRV
FVITPMQQMA GKFLMTMKEY PPSQTPGSFN LEKIQKMIEA GASGK