Gene Plim_2372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2372 
Symbol 
ID9139083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp3092262 
End bp3094625 
Gene Length2364 bp 
Protein Length787 aa 
Translation table11 
GC content59% 
IMG OID 
Productsulfatase 
Protein accessionYP_003630397 
Protein GI296122619 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.973835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATCA GATCGCTATT GGCCGTTCTC AGCCTCGCGC TGGGAGTCGG GTTCGCGTCA 
ACGGCAGACT CTCAAGAGAT TTTGCCTTTC CCCCCCGCGC CGTCCGGCTC CACTCCCGGG
CTGACGATCC AGGATTCCAC CTACAAGAAA CGCGTCGAGC CGAAGCGGCT GGCCGAAAAT
GTGCCGAACA TCCTCATCAT CCTTATGGAC GATGTGGGCC CGGGGACAGC CTCAACCTAT
GGCGGTGAGA TCAACACGCC GACGCTTGAA CGCGTGGCGA AGCTGGGTGT TTCGTTCAAT
CGCTTCCACT CCACGGCCAT GTGCTCGCCT ACGCGGGCGG CACTGCTCAC CGGGCGCAAT
CACACCCGCG TCGGCAACGG TCAGATCGCG GCCATTGCCA ACGACTTCGA CGGTTTCAAC
GGCACAATCC CGAAGTCCTC TGCCACAGTC GCCGAAGTAC TCAAGAACTA TGGCTATAAC
ACAGGTGCCT GGGGGAAATG GCACAACACA CCGGAGGAAC AGATCACCTC CAAAGGACCG
TTCGAGTATT GGCCCACCGG CTATGGCTTC GAGTATTTCT ATGGTTTTCT CGCGGGCGAG
GCTTCGCAGT ACGAGCCGAC GCTGACCCGC AATACTTCGC CAGTTACCGA GCATCTCCCC
CAAGGCTACC ACTTCACGGA TGACATCGCG CAGGACGCGA TCACCTGGCT GCGGGAACAG
AAGGCGTATG CGCCGGACAA GCCGTTCTTC ATGTACTGGG CGCCCGGTGC CTCGCACGGT
CCGCATCAGG TGATGAAGGA GTGGGCTGAC AAGTACAAGG GGAAGTTCGA TGACGGCTGG
GACAAGTATC GAGAGCGCGT CTTCGCGAGG GCCAAGGCCA AGGGCTGGAT TCCGCAGACG
GCACAACTCA CGCCGCGCCC AGAGTCGATG GCTTCGTGGG ATTCGATCCC CGATGAGGAA
AAGCCGTTCC AGCGTCGATT GATGGAAGTC TTCGCCGGTT TCACTGAGCA CGCCGACTTC
AATGCAGGCC GCGTGATTGA TGAAATCGAA CGCCAGGGCA AGCTCGATAA CACGCTTATC
TTCTACATTT GGGGCGATAA CGGCTCTTCC GCTGAAGGGC TGTATGGCAC GATCAGCGAG
CAACTCGCAC AGAATGGTAT CCCGACCAAG ATTTCACAGC ACCTTGAGGC GCTGGAGGAA
CTCGGTGGTC TCGACGCGCT GGGCGGCCCG AAAACCGACA ACATGTATCA CGCGGGCTGG
GCCTGGGCGG GCAGCACTCC CTACAAATCC ACCAAGCTCG TTGGTGCGCA CTTCGGAGGC
ACCCGACAGC CGATGGCCGT CGCCTGGCCG AAGCGCATAA AGGCGGACCC GACAGCGCGA
TCACAGTTCC ACCACGTCAT CGACATTGTG CCGACGATTT ACGAACTGAC CAGGATCACG
CCGCCAAAAG TTGTGAACGG GTTCGAGCAA GATTCGATTG ACGGAGTCAG CATGGCCTAT
GCATTGGGCG ACGCCCAGGC ACCGGGAACG CGGAAGACAC AGTTCTTCGA CATCATGGCC
AGCCGCGGTA TTTACCACGA CGGTTGGTTT GCCAGCGCAC CGGGACCGCG TGAACCCTGG
GTGGGTGGGC TCCCAAAGGG AATCAAGGAG TGGTCACCAC TGACGGACAA GTGGGAGCTT
TACAACCTTG ATGAAGACTG GAGCCAGGCG AACGATCTCG CCGCAACGAA TCCGAAGAAA
CTCGAAGAGC TGAAACTGCT CTTCTTGCTC GAGTCCACGA AGAACAAGAA TCTGCCCATC
GGTGGTGGCT TGTGGTCCAC CGCGCTGTTC CATCCCGAAG ACTCGCCTGC TTCCGCGCTC
ACGGAATGGA CATTCGATGC CCCGATCACC CGGATGCCAG AGTCCGCTGC GCCCAAGCTC
GGCAAGCAGG ACAGCCTGGT GAGCATGGAG GTGGACGTGC CAGAAAATGC AAACGGCGTG
CTCTATGCCC TGGCAGGCTT TTCCGGCGGC ATCACCTGCT ACCTGAAAGA CGGCTTCCTG
TGCTACGAGT TCAATCTGTT TGAGATTCAG CGCACCAAGC TCAAGTCCAA AGACAAACTC
CCGACGGGCA AGGTGAAAAT TGAAGTCGAA TCCAAGTTGG CGGCCAAAAT CGGCGGGCCG
ATGGATGTCA CGCTCAGGGT GAATGGCAAA GACGTGGCAC AAGGCCGTGT GCCAGCCGCG
ATGTCCCTGC ACTTCACGTC GAATGCGACT TTCGACATCG GCGCAGACTT GGACTCCCCG
GTCTCGCTCG ATTACTTCGA CCAGGCACCG TTCAAGTTCA ATGGCACGAT CGGCGCCACG
AAGATCGCCT ATCCCAAGAA ATAG
 
Protein sequence
MNIRSLLAVL SLALGVGFAS TADSQEILPF PPAPSGSTPG LTIQDSTYKK RVEPKRLAEN 
VPNILIILMD DVGPGTASTY GGEINTPTLE RVAKLGVSFN RFHSTAMCSP TRAALLTGRN
HTRVGNGQIA AIANDFDGFN GTIPKSSATV AEVLKNYGYN TGAWGKWHNT PEEQITSKGP
FEYWPTGYGF EYFYGFLAGE ASQYEPTLTR NTSPVTEHLP QGYHFTDDIA QDAITWLREQ
KAYAPDKPFF MYWAPGASHG PHQVMKEWAD KYKGKFDDGW DKYRERVFAR AKAKGWIPQT
AQLTPRPESM ASWDSIPDEE KPFQRRLMEV FAGFTEHADF NAGRVIDEIE RQGKLDNTLI
FYIWGDNGSS AEGLYGTISE QLAQNGIPTK ISQHLEALEE LGGLDALGGP KTDNMYHAGW
AWAGSTPYKS TKLVGAHFGG TRQPMAVAWP KRIKADPTAR SQFHHVIDIV PTIYELTRIT
PPKVVNGFEQ DSIDGVSMAY ALGDAQAPGT RKTQFFDIMA SRGIYHDGWF ASAPGPREPW
VGGLPKGIKE WSPLTDKWEL YNLDEDWSQA NDLAATNPKK LEELKLLFLL ESTKNKNLPI
GGGLWSTALF HPEDSPASAL TEWTFDAPIT RMPESAAPKL GKQDSLVSME VDVPENANGV
LYALAGFSGG ITCYLKDGFL CYEFNLFEIQ RTKLKSKDKL PTGKVKIEVE SKLAAKIGGP
MDVTLRVNGK DVAQGRVPAA MSLHFTSNAT FDIGADLDSP VSLDYFDQAP FKFNGTIGAT
KIAYPKK