Gene Plim_1565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_1565 
Symbol 
ID9138265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2015483 
End bp2016844 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content58% 
IMG OID 
Productsulfatase 
Protein accessionYP_003629597 
Protein GI296121819 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.717098 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTGCG GCGATCTCCT CGCGGCCGAG ACTCCTGCTC AGCCGCTGAA CATTGTGGTG 
CTTTACGCCG ATGACTGGCG TTTTGATTCC CTCGGCTGCG CCGGCAACCC CATTATCCAG
ACACCCCACA TCGACGCCCT TGCGGCCAAG GGCGTTCGCT TTCCGCGTAA TGCCGTCACC
ACCTCCATCT GCGGCGTCTC CCGCGCCACG CTCCTCACTG GCCAATGGAT GTCGCGTCAC
GGCAACCCGG CCTTCGAGAT GTTCAAAACC CCCTGGGCCG AAACGTATCC CGCGATCCTT
CGTGAACGGG GCTATCACGT GGCCCATGTC GGCAAGTGGC ACAACGGCAA GTTCCCCGCC
GCCAACTACG ATTACAGCCG CATCTCAGCC ACGAGGCACT GGGTTCCCGC TCGGGGCGAG
GCCGGTAAGA AAGGCGAGAA GGTTCATATC ACCGCCCTGC AGGAACAGGA CGCCCTCGAC
TTCTTCGACA GCCGTTCCAA GGAAAAACCC TTCTGCCTGA CGGTCGCCTT CTTCGCCCCT
CACGCCGATG ACCCTTCACC CGCACAATAC TTGCCGCAAC CGAAAAGCAT GTCGCTGTAT
GTCAATGACA TCATTCCAGT ACCTGCGACA GCGAACGAGC AGGCTTTTCG AAATTTGCCA
CCCTTCCTGG CGAACGACAA GCAGGAGGGC CGCCGTCGCT GGTCGTTAAG GTTCTCGACG
GATGAAGCCT TCCAGACGTC GATGAAAAAC TACTACCGGC TCATCACCGA AGTCGATGCT
GCCTGCGGCC GCATTCTGGA TCGATTGAAT GCGGAGGGGT TGGCGGACAA CACGCTCATC
CTCTTCACCA CGGACAACGG CTATTTCCAC GCCGAAAAAG GCTTGGCTGA TAAGTGGTAT
CCTTACGAAG AAAGTATCCG CGTGCCGCTG GTAATCGTCG ATCCCCGGAT GGACAAATCG
CTGGCGGGAA TGACGAATAA TGCCCAGACA CTGAACGTCG ATCTCGCGCC GACGATCCTT
CGCGCCGCCG GTGCCCAGCC CACCCCGCGG ATGCAGGGCC AGGACATGTC ACCCCTCTAT
CTGGGAACGC CCGCTTCAAG ACAGGCAGCC GCCAAAAGTT GGCGAACCGA CTTCTTCTAC
GAGCATTCGG CTATCCGTGA CATTTCGTTT ATCCCTTCAT CGCAGGCTCT CGTGACGCCC
GAGTGGAAGT ACCTGTATTG GCCAGACTTT CAGCGGGAAG AACTCTTTCA TCTCACCACT
GATCCTCGCG AAGAACACGA CTTGGCCGGA GATGAAAAAT CTCTCGACAC TTTACGTGAT
CTCCGTGAGC GCTTCGCGAA GCTAAGAAAT CTCGCCAGGT AA
 
Protein sequence
MICGDLLAAE TPAQPLNIVV LYADDWRFDS LGCAGNPIIQ TPHIDALAAK GVRFPRNAVT 
TSICGVSRAT LLTGQWMSRH GNPAFEMFKT PWAETYPAIL RERGYHVAHV GKWHNGKFPA
ANYDYSRISA TRHWVPARGE AGKKGEKVHI TALQEQDALD FFDSRSKEKP FCLTVAFFAP
HADDPSPAQY LPQPKSMSLY VNDIIPVPAT ANEQAFRNLP PFLANDKQEG RRRWSLRFST
DEAFQTSMKN YYRLITEVDA ACGRILDRLN AEGLADNTLI LFTTDNGYFH AEKGLADKWY
PYEESIRVPL VIVDPRMDKS LAGMTNNAQT LNVDLAPTIL RAAGAQPTPR MQGQDMSPLY
LGTPASRQAA AKSWRTDFFY EHSAIRDISF IPSSQALVTP EWKYLYWPDF QREELFHLTT
DPREEHDLAG DEKSLDTLRD LRERFAKLRN LAR