Gene Plim_3999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3999 
Symbol 
ID9140719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5133062 
End bp5134465 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content51% 
IMG OID 
Productsulfatase 
Protein accessionYP_003632009 
Protein GI296124231 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGAT CTTCCATTAT GAGACTGTGG TTGGTGCTGA TTCTGGCGTT TCAATTCACC 
AGTCAACTGG CTTTGGCTCA GAGGGCCACA ACAGAAACAA CCAGCGAACG TCGCCCCAAC
ATTTTATTGA TACTCTCTGA TGACTGCGGA CATGCCGAGT TTTCGATTCA AGGTCATCCC
CGTTACAAGA CCCCGCACAT TGATTCGATT GGCAAGAACG GTGTCCATTT TCGACAGGGA
TATGTCTCGG GATGTGTCTG CAGTCCATCG CGAGCGGGGT TATTGGCAGG ACGGTATCAA
CAGCGATTTG GGCATGAGTT CAATATCCCA CCAGCATATA GCGAGACAAA CGGCCTCCCA
CGATCAGAAA CTTTGCTCCC TCAACTTCTC AAGGAAGATG GCTATCGAAC GATTGCACTC
GGGAAATGGC ACCTGGGCTA TGCCCCACAG TTTCATCCCA TGGAACGGGG CTTTACTGAT
TACTACGGAT TTCTGCAGGG CTCGCGAAGC TATTTCCCTC TCAAGAAACC AACTCGTTTG
AATCAGATGC TGCGCGATCG GACTGCGATC CCTGAGGAAC AATTCGGCTA CATGACAGAT
CATCTGGCCG ATGAGGCCAT TGCCTATATC AAACAGTGGC AGTCTCAACC GTGGATGATG
TACCTGGCAT TCAATGCGAC TCATAGCCCC AACGATGCCA CGGCAGTCGA TTTGCAGGCG
GCTGATGGCA ACAAGATTTA TGCGATGACC ATCGCTCTCG ATCGCGCTGT CGGAAAAGTT
CTGGATGCCC TGAAGGAGTG CGGCCTGTCG AAAGATACTC TGGTGATCTT TATCAACGAT
AATGGCGGAG CAGGCGGGCA CGACAATGGT TCGCTACACG GGAAAAAAGG CTCAACCTGG
GAAGGAGGCA CAAGAATTCC TTTTCTCGTT CAATACCCTG CGAAGATTCC TTCCGGTCAA
GTGATCGATG AGCCTGTGAT TGCTCTCGAT CTCTTTCCCA CCATCCTCGA TGTGGCTGGT
CTTGGTGATG CTGAACTGAA GAAGATCCCG TTCGATCCTG AGAAGCTGGA TGGCATCAGC
CTGATTCCCA GAATGACGGG CAAAACCCAA CGACTGGTCG ATCGACCACT GTATTGGAAG
TCTGGAAAAC GATGGGCGAT TCGACAGGGA AACTTGAAAG CCGTCTCGGG CAACGATGAC
CAGGGTGATC AAGTTGAGTT ATTTGATCTC TCAAGTGATC CTGACGAGCA GCGAAACCTG
GCTGCGACAC ACCCCGACGA ACTTCAACAG CTCGAAGCAC TCTACCGCAA GTGGGAATCC
ACTCTCGAGA AACCCCGCTG GGGGTCATCG CCTGGTAAAA AAAGTGGCAG CGGTACCGAC
GAGAGTTCTT CCGATAATCC TTGA
 
Protein sequence
MRRSSIMRLW LVLILAFQFT SQLALAQRAT TETTSERRPN ILLILSDDCG HAEFSIQGHP 
RYKTPHIDSI GKNGVHFRQG YVSGCVCSPS RAGLLAGRYQ QRFGHEFNIP PAYSETNGLP
RSETLLPQLL KEDGYRTIAL GKWHLGYAPQ FHPMERGFTD YYGFLQGSRS YFPLKKPTRL
NQMLRDRTAI PEEQFGYMTD HLADEAIAYI KQWQSQPWMM YLAFNATHSP NDATAVDLQA
ADGNKIYAMT IALDRAVGKV LDALKECGLS KDTLVIFIND NGGAGGHDNG SLHGKKGSTW
EGGTRIPFLV QYPAKIPSGQ VIDEPVIALD LFPTILDVAG LGDAELKKIP FDPEKLDGIS
LIPRMTGKTQ RLVDRPLYWK SGKRWAIRQG NLKAVSGNDD QGDQVELFDL SSDPDEQRNL
AATHPDELQQ LEALYRKWES TLEKPRWGSS PGKKSGSGTD ESSSDNP