Gene Plim_0571 details

Gene Information

Locus tag: Plim_0571
Symbol:
ID: 9137249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 716499
End bp: 718052
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: sulfatase
Protein accession: YP_003628618
Protein GI: 296120840
COG category:
COG ID:
TIGRFAM ID:
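
The coordinate and length fields above can be cross-checked against each other. The short Python sketch below does that arithmetic; it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 716499
end_bp = 718052
gene_length_bp = 1554
protein_length_aa = 517

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues)
```

Under these assumptions the span matches the listed gene length, and the residue count matches the listed protein length.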


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTATTC TGGCAAGAGA ATCATTTGCT TCAGTGGAAA ATGTCGTGTC GCAAGAGAAC 
TGTTTATGGC GTTGGATCAA GTTCCTGTGT TTAGGCTTAA TCGCTGTTAA CTCGCAGGCT
GTCCGAGCAG AGGATGGGAA AAATCCCACT CGTCCAAATG TGATTTTCAT CATGGCGGAT
GATCTGGGGT ATACCGATCT GGGTTGCTTT GGCAGCCGAT ATTATGAAAC GCCTCACATC
GATCAACTGG CCCGGCAAGG CGTCCGTTTC CTGAATCACC ATCACTGTCA GAATTGTGCA
CCCACTCGGG CCGCCATTAT GACCGGGCAA TATGGGCCCC GAACGGGTGT TTACACCGTG
GGAAGCATTG ACCGGTTCGA CTGGCAAAGC CGACCTCTGC GGCCTGTCGA GAATGTGGAG
AAGTTGCCTC TCGACCCGGC AACTATTGCA CAGCAGGTGC AGCGCAGCGG CTACCAAACC
GGAATGTTCG GGAAGTGGCA CCTGGGGCTG AATGGCTCTT ACCATCCGTC GCAACGCGGC
TTTCATGAAG CGGTCGAATC TTCTGGCCAG CACTTCAATT TCAAAACCAC TCCAGCGCAG
AAAGATACCG AAGGAAAGTA TCTGGCCGAC TACCTGACAG ATCGCGCCAT CGACTTTATC
GAGCGGCATC AGTCGGAACC ATTTTTTCTA TACTTGCCCC ATTTTGGTGT GCATTCACCA
TTTCAGGCGA AGAAGGAATG GATCGAGAAG TTTGAAAACA AGCCGGGCGT GGGTGGCCAT
AGGAACCCGG TGTATGCCGC GATGATCGCC AGTGTGGATG AAAGTGTCGG GCGAATCCTC
GATAAGCTGG ACGAACTCAA ACTCGCCGAC AAGACCGTGG TGATCTTTGC CAGTGATAAT
GGTGGAGTGG GAGGCTACGA ACGGGAAGGT TTGCACAAAG CCAATGATGT GACCGATAAC
GCACCATTGC GAAGTGGTAA AGGGAGCCTG TACGAAGGTG GAACTCGTGT ACCTTTAATT
GTGCGCTGGC CTGGCGTGGC ACCCGCAGGG GTTGAGTGCC GCACGCCGAC AATCCATGTC
GATCTTTATC CGACGTTCCT GGAGATCACT TCGGCTGAGC ACCCCGGGCA TCCCCTCGAT
GGTGAGAGTC TGGTGAAGTT GATCAAAGAC CCCGGGGCAA AGCTCCATCG AGAGGCCATT
TTCCAGCACT TTCCCGGTTA TCTGGGTTCG GGTCAGAACC AGTGGCGAAC CACACCTGTG
AGTTTGATTC AAAGTGGTGA CTGGAAGCTG ATGGAGTTTC TGGAAGATGG GCGACTGGAA
CTTTACAACC TCACCAGCGA TGTGGGGGAA CAAAAGAATC TGGCAGCCAC ATATCCTGAC
AAGGTGAACG AATTGCAGTC CAGGCTCAAA GCGTGGCGCG AGGAGATTAA AGCCCCTATG
CCTGCGAAAA ATGCTTCACC TTCCGAGAGC AACAATGGTG TGAGAAAGGG AAGAGCCAGT
CAGAAAAAAT CACAGAAGGG AAAAGCCAAA GCAGCTGCTT CCTCCGAAGA TTGA
 
Protein sequence
MFILARESFA SVENVVSQEN CLWRWIKFLC LGLIAVNSQA VRAEDGKNPT RPNVIFIMAD 
DLGYTDLGCF GSRYYETPHI DQLARQGVRF LNHHHCQNCA PTRAAIMTGQ YGPRTGVYTV
GSIDRFDWQS RPLRPVENVE KLPLDPATIA QQVQRSGYQT GMFGKWHLGL NGSYHPSQRG
FHEAVESSGQ HFNFKTTPAQ KDTEGKYLAD YLTDRAIDFI ERHQSEPFFL YLPHFGVHSP
FQAKKEWIEK FENKPGVGGH RNPVYAAMIA SVDESVGRIL DKLDELKLAD KTVVIFASDN
GGVGGYEREG LHKANDVTDN APLRSGKGSL YEGGTRVPLI VRWPGVAPAG VECRTPTIHV
DLYPTFLEIT SAEHPGHPLD GESLVKLIKD PGAKLHREAI FQHFPGYLGS GQNQWRTTPV
SLIQSGDWKL MEFLEDGRLE LYNLTSDVGE QKNLAATYPD KVNELQSRLK AWREEIKAPM
PAKNASPSES NNGVRKGRAS QKKSQKGKAK AAASSED
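
The gene and protein sequences are both printed in blocks of ten characters, so the spaces must be removed before the two can be compared. The Python sketch below joins the first three blocks of the gene sequence and translates them with a deliberately partial codon table, which holds only the codons that occur in this fragment. These assignments are the same in the standard genetic code and in translation table 11. The names `blocks` and `CODONS` are illustrative, and a full translation would need a complete table.

```python
# First three 10-base blocks of the gene sequence, as printed in the record.
blocks = "ATGTTTATTC TGGCAAGAGA ATCATTTGCT"

# Partial codon table: only the codons present in this fragment.
CODONS = {
    "ATG": "M", "TTT": "F", "ATT": "I", "CTG": "L", "GCA": "A",
    "AGA": "R", "GAA": "E", "TCA": "S", "GCT": "A",
}

# Drop the spacing between blocks, then read the DNA three bases at a time.
dna = "".join(blocks.split())
protein = "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))

print(protein)
```

The result corresponds to the first block of the protein sequence listed above.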