Gene Plim_2379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2379 
Symbol 
ID9139090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp3101571 
End bp3102983 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content53% 
IMG OID 
Productsulfatase 
Protein accessionYP_003630404 
Protein GI296122626 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATATC CGATTTACTT TGCGCTGTTG TTGTTTATGG GCGCGCCGTT CTTCCCGGTT 
GAAGCGAAGG AAATGGCGGA CAAACCCAAT GTCCTGCTGA TCTTCATCGA CGATCTCGGC
AAAACCGACA TTGGCATTGA GGGCTCCTCG TTTTACGAAA CACCACGCAT CGATGCTCTC
GCAAAATCCG GGGCACGCTT TACACAGTTT TACTCGGCAC ATCCTGTCTG CTCGCCAACT
CGGGCCGCTT TGATGACTGG AAAAATGCCT CAGCGTTTGG GCATTACCGA CTGGATTCGC
CCCGAGAGCG ACGTCGCTCT GCCGCAATCC GAAGTCACCA TCGGGCAGGC TTTTCAGGAA
GCTGGCTATC ACACCGCGTA CCTTGGCAAA TGGCACCTCG GGCACAAACC ACAACAGCAT
CCTGCAGCCC GAGGCTTCGA TTGGACGAAA GGCGTCAATC ACGGTGGCCA GCCCTCCAGC
TATTATTTCC CGTACAAAAA TCCCCAGAAA CCCGATGCGC CGAATAACGT CCCCGATTTT
GAAAAATGCC AGCCAGAGGA CTACCTGACC GATGTCTTGA CCTCCAGTGC CATTGAGCAT
CTGCAGCAGC GCGATCGCAC ACGTCCGTTC TTTCTGTGTT TAGCTCATTA CGCAGTCCAT
ACACCCATTC AGCCACCTAA AAATCTGGTC GAAAAGTATC AGGTCAAATT GGCCACACAG
AAGAATCCAA AATCTCCAGG CGAGGGGATT CAAGAAGGTT CGGCCATCTC TCGCAGCCAG
CAGGATCATC CCGCATACGC AGCCATGGTC GAGAATCTCG ATACGCAGGT GGGCCGTCTG
CTCGATGAGC TCAAAACTCA AGGAATTCTG GATCAGACGA TTGTCGTCTT CACTTCAGAT
AATGGCGGTC TGTGTACGTT AAATGGTAAA TCGCCAGGGC CGACCTGCAA TCTTCCTTTA
CGAGCCGGCA AAGGCTGGAC TTATGAAGGG GGCATTCGCA TCCCCACGTA CATTTCCTGG
CCCGGGAAGA TCTCGCCTCA GGTGCTCGAT ATCCCAGCTT ACACTTGTGA TATTTATCCG
ACACTTTTAA GCCTGTGCCA GATACCACCC AGGCCCACTC AGCATGTCGA TGGAATCTCA
CTCGCCGGTT TGCTCACGAA GTCGTCAAGT TTGCCAGAGA GCGAACGAAC TCTCGTCTGG
TATTACCCTC ATACGCACGG CTCAGGCCAC AAACCCTCAG CCGCCATTCG ACAAGGCCCC
TGGAAGCTGA TTCATTTTCT CGAAACAGAC CGTATTGAAC TCTACCATCT CGAAGACGAT
CCTGGCGAAA GTCGCAACCT CGCATCGAAG CATCCCGAAC GAGCCCTCCA ACTTCAGAAG
GAGTTGCAGA AAATCATCGA GTCTTCCAGT TAA
 
Protein sequence
MRYPIYFALL LFMGAPFFPV EAKEMADKPN VLLIFIDDLG KTDIGIEGSS FYETPRIDAL 
AKSGARFTQF YSAHPVCSPT RAALMTGKMP QRLGITDWIR PESDVALPQS EVTIGQAFQE
AGYHTAYLGK WHLGHKPQQH PAARGFDWTK GVNHGGQPSS YYFPYKNPQK PDAPNNVPDF
EKCQPEDYLT DVLTSSAIEH LQQRDRTRPF FLCLAHYAVH TPIQPPKNLV EKYQVKLATQ
KNPKSPGEGI QEGSAISRSQ QDHPAYAAMV ENLDTQVGRL LDELKTQGIL DQTIVVFTSD
NGGLCTLNGK SPGPTCNLPL RAGKGWTYEG GIRIPTYISW PGKISPQVLD IPAYTCDIYP
TLLSLCQIPP RPTQHVDGIS LAGLLTKSSS LPESERTLVW YYPHTHGSGH KPSAAIRQGP
WKLIHFLETD RIELYHLEDD PGESRNLASK HPERALQLQK ELQKIIESSS