Gene Plim_1520 details

Gene Information

Locus tag: Plim_1520
Symbol:
ID: 9138220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1960491
End bp: 1962386
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: sulfatase
Protein accession: YP_003629552
Protein GI: 296121774
COG category:
COG ID:
TIGRFAM ID:
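
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1960491, 1962386)
print(length)                     # 1896
print(protein_length_aa(length))  # 631
```

Both values match the Gene Length and Protein Length fields of this record.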


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0242635
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAAGC ATCTGTTGCT ATCAGTACTG GTATTCCTCT GTGGAATTGC CGTGAACTTT 
GCTCAAGCCA GCGAAGCGAA TCCGCCTTCA GCTCCACGCC AGAACCGTCC CAATATCGTG
GTGATTCTTG CCGATGATCT GGGCTGGGCC GATCTCGGTT GTTACGGTAA TCCATTTCAT
AAAACTCCCC ACCTCGATCA ACTCGCCCGT GATGGGATCC GCTGCACACA GGCCTACGCG
GCCTGCCCTG TTTGCTCTCC CACGCGAGCG GCACTTCTCA CGGGGCAAAA CCCCGCCCGG
CTGCATCTCA CAGACTGGCT TCCCGGAAGA GGCAACCGTA ACGACCAGGC CTTGCGTGTC
CCCGAGATTC GAAACTCTTT GCCGCAAGGG ATTATGACTC TGCCGGGAGT TTTAAAGTCG
AATGGTTATC AGACTTGCAG TATTGGCAAA TGGCACCTCG GTGGTGGAGC ATCCGGCCCT
CTCCAACATG GTTTTCATGA GCAAATCGCA GGCGATGAAC GAGGTTCACC CGCCAGATGG
TTTGCACCTT TTGGTCCCCA AGCAGCGACC AATGGAGAAA AGGATCGACA AGGTAAGCCA
ATCCCTGGTC TGGAAGACAT CCCCGATGGA AAGTACCTCA CCGACGCTCT GGCGGACAAA
GCCGTTGCTT TCATTGAAAA ACAAACTGCC GAAAAGCCAT TCTTTCTTTA CCTGCCACAT
TTTGCTGTTC ACACACCAAT GAATGCTCCT GAGGAGACCA TCCAGAAGTT TCGAGACAAC
AAGCCGCCCG GCGTGGTTCG AAATGAGATC TACGCTGCCA TGCTCTATCA CCTCGACGCA
GCGGTCGGCA AAGTGATGAA CTCTCTGACT GAAAAGGGCT TCGCGAAGAA TACGATCGTG
GTCTTCACTT CTGATAATGG CGGTCTAGCG ACCATTGAAG GCAAGAACAC ACCGGCCACC
ATCAATGCTC CCCTTCGCGA AGGGAAAGGC TGGCTTTACG AAGGGGGGAT TCGTGTTCCG
CTCATTGTCA GTTTTCCCAA GCACATCCCG GATGGTTCGA CGACAGACGT TCCAATGACC
ACACTCGATC TGCTCCCCAG CCTTCTCTCT CTGGCAGGAA TCCAGTATCA GGTCGATGCG
AACTCCCCGC TTGACGGGAT GAATATCTCT GACATCTGGA CAGGGAATGC TACGCCCGAA
TTAAAGAAAG CCGCCTTTGA AAGGCCGCTT TACTGGCATT ACCCGCATTA CGCCAATCAG
GGTGGATTCC CCGGCGGCGT TATTCGCCAG GGCCCCTGGA AGTATATTGA GAATTATCAG
ACCGGTCGCA AAGAGCTGTT TCTCGTCGAT AAAGATCCGG GCGAAGGCCG AAATCGTGCT
CCCGACGAAC CCGAAAAAAT TACACAATTT GCAGCCCAAT TGGCAGCGTG GAAGCAATCG
ATAAGCGCTC AGGAAACAGT CCCCAACCCT GATTACATAC CGAATCCTCC GCATGCCAAA
ACAGGTGTGA TTTCGATCCC TGCCAAATCA GCTCAGGTCT ATGGAAGACA GCTTCGTTAC
GAACCATTAC CTCACAAGCA AACGGTCGGC TTCTGGGTCG AGAAAGACGA CTATGTCGTC
TTTCCATTGA CTGTCACGAA ACCCGGCAAC TATCGCTTGA GAGTATTCCA GGGTTGTGGC
AAAGGGAGTG GTGGTGCCAT CGTCGAGGCA AGGATTAAAG ATTCTTCGCT GACATTTGTC
GTCGAAGAGA CGGGCCACTT TCAGAACTTC GTCTGGCGAG ATGTCGGAGA ACTCAAAATC
GACGACATCG GCCAACAAGA AATCAGCCTC AAAGCAGTTT CAAAACCAGG CATCGCTGTC
GGAGATTTTC GCGCATTGGA ACTGATTCCT CAATAG
 
Protein sequence
MTKHLLLSVL VFLCGIAVNF AQASEANPPS APRQNRPNIV VILADDLGWA DLGCYGNPFH 
KTPHLDQLAR DGIRCTQAYA ACPVCSPTRA ALLTGQNPAR LHLTDWLPGR GNRNDQALRV
PEIRNSLPQG IMTLPGVLKS NGYQTCSIGK WHLGGGASGP LQHGFHEQIA GDERGSPARW
FAPFGPQAAT NGEKDRQGKP IPGLEDIPDG KYLTDALADK AVAFIEKQTA EKPFFLYLPH
FAVHTPMNAP EETIQKFRDN KPPGVVRNEI YAAMLYHLDA AVGKVMNSLT EKGFAKNTIV
VFTSDNGGLA TIEGKNTPAT INAPLREGKG WLYEGGIRVP LIVSFPKHIP DGSTTDVPMT
TLDLLPSLLS LAGIQYQVDA NSPLDGMNIS DIWTGNATPE LKKAAFERPL YWHYPHYANQ
GGFPGGVIRQ GPWKYIENYQ TGRKELFLVD KDPGEGRNRA PDEPEKITQF AAQLAAWKQS
ISAQETVPNP DYIPNPPHAK TGVISIPAKS AQVYGRQLRY EPLPHKQTVG FWVEKDDYVV
FPLTVTKPGN YRLRVFQGCG KGSGGAIVEA RIKDSSLTFV VEETGHFQNF VWRDVGELKI
DDIGQQEISL KAVSKPGIAV GDFRALELIP Q
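
Both sequences above are printed in groups of ten characters, six groups per line, so the spacing has to be stripped before the sequence can be used. The following is a minimal sketch of that cleanup plus a simple GC-fraction helper; the function names are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence listed above.
first_line = "ATGACGAAGC ATCTGTTGCT ATCAGTACTG GTATTCCTCT GTGGAATTGC CGTGAACTTT"
last_line = "GGAGATTTTC GCGCATTGGA ACTGATTCCT CAATAG"

gene_ends = clean_sequence(first_line + "\n" + last_line)
print(gene_ends[:3])   # ATG (start codon)
print(gene_ends[-3:])  # TAG (stop codon)
```

Passing the full cleaned gene sequence to `gc_fraction` should give a value comparable to the GC content field, subject to how the database rounds it.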