Gene Plim_3468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3468 
Symbol 
ID9140186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4486697 
End bp4488097 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content56% 
IMG OID 
Productsulfatase 
Protein accessionYP_003631480 
Protein GI296123702 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCTGG CTTGGGTCTC CCTGATCTCT TACGCGTTGG CTGCGCAGCC GAATGTGCTG 
TTGATTTGTG TCGATGATCT GCGGCCTGAA TTGGGTTGCT ATGGTTCGCG AAGTGTTTCG
ACCCCGCACA TCGATGCGCT GGCGAGTCGC AGCCGGTTGT TTACCCGGCA CTATGTGCAG
GCTCCGACCT GTGGGGCCTC CCGGTGTACG CTGCTCACCG GCCGGTATGG CCCGGCTGGA
AACAATGCAC TCTTTGAACT GGCCAAACGC CGCAAACAGG ATGGCGAAGC GGTACCACCT
TCCATGCCCG AATACTTTCG TGGAAAAGGG TACACGACGG TTTCGGTCGG AAAAGTTTCG
CATCATCCCG GTGGTCGCGG CGGAGCCAAC TGGGATGATG ACAACCTGAA CGAAATGCCC
GGCGCCTGGA CTCGACATCT CATGCCCACC GGCCTATGGA AGCATCCCCG GGGAGCCATG
CACGGATTGG CACATGGTGA AATTCGGGCA TCGGAAAAAG GGAAAATGGC GGTCTTTCAG
GCGACTGAAG GGACCGATGA CATTTATCCC GATGGTCTGA TTGTCGAAGA GTCACTCCGA
CAACTGGATG TGCTGACCAG CGAAGACAAA CCGTTCTTTC TGGCGGTGGG ATTGATTCGG
CCTCACCTGC CGTTTGGATC TCCTGCAAAA TACTTCGAGA AAGTGAGCCA ACTCCCCTTA
TTGCCCATTC CTCATCCGGA AAAGCCGACC TGGCCATCGA CGTGGCATGG CTCGAATGAG
TTGAGGCAAT ACCAGTTGTG GGAGAAAGAT CCTCTCAAAG ATCCATCATT TGCTGATGAA
ATTCGTCGGC ACTACTACGC CTGTGTGACC TATGCCGATG CGAATGTGGG CCGCCTGCTG
GAAAAGCTGG CTGCCACGAA AGGGGCCGAC AATACGATTA TCGTGCTGTG GGGAGACCAT
GGCTGGCATC TGCGTGAACA TGCGGTGTGG GGAAAGCATA CCTTGTTTGA AGAATCGTTG
AGATCGCCGC TGCTGATCTC GACCGGTCAG TTGAAACAAC CGGGGGAAGC GACGGAAGCC
GTCGTCGAAA CGATCGATAT TTTCCCCACG TTGTGCGAGC TGACCGGGCT GGAAAAGCCC
GCGTTTGTGC AGGGTGTGAG CCTGGTTCCC CAGTTGAATG ATCCGAACGC TACGGGACAT
GCCGCTTTCT CGTACTCGGG GAAAACCCGG ACGATACTGA CAGATCGCTA CCGCTTGATT
GCTCATCCCG ACAAAGCCGG AACGATGGAA TTGTTCGACC ATGCCGTTGA TGCGGGTGAA
ACAAGAAACA TCGCCGAGAC TCAACCTGAG ATTGTCAGCG CATTGCTGAA AGAACTCGAC
CAGCGACTTC CTCCTCGTTG A
 
Protein sequence
MVLAWVSLIS YALAAQPNVL LICVDDLRPE LGCYGSRSVS TPHIDALASR SRLFTRHYVQ 
APTCGASRCT LLTGRYGPAG NNALFELAKR RKQDGEAVPP SMPEYFRGKG YTTVSVGKVS
HHPGGRGGAN WDDDNLNEMP GAWTRHLMPT GLWKHPRGAM HGLAHGEIRA SEKGKMAVFQ
ATEGTDDIYP DGLIVEESLR QLDVLTSEDK PFFLAVGLIR PHLPFGSPAK YFEKVSQLPL
LPIPHPEKPT WPSTWHGSNE LRQYQLWEKD PLKDPSFADE IRRHYYACVT YADANVGRLL
EKLAATKGAD NTIIVLWGDH GWHLREHAVW GKHTLFEESL RSPLLISTGQ LKQPGEATEA
VVETIDIFPT LCELTGLEKP AFVQGVSLVP QLNDPNATGH AAFSYSGKTR TILTDRYRLI
AHPDKAGTME LFDHAVDAGE TRNIAETQPE IVSALLKELD QRLPPR