Gene Plim_3913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3913 
Symbol 
ID9140631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5028260 
End bp5029828 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content56% 
IMG OID 
Productsulfatase 
Protein accessionYP_003631923 
Protein GI296124145 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.656698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCATC GCACAACAAG AGGCTGGTTC TGTTCTTGTC TGGCAAGTCT CTGCACGCTG 
GTGCTGTTAA ATACTGATCA ATCGATCGTT GCCGCTGAGC AACCGAACAT CCTGCTGATT
CTGGCCGACG ATCTCGGCTA TGGCGATCTT CGCTGCTACA ACAGCCAGTC GAAAGTATCG
ACATCACATA TAGACCGGCT TGCCCGCGAA GGGATGAGGT TCACTGATGC GCATAGCCCC
AGCACAGTCT GCACTCCGAC TCGCTATGGA TTGATGACGG GACAGATGCC ATTTCGAGCA
CCCAGCGGCG GCACAGTCTT CACCGGAGTT GGCGGGCCGT CGCTGATTGC TCCGGGCCGA
CTGACGTTGC CGATGATGCT CCGCGAGCGG GGCTACTCAA CAGCGTGTGT CGGCAAGTGG
CATATCGGGC TGACATTCTT CGACCGTGAA GGTCGGCCCA TTCACAGTAA TGCCCTTGAA
GCCGTCAGGC AGGTCGATTT CAGTCGCCGG ATCGACGGCG GCCCGGTCGA TCATGGCTTT
GATTCATTCT TTGGCACAGC CTGTTGTCCC ACGACCGACT GGTTATATGC CTTCATCGAG
AATGATCGCG TCCCTGTGCC CCCCACCGCA TCACTCGAAA AGTCAGCACT TCCCAAACAC
CCTTATGCCC ATGACTGCCG GCCCGGACTC ATCGCATCAG ACTTCGCCAT GGAAGAGATC
GATCTGATCT TTCTGGAGAA GAGTCGTCAG TTTCTCAATC AGCATGTTCG CCAAAATCCC
GGCAAACCAT TCTTCCTCTT CCATTCAACA CAGGCGGTTC ATCTTCCCTC ATTTGCTGCC
AAACAGTTTC AGGGGAAGTC AGAGGCCGGG CCACATGGCG ACTTTCTCCT CGAACTCGAC
TACATCGTTG GCGAACTCAT GAAGTCCTTG GAAGAGCTTC ACATCGCTGA GAATACACTG
GTCATCTTCA CGAGTGACAA CGGCCCCGAA GTGACGAGCG TTATTCACAT GCGAAGCGAC
CATGGTCATG ATGGTGCGCG CCCCTGGCGG GGAATGAAGC GAGACGCCTG GGAAGGAGGA
CATCGCGTCC CATTCATCGT GCGGTGGCCA GGTAAAGTCA GACCTGGCAC TACTAATTCG
CAACTGACGA GTCTGACCGA TGTGATGGCG ACAGTGGCCG CGATTGTGGA TACTCAACTC
CCCGACCATG CCGCTGAAGA CAGCTTCAAC ATGCTCCCGG CATGGCTTGA TGAAAGTGCG
CCGCCGATTC GGCCCTACCT GCTGACACAA TCCTTCGGCG GATCGCGCAC TCTCGCGATC
CGGCAGGGCG AGTGGAAATA TCTCGACCAC ACTGGTTCAG GAGGCAACCG CTACGAAAAT
GATCCCAGCC TCAAGCCTTT CATCCTCCCT GATGCCGCCC CCGACGCTCC TGGTCAGCTC
TATAACCTTT CGACGGATCC GGGAGAATCA ACAAATCTCT ACCACGCCCG ACCAGAAGTC
ACATCCAGGT TAAAGACACT TCTCGAACAG TCCAAAACAA ATGGCCGCAG CCGACCAACG
CGACCATAA
 
Protein sequence
MSHRTTRGWF CSCLASLCTL VLLNTDQSIV AAEQPNILLI LADDLGYGDL RCYNSQSKVS 
TSHIDRLARE GMRFTDAHSP STVCTPTRYG LMTGQMPFRA PSGGTVFTGV GGPSLIAPGR
LTLPMMLRER GYSTACVGKW HIGLTFFDRE GRPIHSNALE AVRQVDFSRR IDGGPVDHGF
DSFFGTACCP TTDWLYAFIE NDRVPVPPTA SLEKSALPKH PYAHDCRPGL IASDFAMEEI
DLIFLEKSRQ FLNQHVRQNP GKPFFLFHST QAVHLPSFAA KQFQGKSEAG PHGDFLLELD
YIVGELMKSL EELHIAENTL VIFTSDNGPE VTSVIHMRSD HGHDGARPWR GMKRDAWEGG
HRVPFIVRWP GKVRPGTTNS QLTSLTDVMA TVAAIVDTQL PDHAAEDSFN MLPAWLDESA
PPIRPYLLTQ SFGGSRTLAI RQGEWKYLDH TGSGGNRYEN DPSLKPFILP DAAPDAPGQL
YNLSTDPGES TNLYHARPEV TSRLKTLLEQ SKTNGRSRPT RP