Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_3913 |
Symbol | |
ID | 9140631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 5028260 |
End bp | 5029828 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003631923 |
Protein GI | 296124145 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.656698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCATC GCACAACAAG AGGCTGGTTC TGTTCTTGTC TGGCAAGTCT CTGCACGCTG GTGCTGTTAA ATACTGATCA ATCGATCGTT GCCGCTGAGC AACCGAACAT CCTGCTGATT CTGGCCGACG ATCTCGGCTA TGGCGATCTT CGCTGCTACA ACAGCCAGTC GAAAGTATCG ACATCACATA TAGACCGGCT TGCCCGCGAA GGGATGAGGT TCACTGATGC GCATAGCCCC AGCACAGTCT GCACTCCGAC TCGCTATGGA TTGATGACGG GACAGATGCC ATTTCGAGCA CCCAGCGGCG GCACAGTCTT CACCGGAGTT GGCGGGCCGT CGCTGATTGC TCCGGGCCGA CTGACGTTGC CGATGATGCT CCGCGAGCGG GGCTACTCAA CAGCGTGTGT CGGCAAGTGG CATATCGGGC TGACATTCTT CGACCGTGAA GGTCGGCCCA TTCACAGTAA TGCCCTTGAA GCCGTCAGGC AGGTCGATTT CAGTCGCCGG ATCGACGGCG GCCCGGTCGA TCATGGCTTT GATTCATTCT TTGGCACAGC CTGTTGTCCC ACGACCGACT GGTTATATGC CTTCATCGAG AATGATCGCG TCCCTGTGCC CCCCACCGCA TCACTCGAAA AGTCAGCACT TCCCAAACAC CCTTATGCCC ATGACTGCCG GCCCGGACTC ATCGCATCAG ACTTCGCCAT GGAAGAGATC GATCTGATCT TTCTGGAGAA GAGTCGTCAG TTTCTCAATC AGCATGTTCG CCAAAATCCC GGCAAACCAT TCTTCCTCTT CCATTCAACA CAGGCGGTTC ATCTTCCCTC ATTTGCTGCC AAACAGTTTC AGGGGAAGTC AGAGGCCGGG CCACATGGCG ACTTTCTCCT CGAACTCGAC TACATCGTTG GCGAACTCAT GAAGTCCTTG GAAGAGCTTC ACATCGCTGA GAATACACTG GTCATCTTCA CGAGTGACAA CGGCCCCGAA GTGACGAGCG TTATTCACAT GCGAAGCGAC CATGGTCATG ATGGTGCGCG CCCCTGGCGG GGAATGAAGC GAGACGCCTG GGAAGGAGGA CATCGCGTCC CATTCATCGT GCGGTGGCCA GGTAAAGTCA GACCTGGCAC TACTAATTCG CAACTGACGA GTCTGACCGA TGTGATGGCG ACAGTGGCCG CGATTGTGGA TACTCAACTC CCCGACCATG CCGCTGAAGA CAGCTTCAAC ATGCTCCCGG CATGGCTTGA TGAAAGTGCG CCGCCGATTC GGCCCTACCT GCTGACACAA TCCTTCGGCG GATCGCGCAC TCTCGCGATC CGGCAGGGCG AGTGGAAATA TCTCGACCAC ACTGGTTCAG GAGGCAACCG CTACGAAAAT GATCCCAGCC TCAAGCCTTT CATCCTCCCT GATGCCGCCC CCGACGCTCC TGGTCAGCTC TATAACCTTT CGACGGATCC GGGAGAATCA ACAAATCTCT ACCACGCCCG ACCAGAAGTC ACATCCAGGT TAAAGACACT TCTCGAACAG TCCAAAACAA ATGGCCGCAG CCGACCAACG CGACCATAA
|
Protein sequence | MSHRTTRGWF CSCLASLCTL VLLNTDQSIV AAEQPNILLI LADDLGYGDL RCYNSQSKVS TSHIDRLARE GMRFTDAHSP STVCTPTRYG LMTGQMPFRA PSGGTVFTGV GGPSLIAPGR LTLPMMLRER GYSTACVGKW HIGLTFFDRE GRPIHSNALE AVRQVDFSRR IDGGPVDHGF DSFFGTACCP TTDWLYAFIE NDRVPVPPTA SLEKSALPKH PYAHDCRPGL IASDFAMEEI DLIFLEKSRQ FLNQHVRQNP GKPFFLFHST QAVHLPSFAA KQFQGKSEAG PHGDFLLELD YIVGELMKSL EELHIAENTL VIFTSDNGPE VTSVIHMRSD HGHDGARPWR GMKRDAWEGG HRVPFIVRWP GKVRPGTTNS QLTSLTDVMA TVAAIVDTQL PDHAAEDSFN MLPAWLDESA PPIRPYLLTQ SFGGSRTLAI RQGEWKYLDH TGSGGNRYEN DPSLKPFILP DAAPDAPGQL YNLSTDPGES TNLYHARPEV TSRLKTLLEQ SKTNGRSRPT RP
|
| |