Gene Information |
Locus tag | Plim_3640 |
Symbol | |
ID | 9140358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 4681938 |
End bp | 4683500 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003631651 |
Protein GI | 296123873 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGA AACGATGGCT GTTGACTCTA TGCACTCTGG CTTTATGCGT GGTAAACCAC TCGGAAGTTC CTTCAGCTCT GGCTGCTGAA ACCACAGGAA AGCCCCAGGT TTCCCGGCCC AACATCGTGC TCATCTATGT GGACGATCTC GGTTATGGCG ACATCAGTTG TCATGGCGCC ACGCTGGTCA AAACGCCCCA TGTCGATCGA TTGGCTCGCG AAGGACTCAA CTTTTCTGAT GGACACTCAC CGTCGGCCAC TTGCACTCCC TCGCGCTACG CCATGCTCAC CGGCGAATAT GCCTGGCGGA AAAAAGGGAC TGGAGTTCTT CCGGGCGATG CCAGGCTGAT TATTGAACCG GGCCGCCGCA CTCTCGCTTC GACACTTCAA AAGGCGGGCT ATCGCACCGG TGTCGTCGGG AAATGGCATC TGGGTTTAGG AGACGAAAAA CTTAACTGGA ACGGCGTCAT CAAACCCGGC CCTCTCGAAG TGGGCTTTGA TGAATCGTTC ATCATGGCCG CGACCGGCGA TCGAGTGCCA TGTGTCTACG TTGAGCAGGA TCGTGTGCTC AACCTTGACC CGAACGACCC AATCAAAGTC CAGTTCGGCA AACCAATTGA TCCTGCCCTA CCCACAGGGA AATCGCATCC AGAATTGCTC ACGGTCATGA AGCCGAGCCA CGGTCACGAC ATGACGATTA TCAATGGTGT CAGCCGGATT GGCTATATGA CAGGTGGCAA GGCCGCTCTC TGGAACGATC AGGAGATGGC CGATGTCTTT ACCTCAAAAG CACTCAAGTT CATGACCGAT CATTGGGCTC GCCATGCCGA TCAGCCGTTT TTCTTGTTCT TTTCGCTGCA CGATATTCAC GTTCCCCGCT TGCCTCACCC CCGCTTTGTC GGCAGCACCA GCATGGGCCC GCGCGGCGAC GTGATTGTCG AAATGGATTG GTGTGTCGGT CAGGTGCTCG ACAAGCTCGC GGCCTTAGGA ATTGACGACG AGACGATGGT CATCTTTACC AGCGATAATG GCCCCGTCGT CGATGATGGA TACAAAGATG AAGCCGTCAC GAAGCTGAGT CATCATCAAC CGGCTGGCCC TTATCGAGGT GGTAAATATA GTGCCTATGA AGGAGGGACT CGCGTCCCCT TCATTGTCCG CTGGCCAGGT CGCATCCAGC CGGGAACATC GAACGCGTTG ATGTGTCAGA TCGACCTCAT GGCCTCGCTC GGCAAACTGG TGGGGCAACC TGTCCCACCC CAGGAAGCGT ATGACAGTAT TGATGTCCTT CCCGCTTTAT TGGGTGAGTC ACAGGCAGGT CGAGAGCAAC TGGTGGAGCA CTCGGGAGTT CTGGGCCTGC GCGCGGGCCC CTGGAAACTG ATTGAGCCCG GCAAAGCCCC GCGTGTCTTC CAGCAGACCA ATACCGAAAC CGGCCAGCTC CCCAGACCTC GCCTGTTTAA TCTCGAAGAA GACCCCGGCG AAACCCGCGA CCTCGCCGAA GACCAACCCG AAAAAGTCAA AGAACTCCAA GCCCTCCTCG AGAGAATCAA AGGTGAGATC TAA
|
Protein sequence | MKLKRWLLTL CTLALCVVNH SEVPSALAAE TTGKPQVSRP NIVLIYVDDL GYGDISCHGA TLVKTPHVDR LAREGLNFSD GHSPSATCTP SRYAMLTGEY AWRKKGTGVL PGDARLIIEP GRRTLASTLQ KAGYRTGVVG KWHLGLGDEK LNWNGVIKPG PLEVGFDESF IMAATGDRVP CVYVEQDRVL NLDPNDPIKV QFGKPIDPAL PTGKSHPELL TVMKPSHGHD MTIINGVSRI GYMTGGKAAL WNDQEMADVF TSKALKFMTD HWARHADQPF FLFFSLHDIH VPRLPHPRFV GSTSMGPRGD VIVEMDWCVG QVLDKLAALG IDDETMVIFT SDNGPVVDDG YKDEAVTKLS HHQPAGPYRG GKYSAYEGGT RVPFIVRWPG RIQPGTSNAL MCQIDLMASL GKLVGQPVPP QEAYDSIDVL PALLGESQAG REQLVEHSGV LGLRAGPWKL IEPGKAPRVF QQTNTETGQL PRPRLFNLEE DPGETRDLAE DQPEKVKELQ ALLERIKGEI
|
| |
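The coordinate and length fields in this record can be cross-checked against one another. The sketch below is a minimal consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its terminal stop codon (the gene sequence above ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that includes its stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record (Start bp, End bp).
start_bp, end_bp = 4681938, 4683500
length_bp = gene_length(start_bp, end_bp)

print(length_bp, protein_length(length_bp))  # prints "1563 520"
```

Both results agree with the Gene Length (1563 bp) and Protein Length (520 aa) fields listed in the record.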