Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_2738 |
Symbol | |
ID | 9139450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | - |
Start bp | 3550742 |
End bp | 3552142 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003630760 |
Protein GI | 296122982 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.846304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACCCG CTCATCACCT TACATCGGCC ACATCCCGTC TCACGTTGTT GCTTTGGAGC TTTTGCGTCC TCTGCGGTGC TTCCTTCGCC GCAGATTCCA CGAAGCCCAA CATCGTGGTG ATCTTTGCTG ATGATCTTGG TTACGGTGAT CTCGGCTGTT ATGGGTCGCC AACAATCCGC ACGCCGCACC TCGACGAGAT GGCTGCGGAG GGATTACGGT TTACAGATTT TTATTCTGCC GCCGAGGTGT GTACCCCCAG CCGAGCGGCT TTGCTCACGG GACGATTGCC AATTCGCAGT GGCATGTGCG GGGCTCGCCG CGTGCTGTTT CCAAACTCCA AAGGAGGGTT GCCCCCGGCA GAGATCACCA TCGCCGAGGC ACTCAAGGAA AAGGGCTATG CCACTGCACA GATTGGCAAG TGGCATCTCG GCATTCACCC GGGGTCGCGT CCATTGGATC AGGGCTTCGA TCAAAGCTTT GGCCTGCCAT ACTCCAATGA TATGGATGCC CGGGCCGATT TGCCGAAAGG CTCGACGGGT TCACCCAATC CACCGCTCGA CGGCTGGAAT GTGGCGCTGT TGCGCAATGG AGAGGTTGTT GAACAACCGG CAAACCAGAC CACGTTAACG AAACGTTATA CCGAAGAAGC CATCAAGTTC ATCACAGAGA AGAAGAACGT TCCATTCTTC CTCTACATGC CTCACACCTT TCCGCATGTG CCCATGTTTG CCTCGCAGGA TTTCAAGGGC AAAAGCCGTG CGGGCATTTA TGGTGACGCT GTTGAAGAGC TGGATTGGAG TGTGGGGCAG GTCCTGGGAG CCTTGTGTCG GGAAGGTATC GCTGAGAATA CGCTTGTTTT CTTCTCCAGT GATAACGGCC CCTGGCTCAT CATGGGCGAT CAAGGCGGCA GTGCCGGTCT GCTCAAGGAT GGCAAAGGGA GCACCTGGGA AGGCGGCATG CGTGTACCCG GGATTGCCTG GATGCCGAGC CGGATCAAGC CCGGCGTGAC CAGTCAGCTC GCCAGTGCGA TGGATGTGTT TCCCACGGCT CTGGCCCTGG CCGGTGCATC GCTCCCGAAG GATGTTGTGT TCGATGGCGT CGATCTCGCG CCATTACTTT TCGAATCCAG GCCTCTGCCG GAGCGACCGT TCTTTTATTA TCGAGGCAAT CAACTTTTTG CCTGCCGCCT GGGTGAATGG AAGGCTCATT TCCAGACTCA AACAGGCTAT GGAGGCTCAA AACCGGAACG GCATGAACCA GAACTGCTCT TTCATCTCGG TCGCGATCCT TCCGAGAAAC GTAATGTCGC CGCCGCACAT CCCGAGGTTC TCATTCGTAT TCAGGAAGCT GTGAAGGCTC ACCAATCCCA AGTGATTCCA GGCCCCCCAC AGCTTCAATA G
|
Protein sequence | MSPAHHLTSA TSRLTLLLWS FCVLCGASFA ADSTKPNIVV IFADDLGYGD LGCYGSPTIR TPHLDEMAAE GLRFTDFYSA AEVCTPSRAA LLTGRLPIRS GMCGARRVLF PNSKGGLPPA EITIAEALKE KGYATAQIGK WHLGIHPGSR PLDQGFDQSF GLPYSNDMDA RADLPKGSTG SPNPPLDGWN VALLRNGEVV EQPANQTTLT KRYTEEAIKF ITEKKNVPFF LYMPHTFPHV PMFASQDFKG KSRAGIYGDA VEELDWSVGQ VLGALCREGI AENTLVFFSS DNGPWLIMGD QGGSAGLLKD GKGSTWEGGM RVPGIAWMPS RIKPGVTSQL ASAMDVFPTA LALAGASLPK DVVFDGVDLA PLLFESRPLP ERPFFYYRGN QLFACRLGEW KAHFQTQTGY GGSKPERHEP ELLFHLGRDP SEKRNVAAAH PEVLIRIQEA VKAHQSQVIP GPPQLQ
|
| |