Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0014 |
Symbol | |
ID | 4027333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 17225 |
End bp | 19072 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637965166 |
Product | sulfatase |
Protein accession | YP_572078 |
Protein GI | 92112150 |
COG category | [R] General function prediction only |
COG ID | [COG3083] Predicted hydrolase of alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGATA CTCTGAGGCG ACGCTGGCGC GGCACGCTGG CGTTCGCGCT GTTGCAGTTG CCCCTCATCT GGCTGGTGGC CCTGCGCTAC ACGCCTTATC TCGCCGTGCC CGACGACCCC ATGGGCGTGG CTTACCTGGT CCTGACCTGG ATCGGCCACT TCGGCATGCT GGCATTGCTC GGCTGGCTGC CACTGGGCGT ACTGGCCCTG CTGCTGAAGG CGCGCTGGCT ATGGCTGCCG GCGGCGCTGC TGGGCGCGCT CGGCCTGTGT GCGCTGTTGC TGGATACCGT GGTCTATGCC CAGTACCGCT TCCATGTGAA CTACTTCATG GTCTCGCTGT TCTTGAACGA CGAGAATGGC GAGATCTTCA GCTTCACGAC CTCGACCTGG CTGGTGGTGA TCGGCTGCGT GCTGCTGGCC CTGACCCTGG AAGGCTGGCT CGCCCAGCGC CTGATCGCGG GGGGACGAGG CCGTCGCCTG CCGGTCGGCT CGGCGTGCGG CGTGGTGCTG CTGGCCCTGC TGGGCAGCCA CGCGCTGCAT ATCGTCGCCG ACGCCCGCTA CATGCGCAGC GTGACCCAGC AGGTCGGCGT CTATCCGCTG CTGTTTCCCA CCACTGCCAA GGACTTCATG GAGGAACACG GCTGGCTCGA TCCCCGCGCC GCCCGGGCCG CGAACGCCGA TATCGAGGCC CGGCAGGCAC AGAATCTCGA TTGGCCCAAG AACCCGCTGA GCTGTCAGGC CATGCAGCCG CCCAATGTGC TGGTGGTGCT GATCGACTCC TGGCGCGCCG ACGAATACGG ACCGAAGAAC ACGCCCAACC TGCATGCCGC ACTGAACGAG AGCGGCCGGC GCTATCTGAA TCACTACAGC GGCGGCAACG CGACCCGCAA CGGCACCATG AGCCTGTTCT ACGGCCTGAC CGGCAACTAC TACGCCTATC TGAACGACTC CCAGACGCCC CCGCTGCTGC TGACGCAGTT GCAGAAGCAG GATTACGCGC TGGGCATCTT CTCGTCCGCC AGCCTCGGCA GTGTCGGCTT CGACCGCACG ATCTTCTCGT CGATCGAGTC ACTGCGGATG GACACCCAGG GAGACTCGCC CGCGGACCGG GACCGGCAGA TGACCGAGGA CTGGATGCAC TGGCTCGGCC GGCAGGAACG GCAGGACGCT ACCCCGTGGT TCGGCATGCT GTTCTACGAT GCGCCCCACG GCTATGACGT CCCGGCCGAC GCCGCCCAGC CCTTCCAGCC GTCGGTACAG AACATGGACT ATCTCGAGCT GGGTCCCGAG ACCGATCCCC TGCCGTACTT CAACCGTCAT CGCAACGCCG TGCATTACGA CGACGTCCTG CTCGGCAAGA CCATCGACGA CCTGAAAGCC AAGGGCGAAT GGGACGAGAC CCTGCTGGTG GTCACATCCG ACCATGGCCA GTCATTCGAT GATTTCGACA AGAACTATTG GGGCCACAAC GGCCACTTCG CCTCGCCGCA GACCCGTGTG CCGATGCTCG TCAACGGCCC CGGCGTCGAG CCGGGCGAGG TCACGGGCAT GACCAGCCAC CTCGACGTGG CCCCCATGCT GATGCGCCAC GCCCTGGGGT GCAGCAACCC GCTCTCCGAC TATGCCATGG GCGAGGACCT GCTGAAGCCC GGCATCGACC ATCCCTGGGT GCAATCCAGC AGCTACATCG ACTACGGCAT CATCGAGCCG AACCGGATCA CGGTGGTCGA TGGCACCGGT CAGTGGGAGA TCGTCGACCG CCAGCTCGAT CCGATCGAAG GCGCCGAATT CTCGCCGGCG GTGTTCGACG CGATGCAGTG GTTCCGCCGC TTCTATCGCC AGGGCTGA
|
Protein sequence | MQDTLRRRWR GTLAFALLQL PLIWLVALRY TPYLAVPDDP MGVAYLVLTW IGHFGMLALL GWLPLGVLAL LLKARWLWLP AALLGALGLC ALLLDTVVYA QYRFHVNYFM VSLFLNDENG EIFSFTTSTW LVVIGCVLLA LTLEGWLAQR LIAGGRGRRL PVGSACGVVL LALLGSHALH IVADARYMRS VTQQVGVYPL LFPTTAKDFM EEHGWLDPRA ARAANADIEA RQAQNLDWPK NPLSCQAMQP PNVLVVLIDS WRADEYGPKN TPNLHAALNE SGRRYLNHYS GGNATRNGTM SLFYGLTGNY YAYLNDSQTP PLLLTQLQKQ DYALGIFSSA SLGSVGFDRT IFSSIESLRM DTQGDSPADR DRQMTEDWMH WLGRQERQDA TPWFGMLFYD APHGYDVPAD AAQPFQPSVQ NMDYLELGPE TDPLPYFNRH RNAVHYDDVL LGKTIDDLKA KGEWDETLLV VTSDHGQSFD DFDKNYWGHN GHFASPQTRV PMLVNGPGVE PGEVTGMTSH LDVAPMLMRH ALGCSNPLSD YAMGEDLLKP GIDHPWVQSS SYIDYGIIEP NRITVVDGTG QWEIVDRQLD PIEGAEFSPA VFDAMQWFRR FYRQG
|
| |