Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_2364 |
Symbol | |
ID | 3521400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | - |
Start bp | 2459489 |
End bp | 2460967 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637284821 |
Product | sulfatase family protein |
Protein accession | YP_269082 |
Protein GI | 71280931 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.027189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATT CATTATTTTT ATTATTCTCA GGACTGAGTT TATTTACCTG TAGCCAAGCT GTTGCTACTC CTGATAAAAG CACGAGTAAA CCTAATGTAG TGATGTTATT GGTTGATGAT TTTGGTCGTC AAGATTTAAG CACCTACGGT AGCAACTTCT ACGAAACACC TAACATAGAC CAGCTGGCCG CAGATGGCAT GAAGTTTGAT AACGCTTATG CCGCACACCC TCGCTGTGTG CCTTCTCGCG TTGCAATATT TAGTGGTAGT TACCCTACGC GCTATGGTGT ACCGCAAGGT GAACGTGTCG GTAAACATCA CTTACCTTTA TCTGCAGTTA CTTTTGGTGA ACATCTAAAA GAGGCTGGTT ATCAAACGGG TTATATCGGT AAGTGGCATT TAGGTAAAGA AGGCGGAGAT CCAACCAAGC AAGGTTTTGA TAGCAGTATT ATGGCTGGTC ATTGGGGCGC GCCACCGAGT TATTATTTTC CTTACACTAA AATGAGTAAA TCAGGAAAAA ATAAAGGTTT TGCTAAAGTA GAAGGCTCCG AAGAAGAATA CTTAACGGAT CGCCTAACCG ATGAAGCGTT AACGTTTATT GAACAGAAAA AAGATCAGCC ATTTTTATTA GTTTTAGCGC ATTATGCTGT TCATACACCT ATTGAAGGTA AGCCTGCTTT AGTTAAAAAA TATAAAACTA AAATGAAAAA GCTAGGTATT GCCAATGCGG GTCCTAAGAG TGATGCCGAT TTAATTAAAG ATAGCACTGG CTATCATAAA ACCATTCAAA ACAATCCAGA TTACGCTGCG ATGGTTGAAA GTGTTGATAT TAGTGTCGGG CGTATTGAAC AGCAGTTAAA AAGATTAGGA CTTGAAGATA ATACTATTAT TATTTTAACT TCAGATCACG GTGGATTATC AAGCCGAGGT TTAAAATCTA ACCGTGTGCT TGCTACTAGC AACAATCCCT ACCGCCACGG TAAAGGCTGG ATTTATGATG GAGGCACTCG TGTACCACTT ATTGTTAAAT GGCCTGAAAA AGTAAAAGCT GGTTCAATTA GCCAAGTACA AGTTACCGGA ACAGATCATT ACCCGACTAT ATTACAAATG GCTGGCTTAT CACTTTCACC AAAAGATCAT CAAGATGGTG TAAGCTACTT AGCCGCTTTA AACAGTGATG AAACACCTAG AAAAGCTATG TTTTGGCACT CTCCTGCAGC TCGCCCCAGT AAAACAGGTG ATACCAATAG CTCTGCAATA ATTGAAGGCG AATGGAAGCT GTTAGATTTT TGGTCTACAG GAAAAGTTGA ATTATATAAC TTAAAAGATG ATAAAAGTGA GGCGAACAAC TTAGCCAAAT TAATGCCAGA AAAAACAGCT GAAATGCTTG CTAAACTCAC TAATTGGAAA GACGATATTG ACGCCCATAC TGTAAAAAAG AAAAATAAAA AAAGTAAGAA AAAATCTAAA TCGCATTAA
|
Protein sequence | MKNSLFLLFS GLSLFTCSQA VATPDKSTSK PNVVMLLVDD FGRQDLSTYG SNFYETPNID QLAADGMKFD NAYAAHPRCV PSRVAIFSGS YPTRYGVPQG ERVGKHHLPL SAVTFGEHLK EAGYQTGYIG KWHLGKEGGD PTKQGFDSSI MAGHWGAPPS YYFPYTKMSK SGKNKGFAKV EGSEEEYLTD RLTDEALTFI EQKKDQPFLL VLAHYAVHTP IEGKPALVKK YKTKMKKLGI ANAGPKSDAD LIKDSTGYHK TIQNNPDYAA MVESVDISVG RIEQQLKRLG LEDNTIIILT SDHGGLSSRG LKSNRVLATS NNPYRHGKGW IYDGGTRVPL IVKWPEKVKA GSISQVQVTG TDHYPTILQM AGLSLSPKDH QDGVSYLAAL NSDETPRKAM FWHSPAARPS KTGDTNSSAI IEGEWKLLDF WSTGKVELYN LKDDKSEANN LAKLMPEKTA EMLAKLTNWK DDIDAHTVKK KNKKSKKKSK SH
|
| |