Gene Information |
Locus tag | CPS_0660 |
Symbol | |
ID | 3519819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | - |
Start bp | 659322 |
End bp | 660899 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637283125 |
Product | sulfatase family protein |
Protein accession | YP_267410 |
Protein GI | 71279364 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
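The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 659322
END_BP = 660899


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return length_bp // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1578 525
```

Both results match the Gene Length (1578 bp) and Protein Length (525 aa) fields. The strand does not affect the calculation, since the length depends only on the span between the two coordinates.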
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.61402 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAAT TTTTACCTAT GAAAAGTTAC TTAACAATGA ATTTCACCAA AATCATAACA ACCTGTTTAT TAGTCACTTC TGGAATGGCC TCAGGGACTA CTGATACAGA GCGTCCAAAC ATACTAGCTA TCTGGGGAGA TGATATTGGC CAAAGTAATA TTAGCGCTTA TACCCACGGC ATGATGGGCT ATAAAACCAC TAACATTGAC CGTATAGCAA AAGAAGGTGT GTTATTTACT GATTATTACG GTGAAAACTC TTGTACCGCT GGTCGTGCAG CCTTTATTAC TGGTCAATAT CCGGTACGAA CGGGATTAAC CAAGGTTGGC TTACCTGGCT CTGATAAAGG TTTACGCGCA GAGGATGTCA CCATTGCCGA ACTATTAAAA GATAGAGGTT ATGTTACTGG CCAGTTTGGT AAGAATCACT TAGGGGATAA AGATGAATTT TTACCAACCA ATCACGGTTT TGATGAGTTT TTGGGAAATT TGTATCACTT AAATGCAGAA GAAGAACCAG AACATCCCGA CTACCCAAAA GATCAAGCTT ATAAAAAACG TTTTGGCCCA AGAGGTGTTA TTCATTCCTT TGCTGATGGA AAAATAGAGG ATTCGGGACC ATTAACTAAA AAACGCATGG AAACAATCGA TGATGAGTTT TTAGCGGCAA CGACTAAATT TATCGATAAA GCGCATAAAA ACAACAAACC ATTTTTTGTC TGGTTTAATG CGACTCGCAT GCACATCTGG ACACATTTAA AAGAAGAGTC TAAAGGTTTA TCAAAGCGTG GCGGCATATA CGGTGATGGC ATGATGGAAC ATGATTATCA GGTCGGCGTA TTACTTGATC AATTAGACCG TTTAGCTATT GCTGACAACA CCATTGTTTT ATACACAACC GATAATGGTG CTGAAGTCTT TTCTTGGCCT GATGGTGGCA CCATTCCGTT TAAAGGTGAA AAAAATACGA CTTGGGAAGG CGGATTCCGT GTACCGGCTA TGGTGAGATG GCCAGGTAAA ATAACAGCTG GCGACGCTAA AATAGAAATG GTGTCACATA TGGATTGGGC ACCAACATTG TTGGCTGCTG CAGGTGTCAC TGATATTAAA GAAAAGCTAA AACAAGGGAC TACCGTTAAT GGCAAAAAGT ATAAAGTACA TTTAGATGGT TATAACCTGT TGCCTTATCT TACGGGAGCA ACCGATGAAG CGCCAAGACC GAGTTACCTG TATTTTACCG ATGGTGGTGA TTTATCAGCG GTTCGTTTTG GCGATATGAA ACTGCAATAC AGTATTCAAG AATGTGAAGG ATTAAATGTT TGGATATGTC CATTAACCCC GTTAAGAGCA CCGCTGTTAA CCAATTTACG TCAAGACCCT TATGAACGTG CTCGAGACGA ATCAGGTAGT TATGAAAGGT GGTATGTGGA TCATATTTTT GAATTTTCTC GCGGCATTAC CATGACAGCA CAGCAAATGA AGACCTTTGT TGAATTCCCT CCTCGCCAAA AGCCGGCCAG TTGGAGCGTG GATGCCATGG TTAAAAAAAT AATGGGGATG TCAGTACCAC AATACTAG
Protein sequence | MDKFLPMKSY LTMNFTKIIT TCLLVTSGMA SGTTDTERPN ILAIWGDDIG QSNISAYTHG MMGYKTTNID RIAKEGVLFT DYYGENSCTA GRAAFITGQY PVRTGLTKVG LPGSDKGLRA EDVTIAELLK DRGYVTGQFG KNHLGDKDEF LPTNHGFDEF LGNLYHLNAE EEPEHPDYPK DQAYKKRFGP RGVIHSFADG KIEDSGPLTK KRMETIDDEF LAATTKFIDK AHKNNKPFFV WFNATRMHIW THLKEESKGL SKRGGIYGDG MMEHDYQVGV LLDQLDRLAI ADNTIVLYTT DNGAEVFSWP DGGTIPFKGE KNTTWEGGFR VPAMVRWPGK ITAGDAKIEM VSHMDWAPTL LAAAGVTDIK EKLKQGTTVN GKKYKVHLDG YNLLPYLTGA TDEAPRPSYL YFTDGGDLSA VRFGDMKLQY SIQECEGLNV WICPLTPLRA PLLTNLRQDP YERARDESGS YERWYVDHIF EFSRGITMTA QQMKTFVEFP PRQKPASWSV DAMVKKIMGM SVPQY