Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_2368 |
Symbol | |
ID | 3522371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | - |
Start bp | 2466469 |
End bp | 2468082 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637284825 |
Product | putative N-acetylglucosamine-6-sulfatase |
Protein accession | YP_269086 |
Protein GI | 71281899 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCAA AAATAGATGC GTTAAAAATA ACAGTGCTAA GTTTAAGTTT ATGCTTTTCA GTATCATCGT TATCAGCAAC CGTAAACAAA ACAGTAAAAC AAAAGAAAAA TGTCATCTAC ATTTTAACTG ATGACCAACG CTATGATGAA GTAGGCTTTT TAAACCCGCG TATTGATACA CCAAATATGG ATAAACTTGC TGCTGGCGGT GTTTATTTCA AAAATGCTTT TGTTACTACC GCCCTTTGCT CACCTAGTCG TGCAACGATA TTAACTGGTC AGTACATGCA TAATCATGGA GTAGTTGATA ATAATAACCC AGCAAAAGAA AGCTCTGTAT ATTTCCCTTC CTATCTACAA GAGGTGGGTT ATGAAACAAG TTTCTTCGGC AAATGGCATA TGGGTGGTCA CGGTGACTCT CCTCAACCGG GGTTTGATCA TTGGTTAAGT TTTGCGGGTC AAGGACATTA CTATCCCAAA AAAGATAAAA AAGGTCGAAC AAACAAAATT AATATCAATG GCGAAAGAGT TGACCAAAAG GGCTATATTA CCGATGAGTT GACTGATTAC GCGGTGGATT GGTTAGACAA ACGTGATTCA GACAAACCAT TTTTTATGTA TTTATCTCAT AAAGCAGTAC ATTCTAATTT TGATCCTGCT CCACGCCATA AAGATCAATA TAGCGATGTA GCAATTGAAG TCCCTGAAAG CCAAGCCGAT ACTCCAGAAA ACTATGCAGG CAAGCCTATG TGGGTGAAGA ATCAACGTAA TAGCTGGCAT GGGGTCGACT TTCCTTACCA TAGTGAAATG GACGTTCAAG AATATAAGCG TCAATACCAT AGAGCACTAT CTGCCGTGGA TGATAGTTTA GGTCGTGTAT TAAAGTGGCT AAAAGATAAT AACTTAGAAA ATGATACTAT TGTGATGTTA ATGGGCGATA ACGGCTTTAT GTTTGGCGAA CACGGTTTAA TTGACAAGCG TAATGCTTAT GAAGAGTCTA TGCGTGTACC GTTACTTGCT TATGCTCCCG GTTATTTCAA ACCCGGCACC GTAGTAGACG AAATGGTTGC TAACCTAGAC ATAGCCCCTA CAATATTAGA AATTGCAGGC GCTAAAAAAC CAGCTCACTT TGATGGCGAC AGTTGGTTAC CTCTTGCTAA AAACAAAGAA GTAAATCAAT GGCGTGAGAA CTTTTTATAT GAATATTATT GGGAATTTAA CTACCCTTCT ACCCCAACTA CTTTTGCTTT GCGTACTGAC AACTACAAAC TAATTCAATA TCACGGTGTT TGGGACACTG AAGAGCTTTA TGACTTAAAA AATGATCCTA AAGAAATGAA CAACTTAATC AATACACCTA AACATCAACC ACTTATAGCG CAAATGCGTC ATGATTTATT CAACCTTTTA GTGAATAAAA AAGGTGATAA TGTTATCCCT TACACTGAAA AGTATACTCC TGGTGCGGTT TACCGTGAAC GTGACCGTGG CGAAACAGCT GACTTTCCAG ATAACTGGTT GAAAAAAGAG GGTGATGATG GTTTAAGAAC GTTCTTACGA ATAAAGCCTA TTAAAGATAA AAAAGATGAC AAGAAAAAAT CAGCTAAACA TTAA
|
Protein sequence | MSSKIDALKI TVLSLSLCFS VSSLSATVNK TVKQKKNVIY ILTDDQRYDE VGFLNPRIDT PNMDKLAAGG VYFKNAFVTT ALCSPSRATI LTGQYMHNHG VVDNNNPAKE SSVYFPSYLQ EVGYETSFFG KWHMGGHGDS PQPGFDHWLS FAGQGHYYPK KDKKGRTNKI NINGERVDQK GYITDELTDY AVDWLDKRDS DKPFFMYLSH KAVHSNFDPA PRHKDQYSDV AIEVPESQAD TPENYAGKPM WVKNQRNSWH GVDFPYHSEM DVQEYKRQYH RALSAVDDSL GRVLKWLKDN NLENDTIVML MGDNGFMFGE HGLIDKRNAY EESMRVPLLA YAPGYFKPGT VVDEMVANLD IAPTILEIAG AKKPAHFDGD SWLPLAKNKE VNQWRENFLY EYYWEFNYPS TPTTFALRTD NYKLIQYHGV WDTEELYDLK NDPKEMNNLI NTPKHQPLIA QMRHDLFNLL VNKKGDNVIP YTEKYTPGAV YRERDRGETA DFPDNWLKKE GDDGLRTFLR IKPIKDKKDD KKKSAKH
|
| |