Gene Information |
Locus tag | CPS_2375 |
Symbol | |
ID | 3519057 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 2477827 |
End bp | 2479335 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637284832 |
Product | sulfatase family protein |
Protein accession | YP_269093 |
Protein GI | 71278333 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
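The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (an assumption that matches the listed values) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa for an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(2477827, 2479335)
print(length)                  # 1509
print(protein_length(length))  # 502
```

Both results agree with the Gene Length (1509 bp) and Protein Length (502 aa) fields of this record.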
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0813491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAC AGTCATTAGT CGCAACTAGC CTCTTAGCCG CTTTTATAAC AGGTGCTTGT ACAGCCGATA CAAATATCCA AGCAAAGCCA ATTGTTAGCC AGGTCAAAGC AAAAAAACTT ATTAAACCTA ACGTATTATT TATCGCAGTA GATGATTTGC GTGTGCAATA TGGACCCTAT GATTTTGACA AAGCTATAAC TCCTAATATT GACCGTTTGG TCAATCAAGG GGTCGCCTTT ACTCAGGCTT ATAGCAATGT GCCTGTGTGT GGTGCCTCGC GCGCTTCGAT GTTAACAGGT GTTCGTCCAA CGATTAATCG CTTTGTTGCT TTTGAAAGCG CTGATAAAGT GGCTCCTTGG GCACCATCTA TTGCCAAAGT TTTTTCTGAC AATGGTTATA CAACCTACAG TTTGGGAAAA ATATTTAATA ACTTAACTGA TCATGCAAAT GATTGGAGTG AATTTCCTTG GCGCCCTGAA GGGGCTAAAA ATGAAGATTC AACTAGCGGT AATAAAAAAC AAGCAAGCTT ATTATCACGG CATGATTATG TAACCTCAGA TGGTGTTGCA ATGGCTAAAA AAGGTATTAA AAATCATCCC GCTTTTGAAA AAGCAGATGT CGTGGATGAT GCTTATAAAA ATGGCAAGAT AGCAAAACGT GCCATTAGTG ATTTAAAACG ATTGAAAAAA GCAGGGAAAC CATTCTTTTT AGCCGTTGGT TTGAAAAAAC CACATTTACC TTTTAATGCG CCAAGTAAAT ATTGGGACCT GTATGATGAA AGCACAATTG AACTTACGAA AATACCCTTA AAAGCTAAGG ATTCTCCAAG TCAATCCGAT CATAATTGGA ATGAGCTAAG AAACTATGGC CATGATGGGG CTATGCCAAA AAAGGGCAAA ATGTCTGATG AAATGGCACG AAAACTTATC CATGGTTATC ATGCCGCTAC CAGCTATAGT GATGCATTAA TTGGCAATAT TTTAACTGAG TTGGAAAGTT TAGGTTTAGA AGAAAATACT ATTGTTGTTT TATGGGGGGA TCATGGCTGG AGCTTAGGTG AACATACTCA TTGGGCTAAA CACTCTTCTT ATGATGTCAC TAATCATATT CCATTAATTA TCAAAGTGCC GGGCATGACT AATGGAGAAT TTTCCAAAGG TTTAGTTGAG TCAGTAGATA TATTTCCTAC CTTAACCCAG TTGGCTGGAT TACCTGCACC AAGTTCATTA CAAGGTGACT CCCTTGTACC GATGTTGAAA AATCCACAGG CTACAGTCAA TGACGCTGTT TATCCTCGAT GGAAAAATGC TGACAGTATT CGTACCCCAA ATTATATGTA TACCGAATGG CGAAATAAGA AAAATAACAA AGTGATAGCA AGAATGTTAT TTGATCACCG TGTTGACCCT AGAGAGACAA TTAACGTCGC TGAAAATTTT AAATATGCTC AAGTGGTAGT TGATTTACAT AATCAGTTAG CTGCCCATAT TGCGCAGGTT GAAAAATAA
Protein sequence | MKKQSLVATS LLAAFITGAC TADTNIQAKP IVSQVKAKKL IKPNVLFIAV DDLRVQYGPY DFDKAITPNI DRLVNQGVAF TQAYSNVPVC GASRASMLTG VRPTINRFVA FESADKVAPW APSIAKVFSD NGYTTYSLGK IFNNLTDHAN DWSEFPWRPE GAKNEDSTSG NKKQASLLSR HDYVTSDGVA MAKKGIKNHP AFEKADVVDD AYKNGKIAKR AISDLKRLKK AGKPFFLAVG LKKPHLPFNA PSKYWDLYDE STIELTKIPL KAKDSPSQSD HNWNELRNYG HDGAMPKKGK MSDEMARKLI HGYHAATSYS DALIGNILTE LESLGLEENT IVVLWGDHGW SLGEHTHWAK HSSYDVTNHI PLIIKVPGMT NGEFSKGLVE SVDIFPTLTQ LAGLPAPSSL QGDSLVPMLK NPQATVNDAV YPRWKNADSI RTPNYMYTEW RNKKNNKVIA RMLFDHRVDP RETINVAENF KYAQVVVDLH NQLAAHIAQV EK
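The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial code listed above), GTG can act as an initiation codon and is then read as methionine, so the two are consistent. The sketch below shows one way to strip the 10-base grouping, split a sequence into codons, and compute GC content. It uses only the first 30 bases of the gene as a demonstration, and the helper names are hypothetical.

```python
def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def codons(seq: str) -> list[str]:
    """Split a sequence into consecutive complete triplets from the first base."""
    s = clean(seq)
    return [s[i:i + 3] for i in range(0, len(s) - len(s) % 3, 3)]


# First 30 bases of the gene sequence in this record.
fragment = "GTGAAAAAAC AGTCATTAGT CGCAACTAGC"

print(codons(fragment)[:3])            # ['GTG', 'AAA', 'AAA']
print(round(gc_content(fragment), 2))  # 0.4
```

The 40% value applies only to this 30-base fragment. Running `gc_content` on the full gene sequence would be the way to compare against the 38% GC content field.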