Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_2984 |
Symbol | |
ID | 3520029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 3124275 |
End bp | 3125813 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637285437 |
Product | sulfatase family protein |
Protein accession | YP_269684 |
Protein GI | 71279570 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00336122 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAAA ATAAACTAAA AATGTTGATG ATGGGTGCAA GCTTGATAGC GACTGCATCC GCAACAGCAG CGGAAAAGCC AAATATTTTA TTTTTCTGGG GCGATGATAT AGGACGTACA AATATCAGTG CCTACAGCCA CGGTATAATG GGTTTTAAAA CACCTAACAT CGATCGCATA GCTAAAGAAG GTATGATGTT CACCGATTAT TATGCAGATC AAAGCTGTAC CGCTGGTCGT TCAACGTTTA TCACTGGACA ATCAGGTTTA CGTACCGGCA TGACAAAAGT TGGCTTACCT GGCGCTAAAG AAGGCATTCA AGATAGAGAT ATTACTATTG CAGAAATGTT AAAAGCTAAG GGCTATACCA CAGGTCAATT TGGTAAAAAC CACTTAGGTG ATAAAGATGA ACATTTACCC TCTAATCATG GTTTTGATGA ATTTTTTGGT AACCTTTACC ATTTAAATGC AGAGGAAGAG CCAGAAGACC CTGATTACCC TAAAGATCCT GCTTTTAAGA AAAAATTTGG ACCACGCGGT GTTATTCACT CTTATGCCGA TGGTAAAATT GAAGATACCG GCCCTTTAAC TAAAAAACGC ATGGAAACAG CTGATGATGA ATTTGTCGCT GCAGCCATGA AATTCGTTGA TAAAGCAGTG AAAGCTAAAA AACCTTTCTT TGTTTGGGTT AATACTGCAG GCATGCACTT TAGAACACAC ATCAATCCAA AACATGTGGG TCTTTCAGGT CAAGGGTTCT ATAACGATGT GATGGTCGCT CACGATAATC ATGTTGGCAT GATGTTAGAT CAACTTGATA AGTTAAAAGT TACTGACAGT ACAATTGTCA TGTACTCTAC CGATAATGGC GTGCACTACA ATACTTGGCC AGATGCCGGT ATAACACCGT TTGATGGTGA AAAAAACAGT GAAAAAGAAG GTGCTTATCG TGTTCCAATG ATGGTGCGCT GGCCTGGTAA AATTAAAGCC GGTGAAGTTT CAAACGAAAT GATGGCTCAT TTAGATTGGA TGCCAACTTT AGCTGCCGCT GCAGGTGATA CTAAACTCAA AGAAGACATG CTTAAAGGCA AACGTCGCTT TGGTAATAAG CAATCAAAAA TTCATCTTGA TGGCTATAAT ATGCTACCCC ACCTTACGGG TAAAACAGAG AAAAGCCCAC GCAACATTTA TCATTATTTA AATGATGAAG GTTTCCCTGT TGCCATTCGT ATTGGTGATT GGAAAATGGT TTATGCAGAA AATCGTGGTA AAACCTTGGC CCTTTGGACA GAACCTTTCA CTATGCTAAG AATGCCTAAA ATCTTAAACT TACGTCGTGA CCCGTGGAGT AAAGCTGAAG AAAACTCTAA TTCTTACTAC GATTGGATGA TTGATAAAGC GCCGTATATC TATTTAGGTT TATCAGAAAC AGCTAAGTTT TTATCAACCT TTAAAGACTA TCCACCTAGC CAACCTACTG GCTCTTGGTC AGTTGAAGCG GTATATGATA CTTTTTTGAA AAAATCTGAA GGTAAATAA
|
Protein sequence | MIKNKLKMLM MGASLIATAS ATAAEKPNIL FFWGDDIGRT NISAYSHGIM GFKTPNIDRI AKEGMMFTDY YADQSCTAGR STFITGQSGL RTGMTKVGLP GAKEGIQDRD ITIAEMLKAK GYTTGQFGKN HLGDKDEHLP SNHGFDEFFG NLYHLNAEEE PEDPDYPKDP AFKKKFGPRG VIHSYADGKI EDTGPLTKKR METADDEFVA AAMKFVDKAV KAKKPFFVWV NTAGMHFRTH INPKHVGLSG QGFYNDVMVA HDNHVGMMLD QLDKLKVTDS TIVMYSTDNG VHYNTWPDAG ITPFDGEKNS EKEGAYRVPM MVRWPGKIKA GEVSNEMMAH LDWMPTLAAA AGDTKLKEDM LKGKRRFGNK QSKIHLDGYN MLPHLTGKTE KSPRNIYHYL NDEGFPVAIR IGDWKMVYAE NRGKTLALWT EPFTMLRMPK ILNLRRDPWS KAEENSNSYY DWMIDKAPYI YLGLSETAKF LSTFKDYPPS QPTGSWSVEA VYDTFLKKSE GK
|
| |