Gene Information |
Locus tag | CPS_2983 |
Symbol | |
ID | 3520535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 3122407 |
End bp | 3123975 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637285436 |
Product | putative arylsulfatase |
Protein accession | YP_269683 |
Protein GI | 71280072 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
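The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the convention under which the listed values agree) and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3122407, 3123975)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the Start bp and End bp values of this record, the results match the listed Gene Length (1569 bp) and Protein Length (522 aa).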
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0960108 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATGA ATAATCGGCT TAAGAAGTTA GCACTAGGCA TAGGTGTACT TGCCATCGCC ACCAGTGCAG CAGCAACAAC CAACAAAGCT AAGCCTAATG TACTAGCTAT TTGGGGTGAT GATATTGGTT ATTACAATAT CAGTGCTTAT AACCAAGGCA TGATGGGTTA TCAAACACCA AATATCGACC GTATTGCTGA TGAAGGCGCT TTGTTTACCC ATCATTATGC ACAACAAAGT TGTACTGCTG GCCGTGCTTC TTTCATTTTA GGTCAAGAAC CCTTCAGAAC CGGTTTATTA ACTATTGGTA TGCCAGGTTC AACACACGGT ATTCCCGATT GGACACCTAC CATTGCTGAT CTTCTAAAAG AAAAAGGTTA CATGACTGCG CAATTTGGTA AAAACCATTT AGGTGATCAA GATAAACACT TACCGACTAA TCATGGTTTT GATGAGTTTT TTGGTAATTT ATATCATTTG AATGCCGAAG AAGAGCCTGA AACTTATTAT TATCCTAAAG ATAAAGAATT TCATAAAAAA TATGGTCCTC GCGGTGTTAT CCATTCATTC GCTGATGGAA AAATAGAAAA TACAGGTTCT ATGACGCGTA AACGCATGGA AACAGCTGAT GGAGAGTTTT TAGCGGGTAC CTTGAAGTTT ATTGATAAAG CGCATAAAGC CAAAAAGCCT TTCTTTATCT GGCATAGCTC AACTCGTATG CATGTATGGA CACGTTTGCA AGAAAAGTAT CGCGGTAAGT CAGGCGTAAG TTTAACGGCT GATGGTATGT TAGAACATGA TGATCAAGTG GGTATATTAC TTGATAAATT AGACGATTTA AAAATTGCAG ATAATACCAT TGTTATTTAT TCAACCGACA ATGGTGCAGA AAAATTTACT TGGCCTGATG GTGGTACATC ACCATTTAGA GGCGAAAAAG GAACGACAAC AGAAGGCGGT ATGCGTGTTC CTCAACTCGT TCGCTGGCCT GGTACAATCA AGGCAGGCAG TAAATTTAAT AACATGATGT CACATGAAGA TTGGATGCCA ACACTATTAG CGGCAGCGGG TGAGCCAAAC ATAGTTAACA AGCTTAAAAA AGGTTACAAA GCTAACGGTA AAAAATGGAA AATTCATCCT GATGGTCATA ACTTCTTACC TTTCTTTAAA GGCCAAGAAA AAGCATCTCC GCGCACGAGT AAATTATATT TCAATGCTGC CGGTGATTTG AATGCTGTAC GTTGGAATGA ATGGAAAATT GCCTTTGCAG AAGAAGAAGG CGGAATTAGC ACTGCATACC GTAAAGTCCC TGCATGGCCT ACCATTACCA ACTTACATGC AGATCCCTTT GAAACGGCTG CAAAAGAGTC AGGAATGTAC TTACGTTGGT ATGCGGATAA CATGTGGTTA TTTGTCCCGG CACAACAACA AGTTGCACAG TTCATGTCAA CTATTGACAA ATATCCTTTC CAAGAAGGTA GTAGTTTAAG TGCGAGTAAT ATTGGTTATA AAAGCATTAG AACACAGGCT GCACTAAAAA AAATACAACA ACTAAGTCCT AACCGATAA
|
Protein sequence | MEMNNRLKKL ALGIGVLAIA TSAAATTNKA KPNVLAIWGD DIGYYNISAY NQGMMGYQTP NIDRIADEGA LFTHHYAQQS CTAGRASFIL GQEPFRTGLL TIGMPGSTHG IPDWTPTIAD LLKEKGYMTA QFGKNHLGDQ DKHLPTNHGF DEFFGNLYHL NAEEEPETYY YPKDKEFHKK YGPRGVIHSF ADGKIENTGS MTRKRMETAD GEFLAGTLKF IDKAHKAKKP FFIWHSSTRM HVWTRLQEKY RGKSGVSLTA DGMLEHDDQV GILLDKLDDL KIADNTIVIY STDNGAEKFT WPDGGTSPFR GEKGTTTEGG MRVPQLVRWP GTIKAGSKFN NMMSHEDWMP TLLAAAGEPN IVNKLKKGYK ANGKKWKIHP DGHNFLPFFK GQEKASPRTS KLYFNAAGDL NAVRWNEWKI AFAEEEGGIS TAYRKVPAWP TITNLHADPF ETAAKESGMY LRWYADNMWL FVPAQQQVAQ FMSTIDKYPF QEGSSLSASN IGYKSIRTQA ALKKIQQLSP NR
|
| |
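The gene sequence above is displayed in space-separated groups of 10 bases, so it needs to be joined before any computation. The following is a minimal sketch, with hypothetical helper names, of how one might strip the grouping, compute a GC percentage comparable to the GC content field, and run a narrow sanity check for a complete CDS. The check only accepts an ATG start (as in this record), although other start codons are possible under translation table 11.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the bases."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Narrow check: length divisible by 3, ATG start, standard stop codon at the end."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Illustration on the first two displayed groups of the gene sequence only.
fragment = clean_sequence("ATGGAAATGA ATAATCGGCT")
print(len(fragment), gc_percent(fragment))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_percent` gives a value that can be compared with the rounded GC content listed in the record.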