Gene Information |
Locus tag | VEA_003601 |
Symbol | |
ID | 8557334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio sp. Ex25 |
Kingdom | Bacteria |
Replicon accession | NC_013456 |
Strand | + |
Start bp | 2201613 |
End bp | 2203178 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 646406696 |
Product | arylsulfatase |
Protein accession | YP_003286226 |
Protein GI | 262394372 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
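The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, unspliced (as the record states) and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
START_BP = 2201613
END_BP = 2203178


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a complete CDS: one codon per residue,
    minus the terminal stop codon (assumed to be present)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1566, matching "Gene Length"
    print(protein_length(length))  # 521, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.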
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAACC AATTGAGAAA AATCGCAGTA GGCGTCGGGC TGTTAGCAAC GTCCAGCGCA GCGATGGCGG CAGACAAACC CAACATACTT GCAATTTTCG GTGACGATGT CGGTTACTGG AACATCAGCG CATATAACCA AGGTATGATG CGTTACAAAA CCCCTAACAT CGATCGTATC GCGAACGAAG GGGCTTTGTT TACAGACCAT TACGGCCAAC AATCTTGTAC TGCCGGTCGA GCAGCATTCA TTACTGGCGA AGAACCATTT CGCACTGGCT TGCTGACGAT TGGTATGCCG GGTTCCGACC ATGGGATTCC AGATTGGGCA CCAACTATTG CTGACTTACT AAAAGACCAA GGTTATATGA CAGCTCAGTT CGGTAAAAAC CACTTGGGCG ATCAAGATAA GCACCTACCT ACCAATCATG GTTTTGATGA ATTCTTTGGT AACTTGTACC ACCTTAATGC AGAAGAAGAG CCCGAAACCT ACTATTACCC TAAAGACCCA GAGTTCCGTA AAAACTATGG TCCTCGTGGT GTAATTAAAT CTTATGCCGA CGGTAAGATT GAAGATACTG GCCCAATGAC GCGTAAACGC ATGGAACACG CGGACGAAGA GTTCCTCGAA AGCACTTTAT CGTTTATGGA GAAAGCGGTT AAAGCAGATA AGCCTTTCTT CATCTGGCAT AACTCTACGC GCATGCACGT GTGGACCCGA TTACAAGAGA AGTACCGAGG CAAATCCGGT GTTAGCATTT ACGCTGATGG CATGTTAGAG CATGACGATC AAGTCGGCGT ATTGCTCGAC AAACTCGATG AGTTAGGCGT TGCTGACAAC ACCATCGTTA TTTATACAAC AGATAACGGC GCAGAGACCA TGACTTGGCC TGATGGGGGC GCAACCCCAT TCCACGGTGA GAAAGGCACG ACTTGGGAAG GCGGTATGCG TGTGCCTCAA CTTGTGCGTT GGCCTGGTGT CATTAAGCCA GGAACGAAAA TCAACGAAAT GATGGCTCAT CAAGACTGGT TGCCAACATT GTTGGCGGCA GCTGGCGTTC CTGACGTGAA AGAAAAGCTT GCTGAAGGTT ACAAAGCAAA TGGTAAAGAC TGGCGTGTCC ATATAGACGG CTACAACTTC ATGCCATACT TCGAGGGAAA AGAAGAGAAA GGACCACGTG AGTCTCTGCT CTACTTCACC TCAAATGGTG AACTAAACGC AGTACGCTGG AATGACTGGA AACTTCACTT TGCAACTCTA GAAGGAAACC TAAGCGACGC CGTGCGATCT GTACCAAACT GGCCTAAGAT CATTCACTTA CGCGCTGATC CATTTGAAAA AGGTCCTAGC GAGTCACAAA TGTATCTACG TTGGATGGCA GACAACATGT GGTTGTTTGT ACCAATTCAA GACGTTGTTG GCGACTTCTT TAAAACGTTA CCTGAATACC CAATGCAACG CGGTACAACT ATGAACCCGG CTTCTTTAAA TTACGACTCT TTGCAGATGC AAGAAAAGAT GCAGCAGTTA GAAATGCTTA AACAGGAAGT GAATAAAGTT AAGTAA
|
Protein sequence | MANQLRKIAV GVGLLATSSA AMAADKPNIL AIFGDDVGYW NISAYNQGMM RYKTPNIDRI ANEGALFTDH YGQQSCTAGR AAFITGEEPF RTGLLTIGMP GSDHGIPDWA PTIADLLKDQ GYMTAQFGKN HLGDQDKHLP TNHGFDEFFG NLYHLNAEEE PETYYYPKDP EFRKNYGPRG VIKSYADGKI EDTGPMTRKR MEHADEEFLE STLSFMEKAV KADKPFFIWH NSTRMHVWTR LQEKYRGKSG VSIYADGMLE HDDQVGVLLD KLDELGVADN TIVIYTTDNG AETMTWPDGG ATPFHGEKGT TWEGGMRVPQ LVRWPGVIKP GTKINEMMAH QDWLPTLLAA AGVPDVKEKL AEGYKANGKD WRVHIDGYNF MPYFEGKEEK GPRESLLYFT SNGELNAVRW NDWKLHFATL EGNLSDAVRS VPNWPKIIHL RADPFEKGPS ESQMYLRWMA DNMWLFVPIQ DVVGDFFKTL PEYPMQRGTT MNPASLNYDS LQMQEKMQQL EMLKQEVNKV K
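The relationship between the two sequence fields can be sketched in plain Python. This is a minimal illustration, not the pipeline that produced the record: `clean`, `translate_cds` and `gc_content` are hypothetical helper names, the 10-base spacing is assumed to be display formatting only, and alternative start codons are not handled. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, so the standard codon layout is used here.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Assumes an unambiguous A/C/G/T sequence that begins in frame.
    """
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in the record.
    prefix = "ATGGCAAACC AATTGAGAAA AATCGCAGTA"
    print(translate_cds(prefix))  # MANQLRKIAV
```

The output for the 30-base prefix matches the first block of the protein sequence. Passing the complete gene sequence to `translate_cds` and `gc_content` would allow a comparison with the Protein sequence and GC content fields.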