Gene VEA_003601 details


Gene Information

Locus tag: VEA_003601
Symbol:
ID: 8557334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio sp. Ex25
Kingdom: Bacteria
Replicon accession: NC_013456
Strand:
Start bp: 2201613
End bp: 2203178
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 47%
IMG OID: 646406696
Product: arylsulfatase
Protein accession: YP_003286226
Protein GI: 262394372
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
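
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive (which matches the lengths listed here) and that the final codon is a stop codon. The function name `lengths_from_coords` is hypothetical and is not part of any database API.

```python
def lengths_from_coords(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the last codon is the stop codon
    return gene_len, protein_len


print(lengths_from_coords(2201613, 2203178))  # (1566, 521)
```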


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAACC AATTGAGAAA AATCGCAGTA GGCGTCGGGC TGTTAGCAAC GTCCAGCGCA 
GCGATGGCGG CAGACAAACC CAACATACTT GCAATTTTCG GTGACGATGT CGGTTACTGG
AACATCAGCG CATATAACCA AGGTATGATG CGTTACAAAA CCCCTAACAT CGATCGTATC
GCGAACGAAG GGGCTTTGTT TACAGACCAT TACGGCCAAC AATCTTGTAC TGCCGGTCGA
GCAGCATTCA TTACTGGCGA AGAACCATTT CGCACTGGCT TGCTGACGAT TGGTATGCCG
GGTTCCGACC ATGGGATTCC AGATTGGGCA CCAACTATTG CTGACTTACT AAAAGACCAA
GGTTATATGA CAGCTCAGTT CGGTAAAAAC CACTTGGGCG ATCAAGATAA GCACCTACCT
ACCAATCATG GTTTTGATGA ATTCTTTGGT AACTTGTACC ACCTTAATGC AGAAGAAGAG
CCCGAAACCT ACTATTACCC TAAAGACCCA GAGTTCCGTA AAAACTATGG TCCTCGTGGT
GTAATTAAAT CTTATGCCGA CGGTAAGATT GAAGATACTG GCCCAATGAC GCGTAAACGC
ATGGAACACG CGGACGAAGA GTTCCTCGAA AGCACTTTAT CGTTTATGGA GAAAGCGGTT
AAAGCAGATA AGCCTTTCTT CATCTGGCAT AACTCTACGC GCATGCACGT GTGGACCCGA
TTACAAGAGA AGTACCGAGG CAAATCCGGT GTTAGCATTT ACGCTGATGG CATGTTAGAG
CATGACGATC AAGTCGGCGT ATTGCTCGAC AAACTCGATG AGTTAGGCGT TGCTGACAAC
ACCATCGTTA TTTATACAAC AGATAACGGC GCAGAGACCA TGACTTGGCC TGATGGGGGC
GCAACCCCAT TCCACGGTGA GAAAGGCACG ACTTGGGAAG GCGGTATGCG TGTGCCTCAA
CTTGTGCGTT GGCCTGGTGT CATTAAGCCA GGAACGAAAA TCAACGAAAT GATGGCTCAT
CAAGACTGGT TGCCAACATT GTTGGCGGCA GCTGGCGTTC CTGACGTGAA AGAAAAGCTT
GCTGAAGGTT ACAAAGCAAA TGGTAAAGAC TGGCGTGTCC ATATAGACGG CTACAACTTC
ATGCCATACT TCGAGGGAAA AGAAGAGAAA GGACCACGTG AGTCTCTGCT CTACTTCACC
TCAAATGGTG AACTAAACGC AGTACGCTGG AATGACTGGA AACTTCACTT TGCAACTCTA
GAAGGAAACC TAAGCGACGC CGTGCGATCT GTACCAAACT GGCCTAAGAT CATTCACTTA
CGCGCTGATC CATTTGAAAA AGGTCCTAGC GAGTCACAAA TGTATCTACG TTGGATGGCA
GACAACATGT GGTTGTTTGT ACCAATTCAA GACGTTGTTG GCGACTTCTT TAAAACGTTA
CCTGAATACC CAATGCAACG CGGTACAACT ATGAACCCGG CTTCTTTAAA TTACGACTCT
TTGCAGATGC AAGAAAAGAT GCAGCAGTTA GAAATGCTTA AACAGGAAGT GAATAAAGTT
AAGTAA
 
Protein sequence
MANQLRKIAV GVGLLATSSA AMAADKPNIL AIFGDDVGYW NISAYNQGMM RYKTPNIDRI 
ANEGALFTDH YGQQSCTAGR AAFITGEEPF RTGLLTIGMP GSDHGIPDWA PTIADLLKDQ
GYMTAQFGKN HLGDQDKHLP TNHGFDEFFG NLYHLNAEEE PETYYYPKDP EFRKNYGPRG
VIKSYADGKI EDTGPMTRKR MEHADEEFLE STLSFMEKAV KADKPFFIWH NSTRMHVWTR
LQEKYRGKSG VSIYADGMLE HDDQVGVLLD KLDELGVADN TIVIYTTDNG AETMTWPDGG
ATPFHGEKGT TWEGGMRVPQ LVRWPGVIKP GTKINEMMAH QDWLPTLLAA AGVPDVKEKL
AEGYKANGKD WRVHIDGYNF MPYFEGKEEK GPRESLLYFT SNGELNAVRW NDWKLHFATL
EGNLSDAVRS VPNWPKIIHL RADPFEKGPS ESQMYLRWMA DNMWLFVPIQ DVVGDFFKTL
PEYPMQRGTT MNPASLNYDS LQMQEKMQQL EMLKQEVNKV K
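
The protein sequence can be checked against the gene sequence by removing the spacing between the 10-character blocks and translating codon by codon. The following is a minimal sketch, not the database's own method. It uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) mapped to one-letter amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCAAACC AATTGAGAAA AATCGCAGTA GGCGTCGGGC TGTTAGCAAC GTCCAGCGCA"
print(translate(first_line))  # MANQLRKIAVGVGLLATSSA
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content listed in the record.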