Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VEA_000313 |
Symbol | |
ID | 8558618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio sp. Ex25 |
Kingdom | Bacteria |
Replicon accession | NC_013457 |
Strand | - |
Start bp | 337889 |
End bp | 339436 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 646407978 |
Product | arylsulfatase |
Protein accession | YP_003287466 |
Protein GI | 262395613 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCGA AATATGGCGT AAAACGTCGA TTAGCGATTC TTGCCGCCGC CTTAATTGGC GCGACAAGTA GCAGTATCGC GGCAGAGAAA CCTAACATTC TCGTCATTTG GGGTGATGAT ATTGGCCAGT CCAATCTCAG TGCTTACACC TTCGGCTTGA TGGGATACAA AACACCAAAC ATAGACAGCA TCGCGAAAGA GGGCATGATG TTCACGGATT ATTACGGTGA ACAATCTTGT ACTGCGGGCC GTTCTACTTT TATCACCGGT CAGACTGTCC TTAGAACGGG TTTAAGTAAA GTAGGGCTTC CTGGCGCAGA TCTTGGCTTG AAGGAAGAAG ATGCCACCAT TGCCGAAATG TTGAAGCCGA TGGGCTATAT GACGGGTCAA TTTGGTAAAA ATCATTTAGG CGACAAAGAC GAACACCTGC CTACTAACCA CGGCTTCGAT GAATTTTTTG GCAACCTTTA TCACTTGAAC GCCGAGGAAG AACCAGAAAA CGTCGATTAC CCTAAAGATC CAGAATTTCG TAAAAAATTT GGTCCACGAG GTGTTATTCG CTCGTACGCT GACGGCAAAA TTGAGGACAC CGGGCCTCTG ACTCGCAAGC GTATGGAAAC CGTGGATGAA GAGACGTTGG ATGCGGCTCT CGATTTTATG GATCGTGCCG TCAAAGCTGA AAAACCTTTC TTCGTTTGGT GGAACGCAAC GCGTATGCAT TTCCGCACCC ACGTGAAAGA AGAGAATTCG GGAAAAACAG GCATCAGTGA GTATGCAGAT GGCATGGTCG AGCATGATAA TCATGTCGGC CAACTACTGA AAAAAGTGGA CGATTTGGGC ATAAAAGACA ACACGATAGT CTTTTATTCA ACCGATAATG GTCCACACAT GAACTCGTGG CCAGATGCAG GTACAACGCC ATTCCGTGGA GAGAAAAATA CCAACTGGGA AGGTGCATAT CGAGTCCCAG CGATGGTACG TTGGCCGGGT AAAATTAAAG CTGGCTCTGT CTCGAATGAT ATCATGCACC ATATGGACTG GATGCCAACT TTCGTTGCTG CCGCGGGTGA TGACGGTATC AAAGAGAAAT TACTCAAAGG CTATAGTGCG AATGGTAAGA AATTCAAAGT GCATCTCGAT GGTTACAACT TCCTGCCATA TTTAACTGGC AAAGAAGAGA AAGCACCGCG TGAAGAGATT TTTTACTTCT CGGATGATGG AGACTTAACC GCACTTCGTT ATAACAAGTG GAAACTGGTT TTCATGGAGC AACGAGCAAA AGGTACGCTG CGCATTTGGG CGGAACCATT CACCACACTT CGAGTTCCTA AGATCTTCAA TTTACGTATG GACCCTTACG AAGTGGCAGA TATCACTTCC AACACCTACT ATGATTGGAT GCTAGACCGT GCTTACATGC TTGTTCCTGC ACAAGCTTAC GTAGGCAGAT TCCTCGAAAC ATTTAAAGAG TTCCCACCAA GGCAAAAAGC GGCAAGCTTC TCACTCGATC AGGTGATGGA GAAATTAAAA GAGAACCCAA ATAAGTAG
|
Protein sequence | MTAKYGVKRR LAILAAALIG ATSSSIAAEK PNILVIWGDD IGQSNLSAYT FGLMGYKTPN IDSIAKEGMM FTDYYGEQSC TAGRSTFITG QTVLRTGLSK VGLPGADLGL KEEDATIAEM LKPMGYMTGQ FGKNHLGDKD EHLPTNHGFD EFFGNLYHLN AEEEPENVDY PKDPEFRKKF GPRGVIRSYA DGKIEDTGPL TRKRMETVDE ETLDAALDFM DRAVKAEKPF FVWWNATRMH FRTHVKEENS GKTGISEYAD GMVEHDNHVG QLLKKVDDLG IKDNTIVFYS TDNGPHMNSW PDAGTTPFRG EKNTNWEGAY RVPAMVRWPG KIKAGSVSND IMHHMDWMPT FVAAAGDDGI KEKLLKGYSA NGKKFKVHLD GYNFLPYLTG KEEKAPREEI FYFSDDGDLT ALRYNKWKLV FMEQRAKGTL RIWAEPFTTL RVPKIFNLRM DPYEVADITS NTYYDWMLDR AYMLVPAQAY VGRFLETFKE FPPRQKAASF SLDQVMEKLK ENPNK
|
| |