Gene VEA_000311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVEA_000311 
Symbol 
ID8558616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio sp. Ex25 
KingdomBacteria 
Replicon accessionNC_013457 
Strand
Start bp335242 
End bp336768 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content45% 
IMG OID646407976 
Productarylsulfatase 
Protein accessionYP_003287464 
Protein GI262395611 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.63107 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATG CGACCAAAAC GAGCAGTAAA AAACTCGTAC TTAACGCATG TACGTTAGCA 
TTAGGCGCTG CATCTGCAGT TGCACACGCA GCGGACAAAC CCAACATTCT TGTCATCTTT
GGTGATGATG TTGGTTACTG GAACCTTAGT ACCTACAACC AAGGCATGAT GGCGTACAAC
ACGCCGAACA TCGATAGCAT TGCTAAAGAA GGTGCGAAGT TCACTAACTT CTACGCACAG
CAAAGTTCGA CAGCAGGCCG TTCTGCATTC ATTACTGGTC AAATGCCAAA ACGTACTGGT
CTATCAAAAG TGGGCTTACC AGGCGCTCCG CAAGGTATCT CGGAAAAAGA TCCAACGATT
GCAACCGTGC TTAAGCAAAT GGGTTATGCA ACCGGACAAT TTGGTAAGAA CCATTTAGGC
GACCGAGACG AACACCTTCC TACCAACCAC GGTTTTGATG AGTTTTTCGG CAACCTATAT
CACTTGAACG CGGAAGAAGA GCCAGAGAAC GTGGATTATC CTAAAGATCC AGAGTTCAAG
AAGAAATTTG GCCCTCGTGG CGTAATCCAC TCTTACGCTG ACGGCAAAAT TGAAGATACC
GGTCCACTTA CGCGTAAGCG TATGGAAAAT GTTGACGGTG AATTCCTTGA TGCCGCGGAA
ACCTTCATAG AAAAACAAGT AAAAGCGGAT AAGCCGTTCT TTACTTGGTT TAACACCACT
CGCATGCACA ACTTCACGCA TGTTCCTGAA GAATACCAAG GTAAAACAGG TGCTGGTTTC
TACGCGGATG GTGTCAAGCA GCACGATGAT CAAATCGGTC AGCTACTTAA TAAGATCAAA
GAGTTGGGTG TAGATGACAA TACAATCATT GTTTACACCA CTGACAACGG TCCTATGGTT
GATTTGTGGC CTGATGCAGG GATGACACCA TTCCGCTCAG AGAAAAACAC TGGTTGGGAA
GGCAGTTTCC GTGTGCCGAT GTTGATTAAA TGGCCAGGAA ACATCGAAGC AGGTAAAACG
TTCAACGGCA TGATGTCATT AGAGGATTTC TTCCCGACAT TGGTTGCTGC TGCGGGTGAC
ACCAAAGTTA AAGATGAATT ACTAAAAGGT AAGAAAGTTG GTGATATGGA CTATAAAGTT
CACCTAGATG GTTACAACCA ACTGCCATAT CTAACGGGTA AGTCAGACAA ATCTGCTCGT
AACGAATTCG TTTACTGGAG CGATGACGGC GATCTTGTGG CATTACGTCA AGGAAAGTAC
AAGTTCCACT TTATGATTCA AGAGAATGAA ACGGGCATGG ATGTATGGCG TAAGCCTTTC
ACTAAACTTC GTGTTCCATT GATCTTCGAC TTGAGTATCG ATCCATTCGA ACGTGGTGAC
CAAGGTATGG GTTACTCTCG TTGGATGTAC GAGCGTTCAT TCCTAATGAT GCCAGCGGTA
GAGAAAGTGA AAGAGGTAAT GGGTACGTTT AAAGAGTTCC CACCTCGCAT GGAAGCAGGT
TCATTCGTAC CTAAGTCTTC TAAGTAA
 
Protein sequence
MKNATKTSSK KLVLNACTLA LGAASAVAHA ADKPNILVIF GDDVGYWNLS TYNQGMMAYN 
TPNIDSIAKE GAKFTNFYAQ QSSTAGRSAF ITGQMPKRTG LSKVGLPGAP QGISEKDPTI
ATVLKQMGYA TGQFGKNHLG DRDEHLPTNH GFDEFFGNLY HLNAEEEPEN VDYPKDPEFK
KKFGPRGVIH SYADGKIEDT GPLTRKRMEN VDGEFLDAAE TFIEKQVKAD KPFFTWFNTT
RMHNFTHVPE EYQGKTGAGF YADGVKQHDD QIGQLLNKIK ELGVDDNTII VYTTDNGPMV
DLWPDAGMTP FRSEKNTGWE GSFRVPMLIK WPGNIEAGKT FNGMMSLEDF FPTLVAAAGD
TKVKDELLKG KKVGDMDYKV HLDGYNQLPY LTGKSDKSAR NEFVYWSDDG DLVALRQGKY
KFHFMIQENE TGMDVWRKPF TKLRVPLIFD LSIDPFERGD QGMGYSRWMY ERSFLMMPAV
EKVKEVMGTF KEFPPRMEAG SFVPKSSK