Gene Information |
Locus tag | Dvul_2553 |
Symbol | |
ID | 4664189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 2980469 |
End bp | 2982031 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639820802 |
Product | sulfatase |
Protein accession | YP_967996 |
Protein GI | 120603596 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
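The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2980469
END_BP = 2982031
GENE_LENGTH_BP = 1563
PROTEIN_LENGTH_AA = 520


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2982031 - 2980469 + 1 gives the 1563 bp gene length, and 1563 / 3 - 1 gives the 520 aa protein length.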
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.083937 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAAGA AAGACAACAT CAAGAACGTC ATCTTCATCA TGCTCGACAC GTTGCAGTTC AACTATCTTG GCTGCTACGG CAACGATGTG GTGAAGACGC CGAACCTCGA CAAGTTCGCC CAAAACGGCT TCCTGTTCGA GAACGCCTAC AGCGAAGGGC TGCCCACCAT CCCCGTGCGG CGCGCCATCA TGACCGGCCG TTTCACGCTG CCCTACAGCG GCTGGCGTCC GCTGACCACC GAAGACACCT CCATCACGGA CATGCTCTGG TGCCGTGAGG TGCAGACAGC GCTGGTGTAT GACACCCCCC CCATGCGTCT TCCCAAGTAC GGCTACTCTC GCGGGTTCGA CTACGTACGT TTCTGCAACG GTCACGAACT GGACCACGAG ACCTTCTGCA ACGTGCCGCT TGACGAAGAG TTCAAGGCCG AGGACTACCT TTCGCCCAAC TGGCTGAAGA AGGATGAGAA CGGCGAATAC GACTCGTCGA GCAAGTCGCT CATCCGCGAG ACGGAATGCT ACCTGCGCCA GCGCCAGTTC TGGGCTTCCG ATGCGGACAA CTACGCCTCC GTGGTCATCT CCGAGGCCGA CAACTGGCTG AAGATGAAGC GCAACCCGCA GCGCCCCTTC TTCCTGTGGC TCGACTCGTT CGACCCGCAT GAGCCCTGGG ACCCGCCGTC AATGTGGGAG AAGAAGCCCT GCCCCTACGA CCCCGACTAC ACGGGCAACC CGCTGCTGCT CGCCCCGTGG ACCGAAATCG ACGGTGTGAT GACCGAAGAG GAATGCGCCC ACATCCGCGC CCTCTACGCG GAGAAGGTGA CCCTCGTCGA CAAATGGCTC GGCAAACTGT TCGACTCGCT CAAGGCGCAG GGGCTGTGGG ACGACACCAT GATCGTCATC ACCTCCGACC ACGGACAGCC CATGGGCAGC GGCGAACACG GTCATGGACT GATGCGCAAA TGCCGTCCGT GGCCCTACGA AGAACTGGTG CACGTGCCCC TGCTCATCCG GGTCCCCGGT CTTGAGGGTG GCAAGCGCAT ATCGTCGTTC GTACAGAACG TGGACATCAC CGCCACGGTG GTCGATGGGC TGGGCATGGG GCTCGAAGCC CTTGCCGAGG CCGGGCACGA AGGCATCACC ACCTATGCAG GCGACGACAT GCACGGTATC AGCCTGTTGC CCGTCATGCG CGGCGAGACC GACAAGGTGC GCGATTTCGC CATCGCGGGC TATTACGGCA TGTCATGGTC CATCATCGAT CACGACTACA GCTACATCCA CTGGCTGCAG AGGGAGATCG ACACGGATTC CATGAACAAG GTCTTCTACG ACGGCTCCGG CAAGGGCGGC AACGCCGGTG CCCAGTCTGC CAAGCTGGAG ATGAAGGAAG AGATGTGGAC CTGCGTGCCG GGGGCCGAAG TATCCGTCCC CCACACCGAC GAGCTGTACG ACAGGCGGAA CGACCAGTTC CAGATGAAGA ACCTCATCGG TGAGCAGCCG GAAAAGGCCA AGGAACTTCT GCAGAAGCTC AAGCTCTTCA TCGGCGAGCT GCGCACGTCG TAG
|
Protein sequence | MSKKDNIKNV IFIMLDTLQF NYLGCYGNDV VKTPNLDKFA QNGFLFENAY SEGLPTIPVR RAIMTGRFTL PYSGWRPLTT EDTSITDMLW CREVQTALVY DTPPMRLPKY GYSRGFDYVR FCNGHELDHE TFCNVPLDEE FKAEDYLSPN WLKKDENGEY DSSSKSLIRE TECYLRQRQF WASDADNYAS VVISEADNWL KMKRNPQRPF FLWLDSFDPH EPWDPPSMWE KKPCPYDPDY TGNPLLLAPW TEIDGVMTEE ECAHIRALYA EKVTLVDKWL GKLFDSLKAQ GLWDDTMIVI TSDHGQPMGS GEHGHGLMRK CRPWPYEELV HVPLLIRVPG LEGGKRISSF VQNVDITATV VDGLGMGLEA LAEAGHEGIT TYAGDDMHGI SLLPVMRGET DKVRDFAIAG YYGMSWSIID HDYSYIHWLQ REIDTDSMNK VFYDGSGKGG NAGAQSAKLE MKEEMWTCVP GAEVSVPHTD ELYDRRNDQF QMKNLIGEQP EKAKELLQKL KLFIGELRTS
|
| |
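The gene and protein sequences above are related through translation table 11. As a minimal sketch, the code below builds that codon table in plain Python, strips the 10-base grouping spaces, translates up to the first stop codon, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_content`) are hypothetical and not part of any database tool. The sketch does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    print(translate("ATGTCAAAGA AAGACAACAT CAAGAACGTC"))
```

Running `translate` on the full gene sequence and comparing the result with the listed protein sequence (after `clean`) is one way to confirm that the two fields agree. `gc_content` can be compared with the reported GC content in the same way.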