Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dhaf_3734 |
Symbol | |
ID | 7260755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfitobacterium hafniense DCB-2 |
Kingdom | Bacteria |
Replicon accession | NC_011830 |
Strand | - |
Start bp | 3969430 |
End bp | 3971316 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643563658 |
Product | Arylsulfotransferase |
Protein accession | YP_002460186 |
Protein GI | 219669751 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCGA TCAAAAGTGA ACAGATTCCC CATATTATCC ATAGACAAAA GGATTTGGAA GAAGCTTTTC TGGCTGAATT CTCAGCGGGT CACTATACGC TGGAGAATCC GCTGGTTAAG CTCAACCCTT ATGATATTTG CCCCTTAACG GCCATGGTCT TATTTGAAAC ACCCGTAGCC ACCGAGGCAA CCATAATCGT TCGTGGCAAA GAGCACCCCG GAGATATCCG GCATACCTTC CCCGCCGATA AAAAACATAT TCTGCCGGTT TATGGTCTCT ATGCCGATTA CGAAAATAAG ATCGAAATCG TCCTGGCCAA TGGGCAGAAG AACACCATAA CCCTTAAGAC CGAGCCCCTT CATCCCGATG TCCCGGTGGC CACCTCGATC AAGACCACCC CGGAATACAT GGGCAACAAC CTGATGTTTC TGACGGCAGC GATGAAGGCT ATGCCTGTAG GCTATGACTA TGCCGGAGAA GTTCGCTGGT ATGCCACAAG GAACTTTGCC TTTGATCTCA AGCGTATGCC CAATGGACAT ATTCTCATTG GTACGGAGCG TTTGGTCAAA TTGCCCTATT TCACCACAGG TCTATATGAA ATGGCCTTTA GTGGGAAGAT ATTCAAAGAA TACCGTCTAT CCGGCGGATA CCACCATGAT CAATTTGTCA TGGAAGATGG CAACATTCTG GTGCTTACTT TTGATTTCTA CTCAGGTACG GTTGAAGATA TGTGCGTGCT CCTGGATGCC ATAACGGGAG AAATTCTCAA GTCATGGGAC TATAAAAGGG TCCTGCCCCA GGATGTAGCC GGTTCCGGAA GCCAGGATGC CCACGATTGG TTTCATAATA ACGCCGTTTG GTACGATAAG AAGACACACA GCTTAAGCTT CTCCGGTCGT CACCAGGATG TGGTGATCAA CCTTGATTAT GATACAGGTG AGTTAAACTG GATCATTGGG GATCCTGAAG GATGGCCCCA GGACATGGTG GACAAATATT TCTTTACCCC GGTTGGGGAA GGGGAATTTG ACTGGCAGTA TGAGCAGCAT GCTTGCGTCG TTTTACCTGA TGGGGATATC ATGCTCTTTG ATAACGGCCA CTTCAGAGCG AAGAAAAAAG AGAATTACTT GCCCAACGGC CGGAATTTCA GCCGTGGTGT GAGGTACCGT ATTGATACCG AAAAGATGAC CATAGAACAA GTATGGCAAT ATGGAAAAGA GCGGGGCGCG GAGTTCTTCT CTCCCTACAT TTGCAATGTG GAGTATTACA ATGAAGGCCA TTACCTGGTT CACTCCGGCG GCATCGGCTA TGAAAACGGT GAAACCTGCG AAGGTATGGC AGTTATGAAA GTCCTGCAAC CGGAGTTTAA GGATAGTGTG TTTACCTTCA ATTCCATTAC CTGTGAGCTT AAAGATGACG TGCTGATGTA TGAGTTGCAA GTACCGGCCA ATTGTTACCG GGCTGAAAAA TTGCCCCTCT ACTATGCCCA CGAAACGGCT GAATTAGGTG CGGGCGAAAT ACTGGGCAAT TTAATTGAGA CCCAGGAGAC AAAGATGAAG ATCAAGGCTG TGGAGACAGG TGAAAGAGTG CCGGATCATT ATGAGGCATC CATCACAGAA GAAGAGGATC GGGTTCTCTT TAACGCCATC TTCGAGGCCG GGGAAATGGC TCAGCTGCTT TTGGTGGACG GAGACGGCGG GGTAAAGAGA TATCCTGTCA ATACTGTGCC TCAGGCCTTC CAAGCCATGT GTGTAGGGAC GTTCCAGAAA GCTGACCCCC GCAATATCGA TGTTTATATC AACAAGACCG GATTATCCGG AAAATATCAA GTAAAGCTCA TCGCAGAAGA AAAACTCTAT GAGACCGGAG TGTCTATTAC AGCTTAA
|
Protein sequence | MNPIKSEQIP HIIHRQKDLE EAFLAEFSAG HYTLENPLVK LNPYDICPLT AMVLFETPVA TEATIIVRGK EHPGDIRHTF PADKKHILPV YGLYADYENK IEIVLANGQK NTITLKTEPL HPDVPVATSI KTTPEYMGNN LMFLTAAMKA MPVGYDYAGE VRWYATRNFA FDLKRMPNGH ILIGTERLVK LPYFTTGLYE MAFSGKIFKE YRLSGGYHHD QFVMEDGNIL VLTFDFYSGT VEDMCVLLDA ITGEILKSWD YKRVLPQDVA GSGSQDAHDW FHNNAVWYDK KTHSLSFSGR HQDVVINLDY DTGELNWIIG DPEGWPQDMV DKYFFTPVGE GEFDWQYEQH ACVVLPDGDI MLFDNGHFRA KKKENYLPNG RNFSRGVRYR IDTEKMTIEQ VWQYGKERGA EFFSPYICNV EYYNEGHYLV HSGGIGYENG ETCEGMAVMK VLQPEFKDSV FTFNSITCEL KDDVLMYELQ VPANCYRAEK LPLYYAHETA ELGAGEILGN LIETQETKMK IKAVETGERV PDHYEASITE EEDRVLFNAI FEAGEMAQLL LVDGDGGVKR YPVNTVPQAF QAMCVGTFQK ADPRNIDVYI NKTGLSGKYQ VKLIAEEKLY ETGVSITA
|
| |