Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dhaf_3558 |
Symbol | |
ID | 7260576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfitobacterium hafniense DCB-2 |
Kingdom | Bacteria |
Replicon accession | NC_011830 |
Strand | - |
Start bp | 3781272 |
End bp | 3782564 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643563481 |
Product | peptidase U32 |
Protein accession | YP_002460012 |
Protein GI | 219669577 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000000993466 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGAGC AAAACCAATC CAGGAAACCT GAACTCCTTG CGCCGGCAGG AGATTATGAA AAGTTGAAAT TCGCCATCGC CTATGGGGCA GATGCGGTCT ATATGGGAGG GCCTGCCTTC GGGCTGCGAG CTTATGCGGG GAATTTTACC ATGGAGCAGA TGGCTGAGGC TATTCAGTAT ACCCACCATG CCGGCCGCAA GCTCTATGTG ACAGTAAACA TCTTTGCCCA TGAGCAGGAT TTTGAAGAAA TGGCTGCGTA TTTGAAACAA TTGGAGTCCT TAGGTGCCGA TGGAGCCATT GTCTCAGACC CCGGCATCAT CGCCTTGGCT CAAGAAGCGG CCCCTAAACT TCCTCTGCAC CTCAGTACCC AGATGAACAG CACTAATTCT TACAGCATAA ACTTTTGGCT GAAGCAGGGA TTGGAGCGGA TTGTTCTGGC CCGGGAGCTG ACCTTGGCAG AGATCCGAGC AGTACGGGAG AAGGTACCCG GGGAGCTGGA AATGTTCATT CATGGAGCCA TGTGCATGTC CTATTCCGGA AGATGTCTGC TCTCCAATTA TCTCACCGGA AGAGATGCCA ACCGGGGGGA ATGCACCCAG CCCTGCCGTT GGGGCTATGG CCTAGTGGAA GAGAAACGCC CCGGTCAGGT ATTTCCCGTA GAGGAAGATG AGCGGGGCAC TTATATCTTT AATTCCCATG ATCTCTGTTT GCTTCCTTAT CTGCCCATGC TGAAGCCACT GGGTATTGAC AGCTATAAGA TTGAAGGGCG TATGAAGAGC ATCCATTATG TCTCCAGCAC CGTGAAGGTT TATCGGGAGG CTATCGATAC CCTTTGGGAA CAAGGGGAAG AGGCCTTTAA AGCCAAACTC TCCAGCTGGC TGGAAGAAAT GGACAAGGTC AGTCATCGGG ATTATTCACC GGGATTCCTC TTTGGCAAGC CTGGAGCAGA ATCCCATAAT ATTGAGAGCT CCAACTATAT CCGGGATTAT GAATTTGTCG CCTTTGGCTT AGCTGCAGAT AATCGGGAAC ATCCCCAAAT TCCGACTTTA GTCAAAGATG AATTCAGTCA AGGATATTGG GTGGAACAAC GCTATCATTT CCAAAAAGGG GAGCTTATTG AAGTATTCTC ACCTCATGAA GAACCTTGGA CCTTTGAGGT CAAAGGCATT CACACGGTAG AAGGGGAAGA AGTGGACGTT GCCCGTCATG CCAAGGAGAT CCTTAAGCTG GAACTGCCCC GGCCTTTGCC CCCTTTTGCT ATCTTGAGGA GAGCGAAGAA GGATAAGAAA TGA
|
Protein sequence | MLEQNQSRKP ELLAPAGDYE KLKFAIAYGA DAVYMGGPAF GLRAYAGNFT MEQMAEAIQY THHAGRKLYV TVNIFAHEQD FEEMAAYLKQ LESLGADGAI VSDPGIIALA QEAAPKLPLH LSTQMNSTNS YSINFWLKQG LERIVLAREL TLAEIRAVRE KVPGELEMFI HGAMCMSYSG RCLLSNYLTG RDANRGECTQ PCRWGYGLVE EKRPGQVFPV EEDERGTYIF NSHDLCLLPY LPMLKPLGID SYKIEGRMKS IHYVSSTVKV YREAIDTLWE QGEEAFKAKL SSWLEEMDKV SHRDYSPGFL FGKPGAESHN IESSNYIRDY EFVAFGLAAD NREHPQIPTL VKDEFSQGYW VEQRYHFQKG ELIEVFSPHE EPWTFEVKGI HTVEGEEVDV ARHAKEILKL ELPRPLPPFA ILRRAKKDKK
|
| |