Gene Information |
Locus tag | Dole_1030 |
Symbol | |
ID | 5693865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1215813 |
End bp | 1217783 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641263627 |
Product | peptidase U32 |
Protein accession | YP_001528917 |
Protein GI | 158521047 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
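The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not appear in the protein. The record does not state either convention, but both match the reported numbers.

```python
# Values copied from the Gene Information table.
start_bp = 1215813
end_bp = 1217783

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Three bases per codon; assumption: the last codon is a stop and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1971
print(protein_length)  # 656
```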
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACA CAAAAAACCA TAAACCCCAG ATTCTTGCCC CGGCCGGGGG AAAGGCATCG TTTCTGGCGG CCCTGGCTGC CGGCGCCGAC GTGATCTATT GCGGCCTGAA AAGTTTTTCC GCCCGCATGG CGGCAGAAAA CTTTGCGCCC GGTGAACTTC GCGCGCTGAC GGAGCTGGCC CATAAAAAAG GGGTGAAGGT CTTTGTGGCG CTTAATACCC TGGTCCGTCC GGGGGAGATT CCCCAGGTCC GGCAGCTGGT CCACATCCTT GGCAGAGAGG TGGGCGCCGA CGCGCTGATC GTTCAGGATT TGAGTGTTGT GGAACTGGCG AAACAGGCCG GTTTTAAAGG AGAACTGCAC CTTTCCACCC TGGGGGCGGT CACCTTTTCA AAGGCGCTGG GCCTGATTTC CAGTGCCCTG GGTGTCAGCC GGGTGGTGCT GCCCAGGGAG TTTCACATCG ATGAGATCAA GCAGATGGCC CAGTCCTGTC CCCCGGGCAT GAGCCTGGAG GTGTTCGTTC ACGGGGCCCT GTGCTACGGG GTGTCGGGCC GGTGCTACTG GAGCAGTTAC ATGGGCGGCA AAAGCGGGCT GCGGGGTCGG TGCGTGCAGC CCTGCCGCCG GACCTACACC CAGGGCCGGT CCGAGGGCCG CTGGTTTTCC TGTCTTGATT TTTCCGTGGA CGTGCTCACC AAGACCCTGC TGCCCCTGCC GCAGATCACC GCCTGGAAGA TAGAGGGCCG TAAAAAAGGG CCCCACTATG TTTACTACAC CACCACTGCC TATAAAATGC TGCGGGACCA CGGCAACGAC CCGAAGATTA AAAAGGATGC CATGGGCTAT CTTGAACAGG CCCTGGGCAG AAAAACCACC CACTACAACT TTCTGCCCCA GCGGCCCTAT CCGCCGTCAG GCCAGGAGGA GCAGACCGGG TCCGGGCTTC TGGCCGGACG GGTGAAGAGC GACGGCGGCA GGCCGGCCCT CTCCCCCCGC ATGGGGTTGA TCAAGGGGGA CCTGCTGCGT ATTGGTTACG AGGACAAGCC CGGCCACAGC CTGCTTCGGG TGCCGGCCGC CGTACCGGCC AGGGGCCGGC TGGTGTTAAA GGTCCGTGGC GCTGTGCCCG CGGCCGGAAC ACCGGTTTTT CTCATTGACC GCATGGAAGA CGCCCTGGAA ACCATGATCG GGGATCTGGC GGCTGAACTG ACTGACGCCC CCTGCCGGGA AACAACCTCG GCCGCACCGG ACCGGGCGGC CCGGCAGCGA CGGCCGGCAT CCAGGTCGCC GGAAGAGATG ACGGTTTACC GGTCCCTGCC AAGGGGAAGG CAGGCCCACA GTGTGGGCTT CTGGCTCTCT CTTGAGGGAG CCAGAGGTGC CGCCAGGCTG AATGCCCAAC AGTGGCTCTG GCTGCCGCCG GTGGTGTGGC AGGAGGACGC AGACCGGTGG CAGGCGCTGG TGAACCGTAT GGTCAAACAG GGTGCCCGGC GGTTCGTGCT GAACGCGCCC TGGCAGATAT CCCTGTTTGA ACGGACAAGA AATCTGGACC TGTGGGCCGG GCCGTTCTGC AACCAGGCAA ACGGGGTCTC GATTCAGGTA CTGGCAGGGA TGGGTTTTTC CGGTGTTATC GTGAGCCCCG AGCTGGGGAA AGAAGATTAC GCGGTCATTC CGGGCCAGAG CCCGGTCCCC TTGGGAGTGG TTGTTTCGGG CAGCCTGCCC CTGTGCGTGG CCCGTACCCT GCCGGGACCG GTTCGGGAGA AAAAACTGTT TTCAAGCCCC AGGAAGGAAA ATGCCTGGGC CGAGAAGCAC 
AGCGGCCTTG TGTGGTTGTA TCCCGACTGG ATGGTGGATC TGCGGCCCCG GCAGAAAATC CTGGAACAGT ACGGGTATGC GCTCTTTATT CATCTCCACC ACAGCCCGCC GCCGGGCGTG AAAATCAGGC AGCGGCCGGG GATGTGGAAC TGGGATGTCG GCCTGCCGTG A
|
Protein sequence | MIDTKNHKPQ ILAPAGGKAS FLAALAAGAD VIYCGLKSFS ARMAAENFAP GELRALTELA HKKGVKVFVA LNTLVRPGEI PQVRQLVHIL GREVGADALI VQDLSVVELA KQAGFKGELH LSTLGAVTFS KALGLISSAL GVSRVVLPRE FHIDEIKQMA QSCPPGMSLE VFVHGALCYG VSGRCYWSSY MGGKSGLRGR CVQPCRRTYT QGRSEGRWFS CLDFSVDVLT KTLLPLPQIT AWKIEGRKKG PHYVYYTTTA YKMLRDHGND PKIKKDAMGY LEQALGRKTT HYNFLPQRPY PPSGQEEQTG SGLLAGRVKS DGGRPALSPR MGLIKGDLLR IGYEDKPGHS LLRVPAAVPA RGRLVLKVRG AVPAAGTPVF LIDRMEDALE TMIGDLAAEL TDAPCRETTS AAPDRAARQR RPASRSPEEM TVYRSLPRGR QAHSVGFWLS LEGARGAARL NAQQWLWLPP VVWQEDADRW QALVNRMVKQ GARRFVLNAP WQISLFERTR NLDLWAGPFC NQANGVSIQV LAGMGFSGVI VSPELGKEDY AVIPGQSPVP LGVVVSGSLP LCVARTLPGP VREKKLFSSP RKENAWAEKH SGLVWLYPDW MVDLRPRQKI LEQYGYALFI HLHHSPPPGV KIRQRPGMWN WDVGLP
|
| |
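The gene and protein sequences can be related by translating the gene codon by codon. The following is a minimal sketch, not the database's own procedure: the `translate` helper and its variable names are hypothetical, and it uses the standard amino-acid assignments, which translation table 11 shares for elongation codons. It does not handle alternative start codons, which is adequate here only because this gene begins with ATG.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_30 = "ATGATCGACA CAAAAAACCA TAAACCCCAG"
print(translate(first_30))  # MIDTKNHKPQ

# Last 9 bases of the gene sequence: two codons followed by the TGA stop.
print(translate("CTGCCGTGA"))  # LP
```

Both outputs match the corresponding ends of the protein sequence listed above, which begins with MIDTKNHKPQ and ends with LP.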