Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1323 |
Symbol | prtC |
ID | 4240835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1512910 |
End bp | 1514286 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638104897 |
Product | collagenase prtC |
Protein accession | YP_719535 |
Protein GI | 113461466 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.768171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACTC AATTCAAACC GGAATTATTA TCTCCAGCCG GTTCGTTAAA AAATATGCGT TATGCTTTTG CTTATGGTGC AGATGCCGTA TATGCCGGTC AACCTCGATA TAGTTTGCGA GTTCGCAACA ATGAATTTAA TCATGCTAAT TTAAAAATCG GTATTGATGA AGCACATGCA CTTGGTAAAA AATTCTATGT TGTAGTGAAT ATTGCACCAC ATAATTCTAA ATTAAAAACC TTTATCAAAG ATTTACAACC CGTGATTGAT ATGCAACCTG ATGCACTAAT CATGTCAGAC CCTGGCTTGA TTATGTTGGT TCGTGAGCAT TTCCCAAATA TTGATATTCA TCTTTCTGTA CAAGCAAATG CAGTGAATTG GGCGACAGTG AAATTCTGGA AACAATTGGG TTTAACTCGA GTGATTCTAT CTCGTGAACT CTCCTTGGAA GAAATCGCTG AAATTCGCCA GCAAGTACCG GATATTGAAG TTGAAATATT TGTGCATGGT GCTTTATGTA TGGCATATTC AGGACGCTGT TTATTATCAG GCTATATTAA CAAGCGTGAT CCCAATCAAG GTACTTGTAC CAATGCCTGT CGCTGGGAAT ATTCTGTTGT AGAAGGAAAA ACAGATGAAG TAGGCAACAT TGTGAATGTT GGTGAAGAAA TTCCCGTTAA AAATGTCGCA CCGACGTTAG GTGAAGGAAA TACAACAAAT AAAGTCTTTT TATTAGCGGA AAATCAGCGA CCTGAAGAGC AAATGTCAGC CTTTGAAGAT GAGCATGGTA CTTATATTAT GAACTCTAAA GATCTGCGTG CAGTGCAACA TGTAGAAAAA CTTTCACAAA TTGGTGTTCA TTCTTTAAAA ATTGAAGGGC GTACAAAATC TTTCTATTAT TGTGCAAGGA CGGCTCAAGT TTATCGCAAG GCTATTGATG ACGCAGTAGC AGGCAGACCT TTTGATGAAA GTTTAATGGA TACATTGGAA AGTTTGGCAC ATCGTGGCTA TACAGAAGGT TTCTTACGCC GTCATACGCA TGATGAATAC CAAAATTATG ATTATGGGTA TTCTATTTCT GAACGCCAAC AATTTGTCGG AGAATTTACC GGTAAACGTA ATGAACAAGG TATGGCGGAA GTTGCGGTTA AAAATAAATT CTTGTTAGGT GATGAAGTTG AATTAATGAC GCCCAAAGGC AATGTGGTTT TTACCATTGA GCGTATGCTT AATCGTAAAA ATGAACACAT TGATGCCGCA CTTGGTGATG GGCATTTTGT TTTTTTAGAT GTTCCTCAAG ATATTCAACT TGATTATGCG TTACTCATGC GTAATTTAGT TAATGCAAAT ACAAGAAATC CACATAATAA AAAATAA
|
Protein sequence | MTTQFKPELL SPAGSLKNMR YAFAYGADAV YAGQPRYSLR VRNNEFNHAN LKIGIDEAHA LGKKFYVVVN IAPHNSKLKT FIKDLQPVID MQPDALIMSD PGLIMLVREH FPNIDIHLSV QANAVNWATV KFWKQLGLTR VILSRELSLE EIAEIRQQVP DIEVEIFVHG ALCMAYSGRC LLSGYINKRD PNQGTCTNAC RWEYSVVEGK TDEVGNIVNV GEEIPVKNVA PTLGEGNTTN KVFLLAENQR PEEQMSAFED EHGTYIMNSK DLRAVQHVEK LSQIGVHSLK IEGRTKSFYY CARTAQVYRK AIDDAVAGRP FDESLMDTLE SLAHRGYTEG FLRRHTHDEY QNYDYGYSIS ERQQFVGEFT GKRNEQGMAE VAVKNKFLLG DEVELMTPKG NVVFTIERML NRKNEHIDAA LGDGHFVFLD VPQDIQLDYA LLMRNLVNAN TRNPHNKK
|
| |