Gene HS_1323 details

Gene Information

Locus tag: HS_1323
Symbol: prtC
ID: 4240835
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1512910
End bp: 1514286
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 37%
IMG OID: 638104897
Product: collagenase prtC
Protein accession: YP_719535
Protein GI: 113461466
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
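
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
START_BP = 1512910
END_BP = 1514286


def feature_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


gene_len = feature_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # 1377 458
```

Both values agree with the Gene Length and Protein Length reported in the record.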


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.768171
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACTC AATTCAAACC GGAATTATTA TCTCCAGCCG GTTCGTTAAA AAATATGCGT 
TATGCTTTTG CTTATGGTGC AGATGCCGTA TATGCCGGTC AACCTCGATA TAGTTTGCGA
GTTCGCAACA ATGAATTTAA TCATGCTAAT TTAAAAATCG GTATTGATGA AGCACATGCA
CTTGGTAAAA AATTCTATGT TGTAGTGAAT ATTGCACCAC ATAATTCTAA ATTAAAAACC
TTTATCAAAG ATTTACAACC CGTGATTGAT ATGCAACCTG ATGCACTAAT CATGTCAGAC
CCTGGCTTGA TTATGTTGGT TCGTGAGCAT TTCCCAAATA TTGATATTCA TCTTTCTGTA
CAAGCAAATG CAGTGAATTG GGCGACAGTG AAATTCTGGA AACAATTGGG TTTAACTCGA
GTGATTCTAT CTCGTGAACT CTCCTTGGAA GAAATCGCTG AAATTCGCCA GCAAGTACCG
GATATTGAAG TTGAAATATT TGTGCATGGT GCTTTATGTA TGGCATATTC AGGACGCTGT
TTATTATCAG GCTATATTAA CAAGCGTGAT CCCAATCAAG GTACTTGTAC CAATGCCTGT
CGCTGGGAAT ATTCTGTTGT AGAAGGAAAA ACAGATGAAG TAGGCAACAT TGTGAATGTT
GGTGAAGAAA TTCCCGTTAA AAATGTCGCA CCGACGTTAG GTGAAGGAAA TACAACAAAT
AAAGTCTTTT TATTAGCGGA AAATCAGCGA CCTGAAGAGC AAATGTCAGC CTTTGAAGAT
GAGCATGGTA CTTATATTAT GAACTCTAAA GATCTGCGTG CAGTGCAACA TGTAGAAAAA
CTTTCACAAA TTGGTGTTCA TTCTTTAAAA ATTGAAGGGC GTACAAAATC TTTCTATTAT
TGTGCAAGGA CGGCTCAAGT TTATCGCAAG GCTATTGATG ACGCAGTAGC AGGCAGACCT
TTTGATGAAA GTTTAATGGA TACATTGGAA AGTTTGGCAC ATCGTGGCTA TACAGAAGGT
TTCTTACGCC GTCATACGCA TGATGAATAC CAAAATTATG ATTATGGGTA TTCTATTTCT
GAACGCCAAC AATTTGTCGG AGAATTTACC GGTAAACGTA ATGAACAAGG TATGGCGGAA
GTTGCGGTTA AAAATAAATT CTTGTTAGGT GATGAAGTTG AATTAATGAC GCCCAAAGGC
AATGTGGTTT TTACCATTGA GCGTATGCTT AATCGTAAAA ATGAACACAT TGATGCCGCA
CTTGGTGATG GGCATTTTGT TTTTTTAGAT GTTCCTCAAG ATATTCAACT TGATTATGCG
TTACTCATGC GTAATTTAGT TAATGCAAAT ACAAGAAATC CACATAATAA AAAATAA
 
Protein sequence
MTTQFKPELL SPAGSLKNMR YAFAYGADAV YAGQPRYSLR VRNNEFNHAN LKIGIDEAHA 
LGKKFYVVVN IAPHNSKLKT FIKDLQPVID MQPDALIMSD PGLIMLVREH FPNIDIHLSV
QANAVNWATV KFWKQLGLTR VILSRELSLE EIAEIRQQVP DIEVEIFVHG ALCMAYSGRC
LLSGYINKRD PNQGTCTNAC RWEYSVVEGK TDEVGNIVNV GEEIPVKNVA PTLGEGNTTN
KVFLLAENQR PEEQMSAFED EHGTYIMNSK DLRAVQHVEK LSQIGVHSLK IEGRTKSFYY
CARTAQVYRK AIDDAVAGRP FDESLMDTLE SLAHRGYTEG FLRRHTHDEY QNYDYGYSIS
ERQQFVGEFT GKRNEQGMAE VAVKNKFLLG DEVELMTPKG NVVFTIERML NRKNEHIDAA
LGDGHFVFLD VPQDIQLDYA LLMRNLVNAN TRNPHNKK