Gene HS_1071 details

Gene Information

Locus tag: HS_1071
Symbol: prc
ID: 4240570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1186675
End bp: 1188702
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 36%
IMG OID: 638104632
Product: carboxy-terminal protease
Protein accession: YP_719283
Protein GI: 113461214
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
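
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon, neither of which is stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length(1186675, 1188702)
residues = protein_length(length)
print(length, residues)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.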


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.461658
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAC CTCAACAACT TCGTTATTTG CTTTGTTTGC TGTTTGGCTT TGTTTTAAGT 
TTAAATTACG CCGTCGCAGT TGAACCAAAA CTGAAACTAA CGGATCTTGT TATTCCACAG
ATAACGCAAG AAAATCAATT AGCAACTAAA AGAACTGCTG CTCGTTTGGT TCATTCACAT
TATCAAGCAG TAGAACTCAA TGATGAGTTT TCTCAGCGGA TTTTTGATCG TTATTTGAAA
GCACTTGATT TTAATCGTAA TACTTTTTTG CAATCTGATA TTGATAATAT GAGAGCAATG
TATGCGGATA AAATTGATGA TGTGTTAAAT GAGGGAAACC TGGATATTGC TTTTGATATG
TATGAGTTAT TGATGAAGCG TCGCTATGAG CGTTATCGTT ATGCTCTATC TTTATTGGAT
GAAGAGCCTG ATTTAAATGG CAATGATCAA ATTGAAATAG ATCGGGAAAA AGCGGATTGG
CCAAAAAATG AAACTGAGGC AAATAAGTTA TGGGAACATC GAGTCAAAAA TGATGTGATT
AACTTAAAGT TAAAAGGTAA AAAATGGTCT GAAATTCAAA CGAGGTTAGT TAAACGTTAT
AATTTAGCTA TTCAGCATCT AAGTAAAGTA AAATCTGATG ATATTGTACA ATTGTATTTG
AATGCTTTTG CAAGAGAGAT CGATCCCCAC ACCAGTTATC TAGCGCCTCG CAAAGCTAAA
AATTTCAATG AAAACATGAA CTTGTCTCTT GAGGGAATAG GTGCAACTTT ACAGTCAGAA
GATGGTGAAA CCACTATAAA ATCGCTTGTT CCCGGTGCTC CGGCAGATCG TAGTAAAAAG
CTTAAAGCCG GTGATAAAAT TATTGGAGTC GGGCAAGCGA CAGGGGAAAT TGAAGATGTA
GTCGGTTGGC GTATTGATGA TGTTGTTGAT AAAATCAAGG GGAAGAAAGG AACAAAAGTT
CGTTTAGAAA TAGAACCAGC TAAAGGTGGA AAATCTCAAA TTATTACGTT AGTGCGAGAT
CGTGTTCGTT TAGAAGATCA AGCTGCTAAA CTAACAGTTG AAACTGTTGC TGGTAACAAG
ATCGCCGTGA TTAAAATTCC GGGGTTTTAT AATGGCTTAA CTGAGGATGT ACGTAAATTA
CTTGTTGAAG TTGAAGCTAA AAAAGCGGAA GCTTTAATCA TTGATTTACG TGGAAATGGT
GGTGGCTCTT TACCGGAAGC TATTGAGTTA ACCGGTTTAT TTATTACTGA TGGTCCTGTG
GTTCAAGTTC GGGACGCACA TCAACGTATT CGTATATATG ATGATCCTGA TACAGAGCAA
GTTTATTCCG GTCCCTTGCT TGTTATGATT GACCGATTTA GTGCATCCGC ATCGGAGATT
TTTTCAGCTG CGATGCAAGA TTATAATCGA GCCATTATCC TTGGGCAAAA TACTTTTGGC
AAAGGAACGG TTCAGCAAAG TAGATCACTG AACTTTGTAT ATGATTCAAA TAGTATGGCT
CCTTTGGGTT TACTGCAATA TACTATTCAA AAGTTTTATA GAATTAACGG TGGCAGTACT
CAATTAAAAG GAGTCGCTCC GGATATTATT TTTCCCTCCT CTATTGATGA TGAAGAATAT
GGGGAAGAAA AAGAAGATAA TGCGTTGCCT TGGGATAAAA TTCCATCAGC GTCATATTCT
GAAGTCGGTA ATGCACGCCT GCCAGTAGAT ATATTGAATC AGAAACATCT TGAACGTATT
GCGAAAGACC CTGAGTTTAT TGCACTGGAT GAAGATTTAA AGATTCGTGA TGAAAGAAAA
GAACGTAAGT TTTTATCGTT GAACTTTGCT CAAAGAAAAG CTGAAAATGA TAAAGATGAT
GAAAAACGCT TGAAAGATCT TAATGCTCGT TTCAAACGAG AAGGGAAAAA ACCACTAAAA
GATCTTGATG CTTTGCCGAA AGATTATGAG GATCCTGATT TTTATTTAAA AGAAGCTCAG
AAGATTGCAG TAGATTTAAT TGAATTTAAT AAAAAAATGG CTGAGTAA
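
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole sequence and that the blanks in the listing are only display spacing. The function name is hypothetical, and only the first 10-base block is evaluated here.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace in the listing."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above.
first_block = gc_percent("ATGAAATTAC")
print(first_block)
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the reported GC content field.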
 
Protein sequence
MKLPQQLRYL LCLLFGFVLS LNYAVAVEPK LKLTDLVIPQ ITQENQLATK RTAARLVHSH 
YQAVELNDEF SQRIFDRYLK ALDFNRNTFL QSDIDNMRAM YADKIDDVLN EGNLDIAFDM
YELLMKRRYE RYRYALSLLD EEPDLNGNDQ IEIDREKADW PKNETEANKL WEHRVKNDVI
NLKLKGKKWS EIQTRLVKRY NLAIQHLSKV KSDDIVQLYL NAFAREIDPH TSYLAPRKAK
NFNENMNLSL EGIGATLQSE DGETTIKSLV PGAPADRSKK LKAGDKIIGV GQATGEIEDV
VGWRIDDVVD KIKGKKGTKV RLEIEPAKGG KSQIITLVRD RVRLEDQAAK LTVETVAGNK
IAVIKIPGFY NGLTEDVRKL LVEVEAKKAE ALIIDLRGNG GGSLPEAIEL TGLFITDGPV
VQVRDAHQRI RIYDDPDTEQ VYSGPLLVMI DRFSASASEI FSAAMQDYNR AIILGQNTFG
KGTVQQSRSL NFVYDSNSMA PLGLLQYTIQ KFYRINGGST QLKGVAPDII FPSSIDDEEY
GEEKEDNALP WDKIPSASYS EVGNARLPVD ILNQKHLERI AKDPEFIALD EDLKIRDERK
ERKFLSLNFA QRKAENDKDD EKRLKDLNAR FKREGKKPLK DLDALPKDYE DPDFYLKEAQ
KIAVDLIEFN KKMAE
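
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below is a hand-rolled illustration, not the tool used to build this record. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code, and it does not handle the alternative start codons of table 11 because this gene begins with ATG. All names are hypothetical.

```python
# Codon table built from the compact TCAG ordering of the standard code;
# translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop display spacing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 48 bases of the gene sequence above.
n_term = translate("ATGAAATTAC CTCAACAACT TCGTTATTTG")
c_term = translate("AAGATTGCAG TAGATTTAAT TGAATTTAAT AAAAAAATGG CTGAGTAA")
print(n_term, c_term)
```

The two fragments correspond to the first ten and last fifteen residues of the protein sequence listed above.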