Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_1344 |
Symbol | |
ID | 8709082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 1606408 |
End bp | 1608192 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 646483429 |
Product | trypsin |
Protein accession | YP_003374526 |
Protein GI | 283783772 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0379667 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACG ATATGTATGG TGCAAATCAG CAGGAATCCA ATGAAACTGC AAATAATGGA TATTACGAGC AGCAGGAACA GCCGACGCAA CCAGCACAAC CAACGCAAAC AGCACAAACA TATAGCCCAG CTCCTGAATT CGGAGCTTAC GGCCCAACAA ATAACGAAAA TGAAGTTGGT ACAAACACTA CACAGTATCC AGCTAACAAT TACGCCGTAA ACCAAAATAA CGATGCGGAT AATACAAATA TCAATAATGC TCCTACGCAA TACATTGGAT CACAAAATTA CTACGGAAAT AATAATTTTA ACGGCTTCGG AAATCCGTAT AATTACGGCA CAGACAATAA TTCTTACCCA AATAATGAAG TTGGCAATCA AACACCAGCG CAACAGAATT TTAACAATAA TGCAAGTAAC GAAGAAAATG AAAATATAGC AAAAACAAGC ATTATTAGCA CAAATGCAGC AAATACAAAC GACACAAACA AAAAAGGTAA CAAAAGAAAA ACAAAAAGCT CTTCGTCAAC TGCTTTTGTT GCCATTTTAT CCTCCGCAAT TTCTGCAATA GTTTGCGTAG TTGTAGTTCT ATTTGTAATC TCGCAAGGTC TTATTTCAAT TCCACAAAGC GGCTCTTTCG CAAACATCGG CTCTCATTCT TCTGGCCCTG GGACCGCAGT AGTTAAAGGC GGACAATCTC CGGATTGGCA AGGAGTTGCA AAAAATGTTT CCGGAGCCGT TGTTTCTATA CAAACTCGTT TAGAGAAAGG CATGGGGAAG GGTTCTGGAG CAATTATTGA TTCAAAAGGT TATGTTGTAA CAAACAATCA CGTAATTGCT AATGCAAAAG AAATTCAAGT AACCCTTTCT AATGGTCAAA TTTATTCAGC TACATTAGTC GGAGCAGATA AAACTACTGA TTTGGCAGTA CTTAAATTAG ACAATTCACC AAATAATTTA AAGACAGTCC AATTTGCAGA TTCTAATCTG CTTTCCGTTG GCGAACCGGT TATGGCAATC GGAAATCCGC TTGGATATGA CGATACAGCT ACCACAGGCA TTGTTTCGGC TTTAAATCGT CCAGTATCAG TTATGGACGA CCAAAGCCGC TCTGAAATTG TAACTAATGC AGTACAAATT GATGCAGCTA TTAATCCAGG AAATTCAGGC GGCCCAACTT TTAACGCTGC CGGAAAGGTC ATAGGTATTA ATTCTTCTAT TGCAGCGACA TCAGCTCAAG GGGAGACTAC AGGATCTATC GGCATCGGCT TTGCAATTCC AGCAAATCTG GTGAAACGAG TAGTTACAGA AATTATTAAG AATGGTTCTG TAAAGCACGT AGCACTAGGG ATCATGATTA AAAGCACAGC AGTTGAGTCC GAAGGAATTA CTCGCGGAGG AGCTCAAATT GTTTCTGTCA ATCAAGGAAG CCCTGCTGAA AAAGCCGGAC TCAAAGCCAA TGACACTATT GTTGCCTTTG ACGATAAGCC TGTATCCAAT AATTACGCAC TCCTCGGGTA TGTGAGAGCA ACAGCGTTCA ATCAAAAAGC TACGCTCACA ATAGTTCGTA ACGGCAACAC ACTTAAGTTG CAAGTTACAT TCAACCAAGA AGAAACTGCT GTTAATGGCA CAAATAAGCA AGAAAAGAAA TTAAAGAAAA ACCAAAAGAA GCCTGGTAAA AAACGCGGAA GTAACTCGTA CGATGGTGAC GATGACGATT TACAACAACG TGGAGATGAC GATGGTGACG ATGGTGGAAT ATTTGATCCA TTCGGTTTCT GGTAA
|
Protein sequence | MADDMYGANQ QESNETANNG YYEQQEQPTQ PAQPTQTAQT YSPAPEFGAY GPTNNENEVG TNTTQYPANN YAVNQNNDAD NTNINNAPTQ YIGSQNYYGN NNFNGFGNPY NYGTDNNSYP NNEVGNQTPA QQNFNNNASN EENENIAKTS IISTNAANTN DTNKKGNKRK TKSSSSTAFV AILSSAISAI VCVVVVLFVI SQGLISIPQS GSFANIGSHS SGPGTAVVKG GQSPDWQGVA KNVSGAVVSI QTRLEKGMGK GSGAIIDSKG YVVTNNHVIA NAKEIQVTLS NGQIYSATLV GADKTTDLAV LKLDNSPNNL KTVQFADSNL LSVGEPVMAI GNPLGYDDTA TTGIVSALNR PVSVMDDQSR SEIVTNAVQI DAAINPGNSG GPTFNAAGKV IGINSSIAAT SAQGETTGSI GIGFAIPANL VKRVVTEIIK NGSVKHVALG IMIKSTAVES EGITRGGAQI VSVNQGSPAE KAGLKANDTI VAFDDKPVSN NYALLGYVRA TAFNQKATLT IVRNGNTLKL QVTFNQEETA VNGTNKQEKK LKKNQKKPGK KRGSNSYDGD DDDLQQRGDD DGDDGGIFDP FGFW
|
| |