Gene HMPREF0424_1344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1344 
Symbol 
ID8709082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1606408 
End bp1608192 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content40% 
IMG OID646483429 
Producttrypsin 
Protein accessionYP_003374526 
Protein GI283783772 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0379667 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACG ATATGTATGG TGCAAATCAG CAGGAATCCA ATGAAACTGC AAATAATGGA 
TATTACGAGC AGCAGGAACA GCCGACGCAA CCAGCACAAC CAACGCAAAC AGCACAAACA
TATAGCCCAG CTCCTGAATT CGGAGCTTAC GGCCCAACAA ATAACGAAAA TGAAGTTGGT
ACAAACACTA CACAGTATCC AGCTAACAAT TACGCCGTAA ACCAAAATAA CGATGCGGAT
AATACAAATA TCAATAATGC TCCTACGCAA TACATTGGAT CACAAAATTA CTACGGAAAT
AATAATTTTA ACGGCTTCGG AAATCCGTAT AATTACGGCA CAGACAATAA TTCTTACCCA
AATAATGAAG TTGGCAATCA AACACCAGCG CAACAGAATT TTAACAATAA TGCAAGTAAC
GAAGAAAATG AAAATATAGC AAAAACAAGC ATTATTAGCA CAAATGCAGC AAATACAAAC
GACACAAACA AAAAAGGTAA CAAAAGAAAA ACAAAAAGCT CTTCGTCAAC TGCTTTTGTT
GCCATTTTAT CCTCCGCAAT TTCTGCAATA GTTTGCGTAG TTGTAGTTCT ATTTGTAATC
TCGCAAGGTC TTATTTCAAT TCCACAAAGC GGCTCTTTCG CAAACATCGG CTCTCATTCT
TCTGGCCCTG GGACCGCAGT AGTTAAAGGC GGACAATCTC CGGATTGGCA AGGAGTTGCA
AAAAATGTTT CCGGAGCCGT TGTTTCTATA CAAACTCGTT TAGAGAAAGG CATGGGGAAG
GGTTCTGGAG CAATTATTGA TTCAAAAGGT TATGTTGTAA CAAACAATCA CGTAATTGCT
AATGCAAAAG AAATTCAAGT AACCCTTTCT AATGGTCAAA TTTATTCAGC TACATTAGTC
GGAGCAGATA AAACTACTGA TTTGGCAGTA CTTAAATTAG ACAATTCACC AAATAATTTA
AAGACAGTCC AATTTGCAGA TTCTAATCTG CTTTCCGTTG GCGAACCGGT TATGGCAATC
GGAAATCCGC TTGGATATGA CGATACAGCT ACCACAGGCA TTGTTTCGGC TTTAAATCGT
CCAGTATCAG TTATGGACGA CCAAAGCCGC TCTGAAATTG TAACTAATGC AGTACAAATT
GATGCAGCTA TTAATCCAGG AAATTCAGGC GGCCCAACTT TTAACGCTGC CGGAAAGGTC
ATAGGTATTA ATTCTTCTAT TGCAGCGACA TCAGCTCAAG GGGAGACTAC AGGATCTATC
GGCATCGGCT TTGCAATTCC AGCAAATCTG GTGAAACGAG TAGTTACAGA AATTATTAAG
AATGGTTCTG TAAAGCACGT AGCACTAGGG ATCATGATTA AAAGCACAGC AGTTGAGTCC
GAAGGAATTA CTCGCGGAGG AGCTCAAATT GTTTCTGTCA ATCAAGGAAG CCCTGCTGAA
AAAGCCGGAC TCAAAGCCAA TGACACTATT GTTGCCTTTG ACGATAAGCC TGTATCCAAT
AATTACGCAC TCCTCGGGTA TGTGAGAGCA ACAGCGTTCA ATCAAAAAGC TACGCTCACA
ATAGTTCGTA ACGGCAACAC ACTTAAGTTG CAAGTTACAT TCAACCAAGA AGAAACTGCT
GTTAATGGCA CAAATAAGCA AGAAAAGAAA TTAAAGAAAA ACCAAAAGAA GCCTGGTAAA
AAACGCGGAA GTAACTCGTA CGATGGTGAC GATGACGATT TACAACAACG TGGAGATGAC
GATGGTGACG ATGGTGGAAT ATTTGATCCA TTCGGTTTCT GGTAA
 
Protein sequence
MADDMYGANQ QESNETANNG YYEQQEQPTQ PAQPTQTAQT YSPAPEFGAY GPTNNENEVG 
TNTTQYPANN YAVNQNNDAD NTNINNAPTQ YIGSQNYYGN NNFNGFGNPY NYGTDNNSYP
NNEVGNQTPA QQNFNNNASN EENENIAKTS IISTNAANTN DTNKKGNKRK TKSSSSTAFV
AILSSAISAI VCVVVVLFVI SQGLISIPQS GSFANIGSHS SGPGTAVVKG GQSPDWQGVA
KNVSGAVVSI QTRLEKGMGK GSGAIIDSKG YVVTNNHVIA NAKEIQVTLS NGQIYSATLV
GADKTTDLAV LKLDNSPNNL KTVQFADSNL LSVGEPVMAI GNPLGYDDTA TTGIVSALNR
PVSVMDDQSR SEIVTNAVQI DAAINPGNSG GPTFNAAGKV IGINSSIAAT SAQGETTGSI
GIGFAIPANL VKRVVTEIIK NGSVKHVALG IMIKSTAVES EGITRGGAQI VSVNQGSPAE
KAGLKANDTI VAFDDKPVSN NYALLGYVRA TAFNQKATLT IVRNGNTLKL QVTFNQEETA
VNGTNKQEKK LKKNQKKPGK KRGSNSYDGD DDDLQQRGDD DGDDGGIFDP FGFW