Gene HMPREF0424_0302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0302 
Symbol 
ID8709150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp326633 
End bp329089 
Gene Length2457 bp 
Protein Length818 aa 
Translation table11 
GC content43% 
IMG OID646482420 
Productpeptidase, S9A/B/C familie, catalytic domain protein 
Protein accessionYP_003373557 
Protein GI283782803 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.327018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATC AAGATTTTGC GAAGCCAAAA GCGGCAGAAC GCATACCGTC GACGCGTGAA 
TTCCATGGCG ACACTTATAT AGACGAATAT GCTTGGATGA AAGATCGCAA TAATCCTAAG
CTTATGGAAT ACATCAATAG TCAAAATGCT TATACTGAAC AGCGTGTAAA GCACTTAGAA
AACTTGCGTT CTACATTATT TGACGAGTTG CGCTCAAGAG TTCAAGAAAC AGACATGTCT
GTTCCTGTGC GCATGAATAA TTACTGGTAT TACGTGCGCA CTGAAAAAGG CAAACAATAC
GCTGTGCAAT GCCGTATGAA AGTCAAAAAT AGTGACGATT GGAATCCGCC AACTATCGAC
AATTCTGCAA AACCTGGCAC AACTGATGGT GAAGAAATTT TATTTGATGC AAACCTTGAA
TCTGAGCACT CTGGTGGCAA GTTCTTTAGA GTCGGTGGAA TGGATTTGAG TACCGACGGT
TTGAGAATGC TGTACGGTGT CGACACTCAA GGCGATGAAC GCTTTAACTA TTTTATTCGT
GACTTTTCTA CAAAAACATG TTCTTGGTTG CAGTTAGAAG AATCGTGGGA AAATCTTGCT
TCTGCTTCAC TTTCTCCAGA TGGAAAATGG GTGTTTTATG TAAAGCTTGA TGATGCTTGG
CGTCCTTATC AAGTTTGGAG GCATAAGGTT GGCACTAGCG TTTCTAATGA TGTAAAAGTT
TTTGAAGAAA CTGACGAGCG ATTTTTTGTT GACGTGTATG AAAGTTTTGA CGAACGATAC
ATGATGATCA GCAGTAGTTC AAAGACTACG TCGCGCGTGC TTATGCTTCC TTTGAGCAAT
CCTGAAGGCG AATTCCGTAT GGTTATTGAG CCGGTTGAGG GGGTGGAATA CGATATTTCT
TTCGCTTGCT TTGAGAATGC TGGCGAAAAT GGTGAAGATA TTCCAATTGC GATTGTGTGC
CATAACGCTA AAAATCCTAA TTTTGAAATT GACATTATTG ATATGAGAGG AAATTCCGGC
AAGTTTGATG AGCATATAAC GTATAAGCTT TCGGATGGAG TATGTGTTGC GTCGGGTTCT
CCTTATGGTT GCGAGCAGGG AGATGCTTGG CAACAAGGTG CTGGAAGTGA GCCTATTACA
AAGCCTTATA ATTCGCCTCA AAATCCTGAA ATATTGCAAA ATGCAGTTGG ATTGAGCATA
AGCGGATTGT CAATGTACAA AAATTATGTG GCACTAGCTT ATCGTTCTCA AGGTTTGCCG
CATTTGGCTG TAATGAGCAA GAAGCGCGCT GTTGAAGATT TTTTGCAAGC AAAACCGTGG
CGTTTTTGCG AAGTGCGACC GCTTTCTTCC CAGCTTGCGA ATCAATTTAC GGATCAACTT
TCTTCTCAAC TTGTGGATCA AGCGCAAGCC GAATCACGTG CTGCGCTCGC AAATATTTCA
CAAGAAGAAA TCAACAATAA TCGCCTATTG AGTATTTCTA TGACTGGAAA TCCTTCGTAC
GAAGCTCCGC GTATGCGTTA CGCTTTTGGT AGTTATGCGA CTTTAGGTCA ACTTCGCGAG
TTGGATCCTT TAACAGGCGA AGATGTGTTG CTCAAGCAAG GTAAAATTTT AGGTGAATTT
AATGAGCGCA ATTATGCTGA AAAGCGCGTT TGGATTACTG CTCGAGATGG TGAGCGTGTG
CCAGTGTCTC TTGTGTGGCG TCCGGATAAA ATTTCTCAAA CTGACTCAAT GTTTATTACT
GGTTATGGAG CGTATGAAGT TAGTTCAGAC CCTACTTTTT CTGTCGGTCG CTTGAGTTTG
CTGGATCGCG GCGTGTTATA TGCGCAAGTG CATGTGCGAG GCGGCGGCGA AATGGGTCGT
GCGTGGTATG AACAGGGTAG GCGCGTCAAT AAAAAGCATA CTTTTGAAGA TTTTGCTGAT
GCCACGCGTG CTTTGCAAAA TGCTGGTTTT GCTTCCGCTT GTCATACTGT TGCAAACGGT
GGTTCAGCAG GAGGCTTGCT TATGGGTGCT ATAGCAAATA TGGCTCCGGA ATGTTATGCA
GGAATTGAAG CAGACGTGCC TTTTGTAGAT GCCCTTACGT CTATGCTCGA CTCTTCTTTG
CCGCTTACTG TAACAGAGTG GGATGAGTGG GGCAATCCAC TCGATGATCC TGAAGCTTAT
GCTTATATGA AATCGTATAC TCCTTACGAA AATGTGCCTT GCGTAAAAAT GGAAGATGGT
TGCAAAAAGT TAGCTGATTT TCCGAAAATC TTAATCACGT CCTCTATTCA CGACACGCGC
GTGCTTGTTA CGGAACCTTT AAAATGGCTC GCTAAATTGC AAGCTTCTGG AGTTGACGCA
ATTGCTCGTA TTGAAACTGA CGGTGGTCAC GGTGGCACTT CTGGCAGGTA CAGGCAATGG
CAGGAGTTAG CTTATGAAAA CGCATGGTGC CTGCATGTTA TGGGTATAAA CAGTTAG
 
Protein sequence
MNNQDFAKPK AAERIPSTRE FHGDTYIDEY AWMKDRNNPK LMEYINSQNA YTEQRVKHLE 
NLRSTLFDEL RSRVQETDMS VPVRMNNYWY YVRTEKGKQY AVQCRMKVKN SDDWNPPTID
NSAKPGTTDG EEILFDANLE SEHSGGKFFR VGGMDLSTDG LRMLYGVDTQ GDERFNYFIR
DFSTKTCSWL QLEESWENLA SASLSPDGKW VFYVKLDDAW RPYQVWRHKV GTSVSNDVKV
FEETDERFFV DVYESFDERY MMISSSSKTT SRVLMLPLSN PEGEFRMVIE PVEGVEYDIS
FACFENAGEN GEDIPIAIVC HNAKNPNFEI DIIDMRGNSG KFDEHITYKL SDGVCVASGS
PYGCEQGDAW QQGAGSEPIT KPYNSPQNPE ILQNAVGLSI SGLSMYKNYV ALAYRSQGLP
HLAVMSKKRA VEDFLQAKPW RFCEVRPLSS QLANQFTDQL SSQLVDQAQA ESRAALANIS
QEEINNNRLL SISMTGNPSY EAPRMRYAFG SYATLGQLRE LDPLTGEDVL LKQGKILGEF
NERNYAEKRV WITARDGERV PVSLVWRPDK ISQTDSMFIT GYGAYEVSSD PTFSVGRLSL
LDRGVLYAQV HVRGGGEMGR AWYEQGRRVN KKHTFEDFAD ATRALQNAGF ASACHTVANG
GSAGGLLMGA IANMAPECYA GIEADVPFVD ALTSMLDSSL PLTVTEWDEW GNPLDDPEAY
AYMKSYTPYE NVPCVKMEDG CKKLADFPKI LITSSIHDTR VLVTEPLKWL AKLQASGVDA
IARIETDGGH GGTSGRYRQW QELAYENAWC LHVMGINS