Gene HMPREF0424_1161 details


Gene Information

Locus tag: HMPREF0424_1161
Symbol:
ID: 8708745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 1357478
End bp: 1358611
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 49%
IMG OID: 646483251
Product: sortase family protein
Protein accession: YP_003374359
Protein GI: 283783605
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3764] Sortase (surface protein transpeptidase)
TIGRFAM ID: [TIGR01076] LPXTG-site transpeptidase (sortase) family protein
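
The coordinate and length fields in this record agree with one another, and a short sketch can confirm that. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length counts one terminal stop codon. The function names are illustrative and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(1357478, 1358611)
print(gene_len, protein_length(gene_len))  # prints: 1134 377
```

Both results match the Gene Length and Protein Length fields reported above.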


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000131719
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGAAAAG AAAACCACTC AAACTCGCGC AACAACGCCA AAAACTGGCA CTTCTCTAAG 
TCCTGCATAG TCCCTGCTTT GCTGTTGCTT GCATCGTTTG CGTCGCTTAT TTATCCGCAC
GCCGCCATGT GGCTAAGCCA ATACCACATG TCGGAAGTTG CTGCAGAGTA TGCGAAGCTT
ATTACACACG CGGTTCCTGC TCCGCATGAG CAGCTCCGCC GCGCTCGTGA GTATAACAGT
AAGCTTTCGT CGGGTGCTAT TTACGAAGCG AATACGAATA TTCCAACTTC GCATGGCGAA
ACGTCGGATG CTAGTCAGGA TTATTGGGAT CAGTTGAAAG TGAACGACGA CGGGTTGATG
GCTCGTTTGC GAATCAAAAA AATTGATCTT GATTTGCCTG TGTATCACGG CACAGAGGAT
GCCACGTTGC TTAAGGGTCT TGGTCATTTG CGTGGAACTT CGCTTCCGGT TGGCGGCAAG
GGGACGCGCT CTGTGATTAC TGGGCATCGC GGTTTGGCAA GTGCGGAAAT GTTCACCAGG
CTGGATGAAG TTGGAAAAGG TGATACTTTT ACGATTGAGG TTTTCGACGA GATTCTTACC
TACAAGGTTG TTGACAAGAT CGTTGTAAAC CCTGATGAGA CGAAGAAGAT TGCAGCCGTT
CCTGGTAAGG ATTTGATGAC GCTGATTACT TGTACGCCGC TTGGCATTAA CACGCAGCGC
ATTTTGGTTA CTGGTGAACG CGTGGTGCCA ACGCCTGCGG CAGACAAAGC ACTTCGAGGT
AAAAAGCCTG ATGTGCCGCG ATTCCCGTGG TGGATTGTTG CTTGCTTTGG AAGCTTGTGT
ATTGTTGGCG GCTACATTTG GTGGGCTGGT TTGCCTGTAA AGAAGAAAAA GAAAACTGAA
AAGAGCGATG CTGCTGCGGA TGCGTCTGCT GCTACTAGTG ACGACATAAG TGCAGAAAGC
TCATCGACTA GCACGGTAAG CAGTGCGGAT CTTGTGCTCG AGGAAGCTAA GGCGATTAAA
AATCGTCCGA AAAAACGGAC GAAAGTCAAA AAGTCTAGTT TGCGCAAGAA ACGCAAGCGG
AGTGGTCACG GCGAAAAGAC TTTCGACTGG CTTATAAGCA TCCTTGGTTT ATGA
 
Protein sequence
MRKENHSNSR NNAKNWHFSK SCIVPALLLL ASFASLIYPH AAMWLSQYHM SEVAAEYAKL 
ITHAVPAPHE QLRRAREYNS KLSSGAIYEA NTNIPTSHGE TSDASQDYWD QLKVNDDGLM
ARLRIKKIDL DLPVYHGTED ATLLKGLGHL RGTSLPVGGK GTRSVITGHR GLASAEMFTR
LDEVGKGDTF TIEVFDEILT YKVVDKIVVN PDETKKIAAV PGKDLMTLIT CTPLGINTQR
ILVTGERVVP TPAADKALRG KKPDVPRFPW WIVACFGSLC IVGGYIWWAG LPVKKKKKTE
KSDAAADASA ATSDDISAES SSTSTVSSAD LVLEEAKAIK NRPKKRTKVK KSSLRKKRKR
SGHGEKTFDW LISILGL
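
The protein sequence can be reproduced from the gene sequence using translation table 11, as the following minimal sketch shows. It relies on two assumptions: table 11 assigns the same amino acids as the standard code, and a leading codon from the table's initiator set (here GTG) is read as methionine. The helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical, and ambiguous bases such as N are not handled.

```python
# Codon table built from the usual compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiator codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display layout."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base up to the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGAAAAG AAAACCACTC AAACTCGCGC"))  # prints: MRKENHSNSR
```

Passing the full gene sequence to `translate_cds` allows a residue-by-residue comparison with the protein sequence, and `gc_percent` on the same input can be compared against the GC content field.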