Gene HMPREF0424_0427 details

Gene Information

Locus tag: HMPREF0424_0427
Symbol:
ID: 8709078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 467417
End bp: 468514
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 50%
IMG OID: 646482542
Product: sortase family protein
Protein accession: YP_003373674
Protein GI: 283782920
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3764] Sortase (surface protein transpeptidase)
TIGRFAM ID: [TIGR01076] LPXTG-site transpeptidase (sortase) family protein
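
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(467417, 468514)
print(length)                  # 1098
print(protein_length(length))  # 365
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.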


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000168911
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGCTAG CTGATTTTGC AGAGTCTGCT CGCGCGCTAT TTAAGCGACT GACTCGTATG 
CGCGCTAATA AGTCCTCCAA AAGTGCGCGA GATTCCGCAA GCTCGCAAGA TTACGCAAGC
TCTGAAATCG CTGCATCTTC CGAAAATACT GAATCTGCGC AATCTCCATC ATCGCAAGAG
CAGTCTTCCC TAAAACCAAT GTCTACACTT CGCAAGCTGG CTGAGCCGAT TGCGCTTGTT
GCTATTGGCA TTTTGTGTTT CAGCTATCCT GTGGTTTCTA CTTTGGTCAA CAATCATGCT
GCCAAGGAAC TCTCAATCGA GTACGACAAA TTAAATAAGG AAAAGCCTAA AGAAAATCGC
GCGGAAATTT TGCGAAAAGC GCGTGAATAC AACGCTCGCC ACAAGGCGAT TATTAGCGCG
GACCCGTATA ACGGCAATAA CGATTACATG GACACTCCCG AATATAAAGA GTACGAAAAA
GTGCTTAGTG AGCCGATGGG AATTATGGGC ATCGTGAAAA TACCAAAAAT TGGCGTGAGA
CTGCCGATTT ACCACGGAAC TACTCAAGAT ACACTAGCAA TGGGCGCGGG GCATTTGTAC
GGCACGGATT TGCCGGTGGG GGGCAAAAGC AGGCACACGG TTGTGACGGC GCATACGGGT
ATGCCGGATG CCACGATGTT CGATGATTTA AACACGTTGA AAAAAGGTGA CTACTTCTAT
TTTGATGTGC AAGGAAAGAC TCTTCGGTAC AAAGTGTTTC GCATAAATGT GGTGGAGCCG
AACGATATTC GTTTGCTGCG GCGTGAGAAG GGGCGCGACT TGGCGACACT GATTACGTGC
ACGCCGTATG GAATTAACAC GCACAGGCTG CTCGTGACGG GGTATCGCGT GCTGCCGGAT
CCTGCTAACG TGCCGGGTGA CCATATGCAG TGGCCGTTGT GGATGACGCT GTTTGTGATA
TCGATGGTGA TGTCTGCGGT GTTGATGGCG ATGATGCTGG TTGCGTCGTT GCGAAAGAAG
AATGGTGTGA GTAGTCTGCA AGGCAGGCAT TTGCTGGCGG TTTCGCGCAA AATGCTGCGT
AAGTTACGGC GCAAGTAG
 
Protein sequence
MKLADFAESA RALFKRLTRM RANKSSKSAR DSASSQDYAS SEIAASSENT ESAQSPSSQE 
QSSLKPMSTL RKLAEPIALV AIGILCFSYP VVSTLVNNHA AKELSIEYDK LNKEKPKENR
AEILRKAREY NARHKAIISA DPYNGNNDYM DTPEYKEYEK VLSEPMGIMG IVKIPKIGVR
LPIYHGTTQD TLAMGAGHLY GTDLPVGGKS RHTVVTAHTG MPDATMFDDL NTLKKGDYFY
FDVQGKTLRY KVFRINVVEP NDIRLLRREK GRDLATLITC TPYGINTHRL LVTGYRVLPD
PANVPGDHMQ WPLWMTLFVI SMVMSAVLMA MMLVASLRKK NGVSSLQGRH LLAVSRKMLR
KLRRK
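
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how such a blocked listing could be cleaned, checked for GC content, and translated. It assumes only the Python standard library; the helper names (`clean`, `gc_fraction`, `translate`) are illustrative. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons, so the sketch uses the standard codon assignments and does not handle alternative start codons such as GTG or TTG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks of a blocked listing and upper-case it."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and the final line of the gene sequence listed above.
head = clean("ATGAAGCTAG CTGATTTTGC")
tail = clean("AAGTTACGGC GCAAGTAG")
print(translate(head))  # MKLADF
print(translate(tail))  # KLRRK
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence listed above if the listing is intact, and `gc_fraction` on the cleaned sequence can be compared with the GC content field.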