Gene HMPREF0424_1214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1214 
Symbol 
ID8708978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1441623 
End bp1442642 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content51% 
IMG OID646483302 
Productsortase family protein 
Protein accessionYP_003374408 
Protein GI283783654 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3764] Sortase (surface protein transpeptidase) 
TIGRFAM ID[TIGR01076] LPXTG-site transpeptidase (sortase) family protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000620808 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAATTAA ACGACGCTGT TGTTAAAGTA TATGCTTTCT TCAAGCAACT CAAAGCAAAA 
CCCGTCGCTT CTGCAACCGA AACAACACCA GCCTCCACAG CAACGCAAAC CGCAGCCTCC
GCAACTGCAA CAAAACCCGC TAAAAAACCC AAATCCAAAC TCCGCCGTAT TTTAGAGCCA
ATCGTTCTCG CGCTTGCAGC GATTCTGTGC TTCACGTACC CGATTGCGTC AACGTTGTGG
AATAACAAGG TGTCGAAGGA GATTTCTGTA GCTTATGATC GGCTGAGTCG TAAGCAGACT
GCGAGCGCTA GAGAGAAGAT TTTGAACGCG GCTCGTCGTT ATAATGCGCG CCACAAGTCG
ATTATTACGG CTGATCCGTA CAATGGCACG ACGGATTATA TGAAGACGCC GGAGTATAAG
GAGTATGCGA AGCAGCTGGA TGAGCCGATG GGGATTATGG GTATTGTGAA AATCCCGAAG
ATTGGCGTTA AGCTGCCGAT TTACCACGGT AGTTCGCAGG AGGTTTTGGC GTATGGCGCT
GGACACTTGT ATGGCACAGA TTTGCCGGTT GGGGGTAAAA CGCGTCACGC TGTTGTGACG
GCGCATACGG GTTTGCCAAA TGCGACCATG TTTGATGATT TAGTGGAGTT GAAGAAGGGT
GATTTCTTCT ACTTTGACGT GCAGGGGGAG ACGCTGCGTT ACAGGGTATT CCGCATTAGC
GTGGTCGATC CGCATGATAT TAGGTTACTT CAACGCGAGA ATGGGCGTGA TCTGGCGACG
CTGCTTACCT GTACGCCGTA TGGTGTTAAT ACGCAGAGAT TGCTGGTGAC AGGCTATCGC
GTGCTGCCGG ATCCTGTTTC CGTGCCGGGC GATAAGTTGC AGTGGCCGCT TTGGATGACG
ATGTTTGTGC TCGCGCTTAT TGGTTGCGCT GTGATTCTTA CGACGATGAT TATTGCCGCG
GTTCGCAAGC GCCACCGCCG TGCTGCTTCA CTTGCTCATG CCAAACATAT AACTGCTTAG
 
Protein sequence
MKLNDAVVKV YAFFKQLKAK PVASATETTP ASTATQTAAS ATATKPAKKP KSKLRRILEP 
IVLALAAILC FTYPIASTLW NNKVSKEISV AYDRLSRKQT ASAREKILNA ARRYNARHKS
IITADPYNGT TDYMKTPEYK EYAKQLDEPM GIMGIVKIPK IGVKLPIYHG SSQEVLAYGA
GHLYGTDLPV GGKTRHAVVT AHTGLPNATM FDDLVELKKG DFFYFDVQGE TLRYRVFRIS
VVDPHDIRLL QRENGRDLAT LLTCTPYGVN TQRLLVTGYR VLPDPVSVPG DKLQWPLWMT
MFVLALIGCA VILTTMIIAA VRKRHRRAAS LAHAKHITA