Gene HMPREF0424_0103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0103 
Symbol 
ID8709832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp121784 
End bp123334 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content47% 
IMG OID646482224 
Productthiol-activated cytolysin 
Protein accessionYP_003373370 
Protein GI283782616 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.258222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGCA CAAAGTTCTA CCGTAATGCA GCAATGTTGT TGCTCGCGGG CGCGACTATT 
ATTCCACAAT GCTTAGCAGT TCCAGCAATG GCAGATTCTT CTGCAAAGCC TTCTGCACCA
GCTGCATCTT GTGCAGCTAG CAAGGACTCG CTGAACAACT ACTTGTGGGA TTTGCATTAC
GATAAAACGA ATATTCTCGC ACGTCACGGC GAAACAATTA ACAATAAATT CTCTAGCGAT
AGCTTCAACA ATAACGGTGA ATTCGTTGTT GTTGAGCACC AGAAGAAGAA CATCACAAAT
ACAACATCAA ACTTGTCGGT AACTTCGGCC AACAATGATC GCGTATATCC GGGTGCATTA
TTCCGCGCTG ACCAGAATTT GATGGATAAC ATGCCAAGCT TGATTTCAGC AAATCGTGCT
CCATTAACTT TGAGTATTGA TTTGCCAGGC TTCCATGGCG GCGAGAGTGC TGTAACCGTG
CAACATCCAA CCAAGAGCTC CGTTACTTCA GCAGTTAACG GCTTAGTTTC TAAGTGGAAT
GCACAATATG CAGCAAGCCA TCATGTGGCA GCTCGTATGC AGTATGATTC CGCAAGCGCA
CAGAGCATGA ACCAGCTCAA AGCTAAGTTT GGTGCTGATT TTGCAAAGAT TGGCGTACCG
CTGAAGATTG ATTTCGATGC AGTACACAAG GGTGAGAAGC AGACTCAAAT TGTGAACTTC
AAGCAAACTT ACTACACAGT AAGCGTTGAC GCTCCAGATA GCCCAGCAGA TTTCTTTGCT
CCATGCACTA CGCCAGAAAG CTTGAAGAAC CGTGGCGTTG ACAGCAAACG CCCACCAGTT
TACGTATCAA ACGTAGCTTA TGGTCGCTCA ATGTACGTAA AGTTCGATAC CACCAGCAAG
AGCACTGATT TCCAGGCTGC AGTAGAAGCA GCAATTAAGG GCGTAGACAT CAAGCCAAAC
ACCGAATTCC ATCGCATTCT CCAGAATACC TCTGTTACTG CAGTGATTCT CGGCGGCAGC
GCTAACGGTG CAGCAAAGGT TATTACCGGC AATATCGATA CGCTAAAGGC TTTGATTCAG
GAAGGTGCAA ATTTGAGCAC CTCTAGCCCA GCTGTTCCAA TCGCTTACAC TACTTCCTTC
GTAAAAGATA ACGAAGTAGC AACTCTCCAG AGCAACAGCG ATTATGTTGA GACTAAGGTT
TCTGCTTACC GTGACGGTTA TTTGACCTTG GATCATCGTG GGGCTTATGT AGCTCGCTAC
TACGTCTACT GGGATGAGTA CGGCACTGAT ATCGACGGAA CTCCATACGT ACGTTCTCGC
GCTTGGGAAG GAAACGGCAA GTATCGTACC GCTCACTTCA GCACAACGAT TCAGTTCAAG
GGCAATGTGC GCAATCTTCG CATTAAGTTG GAAGAAAAGA CCGGCTTAGT ATGGGAACCA
TGGCGCACAG TGTACAACCG TACTGATTTG CCGTTGGTTC GCCAGCGCAC AATCAAGAAC
TGGGGTACAA CTTTATGGCC TCGCGTCGCT GAAACTGTAA AGAATGACTG A
 
Protein sequence
MKSTKFYRNA AMLLLAGATI IPQCLAVPAM ADSSAKPSAP AASCAASKDS LNNYLWDLHY 
DKTNILARHG ETINNKFSSD SFNNNGEFVV VEHQKKNITN TTSNLSVTSA NNDRVYPGAL
FRADQNLMDN MPSLISANRA PLTLSIDLPG FHGGESAVTV QHPTKSSVTS AVNGLVSKWN
AQYAASHHVA ARMQYDSASA QSMNQLKAKF GADFAKIGVP LKIDFDAVHK GEKQTQIVNF
KQTYYTVSVD APDSPADFFA PCTTPESLKN RGVDSKRPPV YVSNVAYGRS MYVKFDTTSK
STDFQAAVEA AIKGVDIKPN TEFHRILQNT SVTAVILGGS ANGAAKVITG NIDTLKALIQ
EGANLSTSSP AVPIAYTTSF VKDNEVATLQ SNSDYVETKV SAYRDGYLTL DHRGAYVARY
YVYWDEYGTD IDGTPYVRSR AWEGNGKYRT AHFSTTIQFK GNVRNLRIKL EEKTGLVWEP
WRTVYNRTDL PLVRQRTIKN WGTTLWPRVA ETVKND