Gene HMPREF0424_0711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0711 
Symbol 
ID8708766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp807432 
End bp808568 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content37% 
IMG OID646482816 
Productvon Willebrand factor type A domain protein 
Protein accessionYP_003373938 
Protein GI283783184 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.556532 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAATA ATGTTTGGGA ATACATATGG CCATGGAGAT GGCAATGGCC ATTGCTTTTT 
GTTGCAGGCG TTGTTGTTTC TGCATCAATT TTTGCTGTCT TACTAATTTT AGATTCCATT
CGTAGCAATA TTTTAAAAAA GAAGTATAAG AGTCAAACTA GTAATATTAC GCGAAAATTC
TATACTTTTA CTTTTGACAA TGATCTACAA GGAACTACTA CTAGTCATTC ATGGTATATG
TGGCGAATGT TTAGAAGGTT TGCTACGGCT GCATTGGTAG TATCTTTGTG CAGTGCGCTT
GCTGTTGCTT CCAGGCCAGC ACGAGTGTTT AATGCGAACG AGCAAGTTAG TTCTAGAGAC
ATTGTTTTAT GCTTAGACGT TTCAGGTTCT GCTCTACCAT ATGATCGCGA AGTGATTCAA
GCGTATCTTA ATTTTATTGA GCATTTTCAG GGTGAACGCA TTGGATTGAG CATTTTTAAC
TCCACATCTA GGACAGTTTT TCCACTGACT GACGATTATC GACTAGCTAA AAAACAATTG
CAGTATGCTG CAAATTTGCT TGGCGGAGTT CAATCGCAAA GTCGTATTAA TCGTTTACAA
CAACGCCAAT ATCAGGAAAT TTCTGACTGG CTTGAAGGAA CTCAAAATCG CAAAAACGCA
ACATCGCTTA TTGGAGACGG TTTAGTTAGT TGTGCAGCAA TGCTTCCAGG TTTTATTTAT
GGTTCTGTTC ATAATAACCA TAAAATACAA AGCCGTTTTA ATAGAAGTGC TTCCATTGTT
CTTGCTACAG ATAATGTTGT TTCTGGAAAA CAAACTTACT CATTAAAGCA AGCTTTAGAT
TTAACAAAGC AAGCTAAAAT TACTGTCGAC GGATTATATT CTGGCGCAAA ACAGAATGAG
AATGATGACA CTACGTTAGA AATGAAACAG CTTATTGAAT CTCATGGTGG AATTTTCCTT
TCTCAACGCA ATTCTGATTC TGTTATTAAT TTAGTAAAAG AAATTGAAAA AAGACATACA
GCAATTCCTC AAGGCGCTGC TCAGTCTGCA TTTAGTGACG ATCCTGGTCT TTGGGTTTTG
TTTACTGTGT TTAGCGTTGT GATTTGGCTA GCTATTGCAA AGAGGATGAA GCGATGA
 
Protein sequence
MSNNVWEYIW PWRWQWPLLF VAGVVVSASI FAVLLILDSI RSNILKKKYK SQTSNITRKF 
YTFTFDNDLQ GTTTSHSWYM WRMFRRFATA ALVVSLCSAL AVASRPARVF NANEQVSSRD
IVLCLDVSGS ALPYDREVIQ AYLNFIEHFQ GERIGLSIFN STSRTVFPLT DDYRLAKKQL
QYAANLLGGV QSQSRINRLQ QRQYQEISDW LEGTQNRKNA TSLIGDGLVS CAAMLPGFIY
GSVHNNHKIQ SRFNRSASIV LATDNVVSGK QTYSLKQALD LTKQAKITVD GLYSGAKQNE
NDDTTLEMKQ LIESHGGIFL SQRNSDSVIN LVKEIEKRHT AIPQGAAQSA FSDDPGLWVL
FTVFSVVIWL AIAKRMKR