Gene HMPREF0424_1062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1062 
Symbol 
ID8709441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1209370 
End bp1213053 
Gene Length3684 bp 
Protein Length1227 aa 
Translation table11 
GC content42% 
IMG OID646483154 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_003374265 
Protein GI283783511 
COG category 
COG ID 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain
[TIGR02167] bacterial surface protein 26-residue repeat 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0190621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGTA GTAGGAAACA TCGTATTCTT TCGACAATTG GTGCCGTTTC ATGCGTAGTC 
TCTATGATTT TAAGCGTAAC TGTGCTGCCT GCGATAGCTT ATGCCAACGT AACGAAGAAT
CCATTGTTTG TTGCTGCTAT GGCTGCTGCT GCCGAGAATA ATAGGGGTCA AAATAAGTCG
GGTGAAGGGG GATCTAAAAA GGGAGTTCAA GGGGAAGCTG GTAAAGGTAC TGGTAAAGAA
GTTAAAGGGG ATAAGGCTAA AGGGGTAGAT AAAAAAGAAG GAAACAGGGC AGCTAAAGGG
GAAGCTGTTA AAGATAATGA AGTAGTAGAC AATAAAGGTA GTAAGGGGAA TGGTGAAAAT
CCTAATGGGG GGGGGGCTGT AGATAACGGC AGTGCTGGCA AAGATGCTGG CAAAGATGCT
GGCGATGCTG CCGGGAATGC TGCAAAATCT GGCAAAGACG CTAACAACGC TGACAACAAA
TCTGCCAAAA AAGATGGAAA AGCTGGCAAA GATTGCAGCG CGGGTGGTGT TGAGAAATTA
AGCCTAGATG CAGTTAAAGC ACTTCCTGCT TGCGTGGCTA GTGTCATGGC AACTAATGCA
GATGGAACAA TCAGTACTCT TTCAATCGAA CCTCCATCCG GAGAAAACAA GTGTGCTGTT
AGTGCTGATC AACGCAGTGA TGAGTTTGGT AAAATTATAA ATGAAATAAA AAAACTTGCT
CCTACAGATA GCTCGTTTGG TTTTTACGGT GGCAAAGTGT ATGCGTACGG GCGTTTGTCA
AAATTGGACG GGAAGAATTA TGTAGGTCTT TTTTCAGGGA AAAATATTAG TAACATTGAT
AACATCGAAA AGTTTGATAT AAGTTATGTT ACAGACATGT CCCACCTTTT TAAGGATTCC
TCATTAACTG ATTTTAGTTT TTTGAGTAGC TGGGATGTGA GCAATGTTAC GAACATGCAA
AGCATGTTCG AGGGGTGCAC TGGTTTAGAA GATATTAGCG GTTTGGCTAA TTGGAAGGTT
GACAATGTTA CGAATATGGA GGGCATGTTC GAGGGGTGCA CTAGTCTGAA ATCTCTTAAA
GGGTTGGAAG ATTGGGATGT AAGCAATGTT AGAAGTATGT CTGGAATGTT CGCGTCTGTT
ATTAAGGATC AAGATCAACA TGATGCTTTG AATCCAGTAG ACGGGTATGC AGGTGAGATG
GCGATTGATT CGGTTAAACC TTTACGTAAG TGGAATGTTA GCAATGTTAT GAACATGAAC
AGAATGTTTG AGGGTTGCAC TAGCATAACA GATTTTACTG GTTTGGAAGG TTGGGATACT
AAAAGCGTTG TGGCAATGAT CGGTATGTTC GAATACTGCA AGAGTATGAG TTCGCTTGGA
TTTTTGAAAA AGTGGACTGT AAAGAACGTT GAATATATGA TGGCAATGTT TGCGCTGTGC
GACAAGATTA AAAATACTGA GGGCCTAGAA AATTGGAATG TTAGTAACGT TAAAAAAATG
GATGACATGT TTGCAGAATG CTCTAGCCTC GATAGTATCA GTGGATTATC TAATTGGAAT
ACAAGCGGTA AGAGCAGTAC AAGCAAATTA ACTAGCACGT ATAGAATGTT CGATAACTGC
AGTTTTCTTT CTGATCTTCA ACCTTTGAGC GGTTGGAATG TTGGCAGTGT TACTGACATG
CATGACATGT TTAACAATTG CGGCAGTCTT ACTGGTCTTG AACCTTTGAG TGGTTGGGAT
GTTGGCAGTG TTAAGAACAT GAACAGCATG TTCAGCAGTT GTAATGGTTT AACTAGTCTT
GAGTCTTTGA GTAAGTGGTT GAATGATAAG AGCAGTGTCA CGGACATGAG TAGCATGTTC
AGCGGTTGTA ATAGCTTGAG TGACCTTAAA GGTCTAGAAA ATTGGAATGT CGGCAATGTT
AAGAACATGA GCAGCATGTT CAGCGGTTGC GCTACGGATA TCTACGGTTC TGGAGATGAT
CCTAATCCTA TTGGTATAAA AGGATTGGCT GATATAAGTG CATTAAGTGG TTGGGATGTT
GGCAGTGTCA CGGACATGAG TAGCATGTTC AGCGGTTGTA ATAGCTTGAG TGACCTTAAA
GGTCTAGAAA ATTGGAATGT TGGCAGTGTT AAGAACATGA GCAGCATGTT CAGCCATGAG
GTTAGTGGTT CTGATTTTGA TAAGCTGACT GATATAAGTG GATTGAGTAA GTGGAATGTT
AGCAATGTTA AGGACATGTC TAGCATGTTC AATAGTTGTG TAACGTTGGA GAGTATTAAT
CCTTTGAGCG GCTGGAATGT TAGCAATGTT AAGGACATGT CTAGCATGTT TGCATATTGC
AAAAAACTTA CCGGCGCTGC AGATTTAAGC AAGTGGAATA TTTCAAAAGT TACGAATCTT
AGTTCAATGT TTTCCAATGC AGGAGCAGAA AGCGGCGACA GTCTTATATT AGATTTCAGC
GATAAGGCTT TTACAAAATC GGATACACCT GTATACGCGG ATGACGAAGG GAAGCAGCGT
TACTTTTCTG TTGATAGTAT GTTCAGTGGT TTTAAAGGCA CGCTGATTGC GAATAACTTG
AAGTCTAGTG GTTTTGAGAA TAATCCTGAG GATGATCCTA TCGCAAATTT TGCTAGTGGT
TACATATTCT CCCTCGATGA GAATGGTCAC AGCAATAATA TAGTCGTAAC TAATAATGTT
ACGATTTTAG ATACAATTTC AAACGATGAT GATCTTAAAA AGATTGCATA CTACATTCCT
GTTAAAGCCA AGCTGATGGA GCCTGGAGAT ATAAAAAATG ACAATATGAC ATGTGATTGC
AATAGAGATT CGGGTGGCAC TACTTATACT TACTATGTTC CTGCACTGTA TAATTCGAGT
AATTTAGAGA AGAAACAGGA TGAAAAATCG CAGGATTATG CGTACAGAAT TGTAAAAAAT
AGTTTCGCTT CAAAGTTGCG GTACGCTATA GATTCTCAAT CGGGCAAAGG TACTAGTACT
GAGGAACCAG GTGGAGATGA ACCTGGCGGA GATGAACCTG GTGAAGGTGA AGGTGGCGAT
GAATCAGGTG GTGATACTGA TACTGGCTCG CGTAGTGCAA CGTTGAAATC GCGTGTAAAG
AAACTGCGAT CCGCAAGGAA ATCGCGATCT GTAAGAGATG GCGATGATGA AGATGAAAAT
CCTGAACCTG AGCCTGACGA AAAACAGGGT AATACTTATA AAATCATGTT TAAGAAGTCA
AGCGAAGGTG GTTGCACTTG GACAGAACTT GGCGATGGTG ATGGTGCTGT GCCTGTTAGC
GATCCAAAGT CGCCTTTAGT GTTCTTCACT GCAACGTACT ATTTGGAGAC TTTTGTTAAG
CCAGTGCCTT TGCCGCATAC TGGCGGTCAG TCGGCTGTTA TGTTCACGTT CTTGTCGATT
GGACTGTTCT CAATGTTCGC TGTTGCTGGG GCATTTGGTC GTCATGGATG GGTTGCGTCT
GTGCTTCCAA GCTCTGCTTT AGCTGGGTTT TCGAAGGCTT TTGGTGGCGG CGCGCTCGAT
ATTGCTTCTG GTGCTAAGCA CTGCTTCGCG CATCCTGTTG TAAGTGGCTC GATGATTAAA
TCTGCTGTAA GTGCGGCAAC CGGCTTCTTT GCTCGGTTCC GAAGTGACGA TTCCACCTTC
GGCAAGCACA ACAAACAGCG CTAA
 
Protein sequence
MMSSRKHRIL STIGAVSCVV SMILSVTVLP AIAYANVTKN PLFVAAMAAA AENNRGQNKS 
GEGGSKKGVQ GEAGKGTGKE VKGDKAKGVD KKEGNRAAKG EAVKDNEVVD NKGSKGNGEN
PNGGGAVDNG SAGKDAGKDA GDAAGNAAKS GKDANNADNK SAKKDGKAGK DCSAGGVEKL
SLDAVKALPA CVASVMATNA DGTISTLSIE PPSGENKCAV SADQRSDEFG KIINEIKKLA
PTDSSFGFYG GKVYAYGRLS KLDGKNYVGL FSGKNISNID NIEKFDISYV TDMSHLFKDS
SLTDFSFLSS WDVSNVTNMQ SMFEGCTGLE DISGLANWKV DNVTNMEGMF EGCTSLKSLK
GLEDWDVSNV RSMSGMFASV IKDQDQHDAL NPVDGYAGEM AIDSVKPLRK WNVSNVMNMN
RMFEGCTSIT DFTGLEGWDT KSVVAMIGMF EYCKSMSSLG FLKKWTVKNV EYMMAMFALC
DKIKNTEGLE NWNVSNVKKM DDMFAECSSL DSISGLSNWN TSGKSSTSKL TSTYRMFDNC
SFLSDLQPLS GWNVGSVTDM HDMFNNCGSL TGLEPLSGWD VGSVKNMNSM FSSCNGLTSL
ESLSKWLNDK SSVTDMSSMF SGCNSLSDLK GLENWNVGNV KNMSSMFSGC ATDIYGSGDD
PNPIGIKGLA DISALSGWDV GSVTDMSSMF SGCNSLSDLK GLENWNVGSV KNMSSMFSHE
VSGSDFDKLT DISGLSKWNV SNVKDMSSMF NSCVTLESIN PLSGWNVSNV KDMSSMFAYC
KKLTGAADLS KWNISKVTNL SSMFSNAGAE SGDSLILDFS DKAFTKSDTP VYADDEGKQR
YFSVDSMFSG FKGTLIANNL KSSGFENNPE DDPIANFASG YIFSLDENGH SNNIVVTNNV
TILDTISNDD DLKKIAYYIP VKAKLMEPGD IKNDNMTCDC NRDSGGTTYT YYVPALYNSS
NLEKKQDEKS QDYAYRIVKN SFASKLRYAI DSQSGKGTST EEPGGDEPGG DEPGEGEGGD
ESGGDTDTGS RSATLKSRVK KLRSARKSRS VRDGDDEDEN PEPEPDEKQG NTYKIMFKKS
SEGGCTWTEL GDGDGAVPVS DPKSPLVFFT ATYYLETFVK PVPLPHTGGQ SAVMFTFLSI
GLFSMFAVAG AFGRHGWVAS VLPSSALAGF SKAFGGGALD IASGAKHCFA HPVVSGSMIK
SAVSAATGFF ARFRSDDSTF GKHNKQR