Gene HMPREF0424_0005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0005 
Symbol 
ID8709593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp5200 
End bp7962 
Gene Length2763 bp 
Protein Length920 aa 
Translation table11 
GC content41% 
IMG OID646482128 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_003373279 
Protein GI283782525 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID[TIGR02331] Rib/alpha/Esp surface antigen repeat 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000417696 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGAAAAG TAAGTACCGT TCACACAATT AGTGGAATTG TATTAACTTT TGGTTTGATC 
ATTTCATCGC TGGTTTACAT ACCTTCGCTA GCAACAACCG CAGCGGCAGA AGAAACATCA
CCAGTGCACG TACCTCTTAC CGCACAAACC GCTCACGACA TTTCTACTGG TCGAATTCGT
TTGGAAAATA ATCAGATTGA TTCTTTGCTT TACCATAAAG CCGCGCTAGA TCCAGATCAA
GATAGCGACG GCGACGGTCT GAAGAACAGC GAAGAGCTTT ACACTTACGT AAAAAACGGA
CGCACATATT ACGGATATAA TTCGCATCCA CTTATTAAGG ACACCGACGG CGACGGATTA
GATGACAAAA ACGATAGCAA TCCTCGCAGG TGGGATATTA GCCCTCGCGA TATGGCGCTT
TTTATGGAGC TTGTGTATAG GGAAGACAGC TACGTCAAAG AAGTACTTGA CTACTCAAAA
CCGCTCAGGA TTATTTATGA AAATCGCAAT GAGTACCGCC TTATGCACAA CGAGCTTGCA
CCATTCTGGA AAGTTAAGGA AACCTACCAC TTCGGGAGTG GGTTTGACGC TATCCTTTTT
GAAAATGTGA ATCCTGCGTA CCCATTTATT AAGCAAAACG GCGTGCAAGT ATTAGCAATT
CGCGGCACTA AAGGCGCAAC AGATGTATGG AATGATGTCT TGCTTGGAAC TGGCTCAAAC
CCAGGCCAAG CTGCTGATAT AGATAAATTA ATTGATGAAT ATAAACAAAA GAACACTGTA
AGCAATTTAT ATGTTACAGG TCACTCTTTA GGTGGGTACT TAGCACAAAG AGCCTTAATT
CGAGCTAAAA GTAGTGGCGC AACTTTTAAT ATTAAAGCTT ATACTTTTAA CGCTCCGAGA
ATCAAAGGTA ATATATTCAA CAGATGGCTT TGGGAAACAT CTGATTTTGG CGATAAACTA
ACTAAAGAAG GTTATGCCGT TCATTATAAA GTTGATAACG ATACAGTAAT AGGCGGAATA
GGAAATTTCG AAGGTGCAAT AAGCGTAGGA AGTTCTGGAG CCGGTCATGG ATCCAGAAGC
TACTTTGAGC CTAGAATGAA TGCTTTCCAA GGATTCACAG TTGGCGAACG CAATAAAATT
GAGGGAACCG GTCGCAAAAT TGATGCACTC GACGGATTAG TTAACGAACC AGTTATTAAA
ACTGACGAAC AAGCTTTCGA ACCTAAAATT ACGAATATTG ACATTATGGA AAATGATCCA
TTGCCAAAAG GCACAACAAT AAAGCATCTT CTTAATCCTA ACGATATACC GCAAGGCTCT
ACCATTACAG ACATTACAGA ATACGACAAA ATTGACACTT CTCAATGCGG CAACTACGAA
GGGAAAATAC TACTCGTACT ATCTGGTAGA ATATTAAAAA CTATTATAGT TCCAATAACC
ATTCATAAAC GATTGGCTAA TATCGAGGAA CTTCCAAATG TTACTGCTGA ACCTGGCAAG
AAAATAGAAG TCAAAAAAGA CAATAAAATC CCCGATAATT TCGACTTCAA ACCATACTTA
AAAAACTTAC CAGAAAACAG CACAGTACAA GTTGTAAAAC AACCAAACAC AACAAACACT
GGAGAAACAA CATTTATCAT AGAAATAACC TTTGCAAACA AAGCAAAACA GAAAGTAACT
CTTAAAGCTC ACGTTGTAGA AATCATTCCA GCAACACCAC TAAACCCAGC TAAAAAGAGT
ATTACGTGGG CCGAGTTAAT TCCTGCAAAG AAATCAACAC CATGGATCGA GCTAACACCA
GCGACACCAG CTCCAACACC AACTCCCGTA CCTCCGACAC CTGTTCCACC GACACCGGAA
CCAACGCCTT CGCCTTCGCC TACGCCGACA CCGACGCCTG ATCAACAACC TGCAGTGCCT
CTGCAGCCGA AACCAAACCC GGTGAATCCA ACACCCGTTC CGCCAAAACC TGCGCCTAAA
CCAGAGTCAC AACCTGTTCC ACCAACACCA GAATCTCATT CTGACAACAG TGAACACTCT
AGCAATGGCG AACCTGATAA GTCTGCAAAT AGCACAACCC CACAGACTAA TAAGTTCGAT
CAAGACCATA AGGAAATTCC AAGCATGGGA TTTCCCGAAC CAACACATCC TTCCAAAACT
CTACAAACTC CTGCAGTTAG TAACGATTCT ATGCGCGAAA ACCACATAGA GCTTAACGCA
AACTCAGAAT CACACACATT ACAGCAACAG ATTCCATCAG CACCAAAGAG TATTGCAGAT
TTACTTCCAG CGTTGTGCAA CACTCTCACC GTCGGGAATA ATAACATTGC TTATGCGGGA
AAAAATAACC ATATCACTGT TACAGTAAAT GCAACATCAG AATTCATGCG CAGATTACAA
CTAGAAAAAG CAGTAAAAGC TTATGCCTAC ATATATTCCA ACCCTAAACT TCTTTATAGC
GCAAATGGAA TGAAATATGT AACAGTTCAC ATTAATGAAT ATGGCAGAAT TGTATTCGAC
GTAGTTATCC CAGAAGAATA TAAAGGCAAT CATACGATTG TCCTTATTGA CGAAAACGGT
AAACAAGTTG CGTGGACGAA TACTTTAGTA AAGAAAGACC CTCATTTACA CAAACAATCC
ACAATAGCCA ATAATGCACT CCCTCGTACT GGAATACAAA TAATTCCAAT TGTACTTAGC
GTTGCATTAA TGCTGTTTGT TAGCACAATT GTGACAATTG CAAGAAAAAG AAAGTTTAAC
TAA
 
Protein sequence
MRKVSTVHTI SGIVLTFGLI ISSLVYIPSL ATTAAAEETS PVHVPLTAQT AHDISTGRIR 
LENNQIDSLL YHKAALDPDQ DSDGDGLKNS EELYTYVKNG RTYYGYNSHP LIKDTDGDGL
DDKNDSNPRR WDISPRDMAL FMELVYREDS YVKEVLDYSK PLRIIYENRN EYRLMHNELA
PFWKVKETYH FGSGFDAILF ENVNPAYPFI KQNGVQVLAI RGTKGATDVW NDVLLGTGSN
PGQAADIDKL IDEYKQKNTV SNLYVTGHSL GGYLAQRALI RAKSSGATFN IKAYTFNAPR
IKGNIFNRWL WETSDFGDKL TKEGYAVHYK VDNDTVIGGI GNFEGAISVG SSGAGHGSRS
YFEPRMNAFQ GFTVGERNKI EGTGRKIDAL DGLVNEPVIK TDEQAFEPKI TNIDIMENDP
LPKGTTIKHL LNPNDIPQGS TITDITEYDK IDTSQCGNYE GKILLVLSGR ILKTIIVPIT
IHKRLANIEE LPNVTAEPGK KIEVKKDNKI PDNFDFKPYL KNLPENSTVQ VVKQPNTTNT
GETTFIIEIT FANKAKQKVT LKAHVVEIIP ATPLNPAKKS ITWAELIPAK KSTPWIELTP
ATPAPTPTPV PPTPVPPTPE PTPSPSPTPT PTPDQQPAVP LQPKPNPVNP TPVPPKPAPK
PESQPVPPTP ESHSDNSEHS SNGEPDKSAN STTPQTNKFD QDHKEIPSMG FPEPTHPSKT
LQTPAVSNDS MRENHIELNA NSESHTLQQQ IPSAPKSIAD LLPALCNTLT VGNNNIAYAG
KNNHITVTVN ATSEFMRRLQ LEKAVKAYAY IYSNPKLLYS ANGMKYVTVH INEYGRIVFD
VVIPEEYKGN HTIVLIDENG KQVAWTNTLV KKDPHLHKQS TIANNALPRT GIQIIPIVLS
VALMLFVSTI VTIARKRKFN