Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_1288 |
Symbol | |
ID | 8708772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | - |
Start bp | 1536873 |
End bp | 1539620 |
Gene Length | 2748 bp |
Protein Length | 915 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 646483375 |
Product | glycosyl hydrolase, family 31 |
Protein accession | YP_003374476 |
Protein GI | 283783722 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTAG TAAATATGAG CAAGCAAGAT AAATCACAAA TTATCGGTGA AGCTGTGGCG TATTCGGATT CGCAATCTCA AATTTTTGCT GACGTTGCAA ATTCGCAGTA CGCAAAGGCG CAGGATAGTC AGGTAATTAA GGGCGAAAAG TGGCGAATTA CGGTTATTAC GGATTCTCTT ATTCGTCTTG AATGGAGCGA TGATGGAAAG TTTGTTGACA ATCCTACGCA AACTGTGATT TGTCGCAATT TTGCTCAGAA GAGCGTGGAC GCTCAGCCGC AATTTAGTGC GCGCACAAAC GAAAATGAAT GGCTGGAAGT CGAAACTGAA AAGCTTCACC TTACTTATAA TTGCGAAGCT TTTAGCAAAG AAGGTTTAAG TATTGTTGTG CGCGGAGTGC CGTGCACGCA GTTTAATACG TGGCATTACG GCGACGCGTG CCCGGGAAAT CTTCTTGGAA CTTTTAGAAC TTTGGATCAG GCGAATGGCC CGGTAAAACT TGGTAAAGGA ATACTTTCGC GCGACGGATG GTCTATTTTA GACGATTCTG CAACTTGCGA AATTGCTCCG GCTGAGGAAA TAAATGGCAA ACCAAACCCG TATGGGGCTT GGGTTAAGTC GCGTGCGCAA AAAGTTAGCG AGCAATCAGG CAGATACAAG GACTTGTATT TCTTCGGTTA TGGACACCGC TATATTCAAG CAATTCAAGA CTTTTATAAG CTTACGGGCG CTCAGCCGCT GCTGCCACGT TTTGCGCTAA GCAATTGGTG GAGCCGCTAC TATCGCTACC GTCAAGACGA ATATTTGCAA TTGCATAATC GTTTTAAGCG CGAAGGCATA CCTTTTAGCA CGGCTGTTAT TGACATGGAT TGGCATGTGA CTGATGTTGA TCCGAAGTAT GGATCCGGTT GGACGGGCTA TACGTGGAAC GAGGAACTTT TCCCGGATCA TCGTGCTTTT TTGCGTAAAC TTAACGCTGC TGGATTGTCT CCAACTTTGA ACTTGCATCC GCGCGACGGC GTGCGTGCGT TTGAAAAAGA TTATCCGCAA GTTGCGCAAG ATGCTGGCAT TGATCCGGCA AGCGGCAAAG CTGTGGAATT TGATTTAACG AACCCGCAGT TTGTGAAGGC TTACTTCAAC ATGCATCATC GTATGGAAGA TGAAGGTGTG CGATTTTGGT GGATTGACTG GCAGCAGGGG GGAGTGACGC GCGAGCCTGG CTTGGATCCT CTTTGGATGC TGAACCACAT GCATTATGAG GACGCTGCGC GCGAAGGTCG CTGGCCTATT ACGTTCTCTC GTTATGCTGG CCCTGGTTCG CATCGCTACC CAGTTGGATT CTCGGGAGAT ACCGTTACTA CTTGGGATTC CCTTGCTTTC CAAACGTACT TCACTTCTAC AGCTTCAAAT ATTGGCTACG GATGGTGGAG TCACGATATT GGCGGACACA TGCTTGGCGT TAGAAATAAC GAGTTGGAAG CGCGATGGTA CGCTTTCGGC GCGTTTAGTC CTATTAATCG TTTGCACTCG ACTTGCTCAC CGTTTGCTGG AAAAGAGCCG TGGAATTTCC CTCAGGAAAC TCGCGAAGCA ATGGTGAAAA TGCTTCGATT GCGCGCGGAA CTCTTGCCAT ACGTTTATAC GATGAATTAT CGTGCAGCTT TTGAAGGCAG GCCGATTATT GAGCCAATGT ACTGGCAGTC GCCGGAAGTT GGCATGGCTT ACGAAATTCC TAACGAATAC CGTTTTGGAA GCGAGCTTAT AGTAGCACCG ATTGTTTCGT CTAACGATGC GGCTGCTTTG CGCGGATGTG CTGGAGTTTG GTTGCCGGAA GGCGATTGGT ACGATTTGTT TGACGGACGC AGGTATGTGT CTCGCTGCTG GAATGGCCGC AGATTTGAAG CGTGGCGTTC ACTTGACCGC GTGCCGGCTT TTGCGCGCGC TGGAGCGATT GTACCTCTGC AAGTTTTGCC GGAAGTTGCA GATTGCGAGC GTAGTGCTGA GGCCGCTGAA TCTGTTAATA GCATTGAAAA TCCGCGGGCT TTGCGAGTTC TTGCATTCCC TGGAGCTGAC GGCGAATTTG TGATGCGAGA AGATAACGGC GATTTTGCTG CCGCGTCTGC TGGAAATACT GCTAATACAC GTATGAATTT TGTGTGGCGA GACGGCAATG GTTCTTCGCA ATTTATTATT TCCGGTGTTG CAGGATGTGA TGCTGCCGTA GAATCTGTGC CGCAAAAACG CAACTGGAAC GTTGTATTTA GAGGAGTTGC TTGCGCAGAT TTTGCTCACG TGCGCGTGTT TGTTGGGAGC CGAGAGCTTA ACGCAAGCGA GTTTGCGGTT TCTTACGAAG GCGAAGAGGT GACTTTGAGC TTGTCGGTTT CTGTGAAAGA CGTGCCAGCT CGTTCGGAAG TTCGCGTAAT TGTTGACGGT GGATTGCAAA TCGCTGCCGA CCCTAAAGTT GGCGACGCTT ACCGATTCCT GCTTCAAGCG CAAGTGCCGT ACAGAGGCAA GGAAATGGCT TTTGATGCAG TGCGAGATTC TGGCGGAAGT GCAAGTGCTA TTGCTGAAAT CTCAACTCTT GAATACGAGA ACGAATCGGA AGCAGAAAAG TATCGCAACA GCGTTGATAT GCTTAACGCT TACGCAACCG ACCAGCCTTC CGTAGTTAAG TGGGCGCAAT GGCGTTGCAC ACTGCCTGTT TCGGTAAAGC ACGCGCTTGA GGAGATTTTG CTGCGCTCGG TTGAGTAA
|
Protein sequence | MSVVNMSKQD KSQIIGEAVA YSDSQSQIFA DVANSQYAKA QDSQVIKGEK WRITVITDSL IRLEWSDDGK FVDNPTQTVI CRNFAQKSVD AQPQFSARTN ENEWLEVETE KLHLTYNCEA FSKEGLSIVV RGVPCTQFNT WHYGDACPGN LLGTFRTLDQ ANGPVKLGKG ILSRDGWSIL DDSATCEIAP AEEINGKPNP YGAWVKSRAQ KVSEQSGRYK DLYFFGYGHR YIQAIQDFYK LTGAQPLLPR FALSNWWSRY YRYRQDEYLQ LHNRFKREGI PFSTAVIDMD WHVTDVDPKY GSGWTGYTWN EELFPDHRAF LRKLNAAGLS PTLNLHPRDG VRAFEKDYPQ VAQDAGIDPA SGKAVEFDLT NPQFVKAYFN MHHRMEDEGV RFWWIDWQQG GVTREPGLDP LWMLNHMHYE DAAREGRWPI TFSRYAGPGS HRYPVGFSGD TVTTWDSLAF QTYFTSTASN IGYGWWSHDI GGHMLGVRNN ELEARWYAFG AFSPINRLHS TCSPFAGKEP WNFPQETREA MVKMLRLRAE LLPYVYTMNY RAAFEGRPII EPMYWQSPEV GMAYEIPNEY RFGSELIVAP IVSSNDAAAL RGCAGVWLPE GDWYDLFDGR RYVSRCWNGR RFEAWRSLDR VPAFARAGAI VPLQVLPEVA DCERSAEAAE SVNSIENPRA LRVLAFPGAD GEFVMREDNG DFAAASAGNT ANTRMNFVWR DGNGSSQFII SGVAGCDAAV ESVPQKRNWN VVFRGVACAD FAHVRVFVGS RELNASEFAV SYEGEEVTLS LSVSVKDVPA RSEVRVIVDG GLQIAADPKV GDAYRFLLQA QVPYRGKEMA FDAVRDSGGS ASAIAEISTL EYENESEAEK YRNSVDMLNA YATDQPSVVK WAQWRCTLPV SVKHALEEIL LRSVE
|
| |