Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_0213 |
Symbol | gpsI |
ID | 8709739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 236259 |
End bp | 239039 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 646482332 |
Product | guanosine pentaphosphate synthetase I/polyribonucleotide nucleotidyltransferase |
Protein accession | YP_003373474 |
Protein GI | 283782720 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1185] Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) |
TIGRFAM ID | [TIGR02696] guanosine pentaphosphate synthetase I/polynucleotide phosphorylase [TIGR03591] polyribonucleotide nucleotidyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGGGTC CCGAAATTAA GGCTGTAGAA GCCGTTATTG ATAATGGTTC ATTTGGTAAG CGTACGCTAC GCTTTGAAAC TGGTCGACTT GCTCAGCAAG CAGATGGTGC TGTTGCCGCA TATTTGGATG ACGATTCCAT GATTTTGTCG ACGACGACTG CAGGATCTAG TCCGAAGGAA AATTACGACT TCTTCCCATT AACTGTTGAT GTGGAAGAAA AAATGTATGC TGCTGGAAAG ATTCCAGGTT CGTTCTTCCG CCGTGAAGGT CGTCCTTCTA ACGAAGCAAC GTTGGCATGC CGTATTATTG ATCGCCCGTT GCGTCCGTTG TTCCCACACA CTTTACGTAA CGAAGTGCAA GTTGTTGAAA CAATTTTGGC AATTAATCCA GATGATGCTT ACGATGTTAT TGCTTTGAAT GCGGCGTCTG CATCTACTAT GATTTCTGGT TTGCCATTTG AGGGCCCAGT TTCTGGCGTT CGTTTGGCTT TGATTGATGG CCAGTGGGTT GCTTTCCCAC GTTGGAGTGA GCGCGAGCGT GCAGTATTTG AGATTGTTGT GGCTGGTCGT GTGATTGAAA ACGGCGATGT TGCTATTGCA ATGATTGAGG CTGGTGCTGG TAAGAATGCT TGGAATTTGA TTTACAATGA TGGTCAAACC AAGCCAGATG AGCAAGTCGT AGCCGGAGGT TTGGAAGCTG CAAAGCCATT TATTAAGGTT ATTTGCGATG CTCAAAATGA GTTGAAGCGT ATTGCAGCTA AGGAAACTAA GGAATTCCAA CTCTTCCCAG AATATACTGA CGAACTTTAT GCTCGTATTG ACGAGATTGC TCATGCTGAT TTGAATGAAG CTCTTTCTAT TGCTGAAAAG CTTCCTCGTC AAGATCGCAT CGCTGAAATT AAGGAAGGCG TGCGTGCTGC AATTGCTGAA GAATTCACTG ATATGGACGA AGCCGAAAAG GAAAAGGAAC TCGGCAATGC GTTTAAAGAA TTGCAGCGTC AAATTGTTCG TCGTCGTATT TTGACTGAAG ATTATCGTAT TGATGGTCGC GGTTTGCGTG ATATTCGTAC GCTTTCTGCA GAAGTTGATG TTGTGCCTCG CGTACACGGT TCTGCACTCT TCCAGCGTGG TGAAACCCAG ATTTTGGGTG TGACTACTTT GAATATGCTT AAGATGGAGC AGCAAGTTGA TGCACTTTCT GGTCCACAAA CCAAGCGTTA TATGCACAAC TACGAAATGC CTCCATACTC CACTGGTGAA ACTGGTCGCG TTGGTTCTCC AAAGCGTCGT GAAATTGGTC ACGGTGCTTT AGCTGAGAAG GCTTTGGTTC CAGTTTTGCC AGGTCGCGAA GAGTTCCCAT ACGCTATTCG TCAGGTGTCT GAAGCTATTG GTTCTAACGG TTCTACTTCT ATGGGTTCCG TTTGTGCTTC TACGCTTTCT TTGCTTGCTG CTGGCGTGCC TTTGAAGGCT CCAGTTGCTG GTATTGCTAT GGGCTTAGTT TCTGGTGATG TTGATGGAAA GCATATCTTC AAGACTTTGA CTGATATTCT CGGTGCAGAA GATGCATTTG GTGATATGGA CTTCAAGGTT GCTGGTACTT CTGAATTCAT CACTGCTTTG CAGCTCGATA CGAAGCTTGA TGGTATTCCA GCTGATATTT TGGCTGCCGC TTTGCAGCAG GCACATGAAG CTCGTGCAAC GATTCTTGAA GTTATTAACG AATGCATTGA TTGTCCAGCT GAGATGAGTC CATTCGCTCC ACGCATTATT ACAACCACAA TTCCTGTAGA CAAGATTGGT GAGGTAATTG GGCCTAAGGG CAAGATGATT AACCAAATTC AGGAAGAAAC TGGTGCTGAA ATCGCTATTG AAGATGACGG TACTGTTTAC ATTTCTTCTG AAGGCGGAGA AGCTGCTGAG AAGGCTAAGG GAATCATCGA TTCTATTGCA AATCCTCGTG TGCCAAAAGC TGGTGAAACT TTCACAGGCA AGGTTGTTAA GACCACAAGC TTTGGAGCTT TTGTGAATTT GACTCCTGGA ACTGACGGTT TGCTTCACAT TTCACAGATT CGCAATTTAG CAAATGGTGA GCGCATTGAC ACTGTTGAAG ATGTGCTGAA GGAAGGCGAT AATGTTGAAG TTATCGTACA AAGCGTAGAT GAGCGTGGCA AAATTTCTTT GGCTATTCCA GGTTTTGAAG ATCAAGAATC TTCTGCTCCA GCAGTGCGTG AAAATCGTTC CCGCAGTTCT CGCGACGATC GCGATTCTCG TGACTCTCGT GGCGATTACC GTCGTTCTCG CCGTGACGAT CGCGATAATC ATGACCGTGC GGATCGTGCA GATCGCGATG AGCGTCCTCG TCGCCGCATG CGCGATGACC GTGACGATCG AGATCGCATG GATCGAAATG ACCGATACAT GGATCGCGAT AACCGTTATG ATGACCGCAA CACTCGTCGA GACGACCGTC GTGCAGATCG TGCAGATCGC GATAACCGTT ACAGCGATAA CGATTTTGAT GAGCGTCCTC GTCGCCGCGT GCGCGATGAC CGTGACGATC GTTACGAAAA TCGCGATGAC CGTGACGATC GTGCAGATCG TGACGACCGT GAAGATCGCT CAAATCGTCG AGATTCAGAA AATCGTCGCG TATCTGATAG AAAGCCGCGT TATGCAGCTG ATGACGATCA CTATGACGAG TATCGTTCGG CTCGCGAAGA GCGCGCAGAG CGTCCTCGTC GCCGCGTGCG TCGCGATTTT GATCCATTTG AGGAAGATTA A
|
Protein sequence | MEGPEIKAVE AVIDNGSFGK RTLRFETGRL AQQADGAVAA YLDDDSMILS TTTAGSSPKE NYDFFPLTVD VEEKMYAAGK IPGSFFRREG RPSNEATLAC RIIDRPLRPL FPHTLRNEVQ VVETILAINP DDAYDVIALN AASASTMISG LPFEGPVSGV RLALIDGQWV AFPRWSERER AVFEIVVAGR VIENGDVAIA MIEAGAGKNA WNLIYNDGQT KPDEQVVAGG LEAAKPFIKV ICDAQNELKR IAAKETKEFQ LFPEYTDELY ARIDEIAHAD LNEALSIAEK LPRQDRIAEI KEGVRAAIAE EFTDMDEAEK EKELGNAFKE LQRQIVRRRI LTEDYRIDGR GLRDIRTLSA EVDVVPRVHG SALFQRGETQ ILGVTTLNML KMEQQVDALS GPQTKRYMHN YEMPPYSTGE TGRVGSPKRR EIGHGALAEK ALVPVLPGRE EFPYAIRQVS EAIGSNGSTS MGSVCASTLS LLAAGVPLKA PVAGIAMGLV SGDVDGKHIF KTLTDILGAE DAFGDMDFKV AGTSEFITAL QLDTKLDGIP ADILAAALQQ AHEARATILE VINECIDCPA EMSPFAPRII TTTIPVDKIG EVIGPKGKMI NQIQEETGAE IAIEDDGTVY ISSEGGEAAE KAKGIIDSIA NPRVPKAGET FTGKVVKTTS FGAFVNLTPG TDGLLHISQI RNLANGERID TVEDVLKEGD NVEVIVQSVD ERGKISLAIP GFEDQESSAP AVRENRSRSS RDDRDSRDSR GDYRRSRRDD RDNHDRADRA DRDERPRRRM RDDRDDRDRM DRNDRYMDRD NRYDDRNTRR DDRRADRADR DNRYSDNDFD ERPRRRVRDD RDDRYENRDD RDDRADRDDR EDRSNRRDSE NRRVSDRKPR YAADDDHYDE YRSAREERAE RPRRRVRRDF DPFEED
|
| |