Gene HMPREF0424_0213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0213 
SymbolgpsI 
ID8709739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp236259 
End bp239039 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content47% 
IMG OID646482332 
Productguanosine pentaphosphate synthetase I/polyribonucleotide nucleotidyltransferase 
Protein accessionYP_003373474 
Protein GI283782720 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1185] Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) 
TIGRFAM ID[TIGR02696] guanosine pentaphosphate synthetase I/polynucleotide phosphorylase
[TIGR03591] polyribonucleotide nucleotidyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGGGTC CCGAAATTAA GGCTGTAGAA GCCGTTATTG ATAATGGTTC ATTTGGTAAG 
CGTACGCTAC GCTTTGAAAC TGGTCGACTT GCTCAGCAAG CAGATGGTGC TGTTGCCGCA
TATTTGGATG ACGATTCCAT GATTTTGTCG ACGACGACTG CAGGATCTAG TCCGAAGGAA
AATTACGACT TCTTCCCATT AACTGTTGAT GTGGAAGAAA AAATGTATGC TGCTGGAAAG
ATTCCAGGTT CGTTCTTCCG CCGTGAAGGT CGTCCTTCTA ACGAAGCAAC GTTGGCATGC
CGTATTATTG ATCGCCCGTT GCGTCCGTTG TTCCCACACA CTTTACGTAA CGAAGTGCAA
GTTGTTGAAA CAATTTTGGC AATTAATCCA GATGATGCTT ACGATGTTAT TGCTTTGAAT
GCGGCGTCTG CATCTACTAT GATTTCTGGT TTGCCATTTG AGGGCCCAGT TTCTGGCGTT
CGTTTGGCTT TGATTGATGG CCAGTGGGTT GCTTTCCCAC GTTGGAGTGA GCGCGAGCGT
GCAGTATTTG AGATTGTTGT GGCTGGTCGT GTGATTGAAA ACGGCGATGT TGCTATTGCA
ATGATTGAGG CTGGTGCTGG TAAGAATGCT TGGAATTTGA TTTACAATGA TGGTCAAACC
AAGCCAGATG AGCAAGTCGT AGCCGGAGGT TTGGAAGCTG CAAAGCCATT TATTAAGGTT
ATTTGCGATG CTCAAAATGA GTTGAAGCGT ATTGCAGCTA AGGAAACTAA GGAATTCCAA
CTCTTCCCAG AATATACTGA CGAACTTTAT GCTCGTATTG ACGAGATTGC TCATGCTGAT
TTGAATGAAG CTCTTTCTAT TGCTGAAAAG CTTCCTCGTC AAGATCGCAT CGCTGAAATT
AAGGAAGGCG TGCGTGCTGC AATTGCTGAA GAATTCACTG ATATGGACGA AGCCGAAAAG
GAAAAGGAAC TCGGCAATGC GTTTAAAGAA TTGCAGCGTC AAATTGTTCG TCGTCGTATT
TTGACTGAAG ATTATCGTAT TGATGGTCGC GGTTTGCGTG ATATTCGTAC GCTTTCTGCA
GAAGTTGATG TTGTGCCTCG CGTACACGGT TCTGCACTCT TCCAGCGTGG TGAAACCCAG
ATTTTGGGTG TGACTACTTT GAATATGCTT AAGATGGAGC AGCAAGTTGA TGCACTTTCT
GGTCCACAAA CCAAGCGTTA TATGCACAAC TACGAAATGC CTCCATACTC CACTGGTGAA
ACTGGTCGCG TTGGTTCTCC AAAGCGTCGT GAAATTGGTC ACGGTGCTTT AGCTGAGAAG
GCTTTGGTTC CAGTTTTGCC AGGTCGCGAA GAGTTCCCAT ACGCTATTCG TCAGGTGTCT
GAAGCTATTG GTTCTAACGG TTCTACTTCT ATGGGTTCCG TTTGTGCTTC TACGCTTTCT
TTGCTTGCTG CTGGCGTGCC TTTGAAGGCT CCAGTTGCTG GTATTGCTAT GGGCTTAGTT
TCTGGTGATG TTGATGGAAA GCATATCTTC AAGACTTTGA CTGATATTCT CGGTGCAGAA
GATGCATTTG GTGATATGGA CTTCAAGGTT GCTGGTACTT CTGAATTCAT CACTGCTTTG
CAGCTCGATA CGAAGCTTGA TGGTATTCCA GCTGATATTT TGGCTGCCGC TTTGCAGCAG
GCACATGAAG CTCGTGCAAC GATTCTTGAA GTTATTAACG AATGCATTGA TTGTCCAGCT
GAGATGAGTC CATTCGCTCC ACGCATTATT ACAACCACAA TTCCTGTAGA CAAGATTGGT
GAGGTAATTG GGCCTAAGGG CAAGATGATT AACCAAATTC AGGAAGAAAC TGGTGCTGAA
ATCGCTATTG AAGATGACGG TACTGTTTAC ATTTCTTCTG AAGGCGGAGA AGCTGCTGAG
AAGGCTAAGG GAATCATCGA TTCTATTGCA AATCCTCGTG TGCCAAAAGC TGGTGAAACT
TTCACAGGCA AGGTTGTTAA GACCACAAGC TTTGGAGCTT TTGTGAATTT GACTCCTGGA
ACTGACGGTT TGCTTCACAT TTCACAGATT CGCAATTTAG CAAATGGTGA GCGCATTGAC
ACTGTTGAAG ATGTGCTGAA GGAAGGCGAT AATGTTGAAG TTATCGTACA AAGCGTAGAT
GAGCGTGGCA AAATTTCTTT GGCTATTCCA GGTTTTGAAG ATCAAGAATC TTCTGCTCCA
GCAGTGCGTG AAAATCGTTC CCGCAGTTCT CGCGACGATC GCGATTCTCG TGACTCTCGT
GGCGATTACC GTCGTTCTCG CCGTGACGAT CGCGATAATC ATGACCGTGC GGATCGTGCA
GATCGCGATG AGCGTCCTCG TCGCCGCATG CGCGATGACC GTGACGATCG AGATCGCATG
GATCGAAATG ACCGATACAT GGATCGCGAT AACCGTTATG ATGACCGCAA CACTCGTCGA
GACGACCGTC GTGCAGATCG TGCAGATCGC GATAACCGTT ACAGCGATAA CGATTTTGAT
GAGCGTCCTC GTCGCCGCGT GCGCGATGAC CGTGACGATC GTTACGAAAA TCGCGATGAC
CGTGACGATC GTGCAGATCG TGACGACCGT GAAGATCGCT CAAATCGTCG AGATTCAGAA
AATCGTCGCG TATCTGATAG AAAGCCGCGT TATGCAGCTG ATGACGATCA CTATGACGAG
TATCGTTCGG CTCGCGAAGA GCGCGCAGAG CGTCCTCGTC GCCGCGTGCG TCGCGATTTT
GATCCATTTG AGGAAGATTA A
 
Protein sequence
MEGPEIKAVE AVIDNGSFGK RTLRFETGRL AQQADGAVAA YLDDDSMILS TTTAGSSPKE 
NYDFFPLTVD VEEKMYAAGK IPGSFFRREG RPSNEATLAC RIIDRPLRPL FPHTLRNEVQ
VVETILAINP DDAYDVIALN AASASTMISG LPFEGPVSGV RLALIDGQWV AFPRWSERER
AVFEIVVAGR VIENGDVAIA MIEAGAGKNA WNLIYNDGQT KPDEQVVAGG LEAAKPFIKV
ICDAQNELKR IAAKETKEFQ LFPEYTDELY ARIDEIAHAD LNEALSIAEK LPRQDRIAEI
KEGVRAAIAE EFTDMDEAEK EKELGNAFKE LQRQIVRRRI LTEDYRIDGR GLRDIRTLSA
EVDVVPRVHG SALFQRGETQ ILGVTTLNML KMEQQVDALS GPQTKRYMHN YEMPPYSTGE
TGRVGSPKRR EIGHGALAEK ALVPVLPGRE EFPYAIRQVS EAIGSNGSTS MGSVCASTLS
LLAAGVPLKA PVAGIAMGLV SGDVDGKHIF KTLTDILGAE DAFGDMDFKV AGTSEFITAL
QLDTKLDGIP ADILAAALQQ AHEARATILE VINECIDCPA EMSPFAPRII TTTIPVDKIG
EVIGPKGKMI NQIQEETGAE IAIEDDGTVY ISSEGGEAAE KAKGIIDSIA NPRVPKAGET
FTGKVVKTTS FGAFVNLTPG TDGLLHISQI RNLANGERID TVEDVLKEGD NVEVIVQSVD
ERGKISLAIP GFEDQESSAP AVRENRSRSS RDDRDSRDSR GDYRRSRRDD RDNHDRADRA
DRDERPRRRM RDDRDDRDRM DRNDRYMDRD NRYDDRNTRR DDRRADRADR DNRYSDNDFD
ERPRRRVRDD RDDRYENRDD RDDRADRDDR EDRSNRRDSE NRRVSDRKPR YAADDDHYDE
YRSAREERAE RPRRRVRRDF DPFEED