Gene HMPREF0424_0110 details


Gene Information

Locus tag: HMPREF0424_0110
Symbol: rho
ID: 8709503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 130733
End bp: 132703
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 45%
IMG OID: 646482231
Product: transcription termination factor Rho
Protein accession: YP_003373377
Protein GI: 283782623
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
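
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(130733, 132703)
print(length_bp)                  # 1971
print(protein_length(length_bp))  # 656
```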


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAACAG GCCAGAATCT TGAAAAAATG AAGCTTTCAG AATTGAAAGA TCTTGCGAAA 
CAAATGGGTT TGCGTGGTAC ATCTACAATG CGTAAGCCTG AGTTGGTGGC TACACTGACG
GCTGCTCGTA ATGGGGGAGA AGCGCCTGCA GGTGTAAGTG TGCGTGTTCC TCGTGATGTT
GTGAATGCTA ATAAAACTGC AGATACTAGT ACGGATATAG AAGATGTTAG GTCATCGAAA
GACAATAGTG ATTTAGAAGC GTTAGAAGTT TTGGTTCCCA ATGCTGCTTC AGATCGTCGA
TATAGAGACG AAGAAGATTT TGGGAATAAA AATTATCGAC GCGATGCGAA TCGTAATAAT
ACATTCCAGC GTCGTCGTTC CTCAGAAGAT CGTAATGATA ATCGCGATCG TAGAGAAGAT
GGGATGGATT CGCATGAGTT GGATCAAATT TTGGCAACAT TACCTGGTGA AGATTCTCAC
GCAGAGGGTG AGCAGCGTCG TCAGCGTGTA GCTTCTCGTG ACTTTGATAG AGAAGAACAG
CAAAATCGTG CTGATCGTTT CCAGCGTCGT ATGCGCGGTC GTGCGCGTGA TTACGATGAG
TCTCGTTCAG ATTATTCAGA TCGACAGAAT CGTTCAGAAC GCATGGATCG AGTAGAACGT
GACGATCGTG ATGAACGTCG TGCTGAGCGT GGAGAACGTC AAGATCGTAA TGATCGCAAT
GTTCGTCTTG ATCGCGATGC GCGTGATGCA CGTGACTCTC GTGAAGAACA TGATTTACGC
GAGATTCGCC GTGAAGAGCC TCAGGAAGAG CTCGTACCAG TAGCCGGAAT TGTTGATGTC
TTAGAATCTT ACGCATTTGT GCGCACTTCT GGTTATTTGC CTGGTCCTAA CGATGTGTAT
GTTTCAATGA GCCAAATTAA AAAATATGGT TTACGCAAGG GTGATGCAGT TCAAGGCTCC
ATTCGTGCTC CTCGCGAGGG TGATCGCAGG AATCAGCGTC AGAAGTTTGT TCCGTTGCAA
ACTGTTAATA GCATTAATGG CATGAGCGTT GAAGAAGCTC AGTCTCGACC GCAATTTGCT
AAGTTGACGC CGTTATATCC GCAGGAGCGT TTAAAGCAAG AAACTACTCC TAATCGCATG
TTGGGACGTG TCATTGATTT AGTTGCCCCA ATTGGTAAAG GTCAGCGCGG TTTGATTGTG
TCGCCACCAA AGGCTGGTAA AACTATTACT CTTCAGAATA TTGCAAATGC AATTACTACA
AATAATCCTG AAGTGCATCT TATGGTTGTG CTTGTCGATG AGCGTCCAGA AGAAGTTACC
GATATGGAAC GCACTGTGCA AGGCGAAGTG ATTTCTTCTA CATTCGATCG TCCTGCAACG
GATCACACTA CTGTTGCTGA GCTTGCTATT GAGCGTGCTA AGCGTTTGGT TGAGCTTGGT
CAGGATGTTG TGGTTTTGCT TGATTCTATG ACTCGTTTAG CTAGAGCCTA CAATATTGCT
GCTCCTACTT CTGGTCGTAT TCTTTCCGGT GGTGTTGATG CTCAGGCGTT GTACCCACCA
AAGAAGTTCT TCGGTGCTGC TCGAAATATT GAAGATGGCG GCTCTTTGAC TATTATTTCT
TCTGCATTGG TTGAAACCGG CTCTAAGATG GATGAAGTAA TCTTCGAAGA GTTTAAGGGC
ACTGGAAATA TGGAATTGCG CTTGAGTAGA GATTTGGCTG ATAAGCGTCT CTTCCCAGCA
GTAGATATTA ATGCCTCTGG AACTCGTCGT GAGGAGTTAA TTACTCCAGC TGCAGATTTG
CCAGTTATTT ATCGTTTGCG CCGTCTCTTT GGTGGTCTTG AGGCAGAGCA AGCGTATCAG
ACTTTGATTC CTCGATTGAA GAAAACTGCT TCAAACCGTG ATTTCTTAGC TGCTATAACT
CAACAAACTG GCACTGCTAC GAATAATTCT TCTTCTACGA CTATTGCGTA G
 
Protein sequence
MATGQNLEKM KLSELKDLAK QMGLRGTSTM RKPELVATLT AARNGGEAPA GVSVRVPRDV 
VNANKTADTS TDIEDVRSSK DNSDLEALEV LVPNAASDRR YRDEEDFGNK NYRRDANRNN
TFQRRRSSED RNDNRDRRED GMDSHELDQI LATLPGEDSH AEGEQRRQRV ASRDFDREEQ
QNRADRFQRR MRGRARDYDE SRSDYSDRQN RSERMDRVER DDRDERRAER GERQDRNDRN
VRLDRDARDA RDSREEHDLR EIRREEPQEE LVPVAGIVDV LESYAFVRTS GYLPGPNDVY
VSMSQIKKYG LRKGDAVQGS IRAPREGDRR NQRQKFVPLQ TVNSINGMSV EEAQSRPQFA
KLTPLYPQER LKQETTPNRM LGRVIDLVAP IGKGQRGLIV SPPKAGKTIT LQNIANAITT
NNPEVHLMVV LVDERPEEVT DMERTVQGEV ISSTFDRPAT DHTTVAELAI ERAKRLVELG
QDVVVLLDSM TRLARAYNIA APTSGRILSG GVDAQALYPP KKFFGAARNI EDGGSLTIIS
SALVETGSKM DEVIFEEFKG TGNMELRLSR DLADKRLFPA VDINASGTRR EELITPAADL
PVIYRLRRLF GGLEAEQAYQ TLIPRLKKTA SNRDFLAAIT QQTGTATNNS SSTTIA
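
The protein sequence corresponds to a translation of the gene sequence under translation table 11, in which the GTG start codon is read as methionine. A minimal sketch of that translation follows, with some assumptions. The codon assignments are the standard ones, which table 11 shares; it differs in its set of permitted start codons. The first codon is taken to be a valid initiator without being checked. The names `CODON_TABLE` and `translate_cds` are illustrative only.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base groups is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 in length")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    # Assumption: the first codon is an initiator (e.g. GTG), read as Met.
    protein[0] = "M"
    # Drop a terminal stop codon if present; internal stops stay as '*'.
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCAACAG GCCAGAATCT TGAAAAAATG"))  # MATGQNLEKM
```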