Gene HMPREF0424_1224 details


Gene Information

Locus tag: HMPREF0424_1224
Symbol:
ID: 8709213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 1454110
End bp: 1457391
Gene Length: 3282 bp
Protein Length: 1093 aa
Translation table: 11
GC content: 46%
IMG OID: 646483312
Product: S1 RNA binding domain protein
Protein accession: YP_003374417
Protein GI: 283783663
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1530] Ribonucleases G and E
TIGRFAM ID: [TIGR00757] ribonuclease, Rne/Rng family
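
The coordinates and lengths listed above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
length = gene_length_bp(1454110, 1457391)
print(length)                     # 3282
print(protein_length_aa(length))  # 1093
```

Both results match the Gene Length and Protein Length fields of the record.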


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000769273
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCCCAGAA TAATAAGAGA ACGCTTCAAT GAATCTGAAG CTGAAAATAG TAGAACTGTG 
TCTAAAACCG CTACTTCTGC GGAAATGTCC GGAATAATGC AATCTACAGA TTTTGCGGAA
AGTAATATAG CTGCCGGCGA TACTTTGCAG CAAAGCTCTG GCGTTACGCG TCGCCGTCGC
GGTCGACGAG TTACTCGTGG TGCCGGTGCG GCTGGAGAAG CAATGGAACT AAAAGTTGAG
GATTCTAGTA CGAAGGATTC TGAGCCGCAT GTTGTTAAAT CGCGCCCGAT AACTTCGCTC
CTATTCCAGG AGCCAGTTCT TCCTAAGGTA GTGAACAATC GTAGTGAAGA AAACAGCAGT
TCGCGTGATG ACTTGCGTGA CAACTCGCGT GATGTCTTGA ATGATGATTT GCGTGATGAT
TTACGTGATA TTTCTCGCGG AAGGCGCGGT GGACGCAACA GCCGCGATAG CCGTGATAGT
CGCGATAGTC ATGATAGCCG TGATAGTAGC GAACGCAAGT CACGTTCTGA AAGACGTGAG
AATTCAAGCG AAATAAAGCG TGATAGTGGC AGAGGGAATA GTCGTGAAAC ATCGCGTGAA
ACAAGTCATT CTATGTATTC TCAAGATTCT GGGTACGCTA AGAATTTTCA GGATTCGCAA
GATGATCGCG ATGAAAAAAC TCGCAAAAGC CAGCGTTCTC GCCGTTCTCG CAGATTGAAT
GCTGTTGAGC GACGTGCTGC AGCTGAAGTT GAGCAAATTG AAGAGGATCT TGCATACGAC
AATATTGTTT ATGCTCCCAT TGATACGCGC ATCGATGAAA CTCTTGATAT TAGCGATATT
TTATCTGCGG AAAATGCTTC AAATAGGCGT TCTAATCGTC GCAATATGTC GAATAACATC
TCTAATGATG ATGACGCTCG AAATGAAAAT GGTGCAGTAG GCAGCAGAGA TGGTAGCAAT
ATCAGAGATA ACAGAGATAG CCGCAATAGC AGTGACGAAG GCGATAACAG AGGTATAAAC
GACGATTATA ATGCGAATAA TGGCAACGAT GACGATGAAA TAATGATTAC GCGTCGCCGT
CGTCGCCGTC GTAGCTCTCG CGCTGATCGC AACGAAAGTC GCAACGAAAG CCGCGATTTT
GACATCGATA TTTCTAATGA TATTGACGAT GACGCGAATT CTCGCACAAA ACGCGCAAGA
ATAGATGAAG TTTCTCAAGA AGATCGCGAT TTAATTCCAG ATCCAGAAGA ACTTCGTGAG
CAAGATGAGG AAGAAACTCT TTATACGCGT CGCCGTCGTC GCCGCCGTTC TAAATCCTCC
GATGACGCAA ATTCCGCGTC TTCTCAGGAA GAGTCATCCG ACTACACTAA TGCGAACGAG
GATGACTTAA CTTCTCGCCG CTCGCGCAAG CAGCAGTATA TAGACGAAAT TACGGATATT
GAAGGCTCTA CGCGACTTGA GGCAAAGCGC CAACGACGTA GAGATAATAG ACGTGAGCGT
AGCCGTCAGA ACCTTCTTAT GGAGCAAGAT TTCCTTGCTC GTCGCGAAAA TGTTGAACGA
TTAATGGTTG TTCGTGAGCG CGATCGCCAC ACGCAAATTT CCGTTATTGA AGACAATATT
CTTGTAGAAC ATTATGTTTC TGATATTCAA GAAGTTGCTA CTGTTGGAAA TATTTATCTT
GGGCGCATTC AAAACGTGCT TCCAAGCATG GAAGCGGCAT TCGTAGATAT TGGTCAAGCC
AGGAATGGCG TGCTTTATGC AGGCGAAGTT AATTGGGATA TGGCTCGTCT TGAAGGTCAG
CCTCGTAGGA TTGAGCTTGC GTTTAAGTCT GGTGATCCTG TGCTTGTGCA GGTTACTAAG
GATCCAATTG GGCATAAGGG TGCTCGATTG ACTTCGCAAG TAACTCTTGC AGGGCGTTTC
TTAGTGCTTG TGCCATCTGG CGGCATGACC GGTGTGAGTC GCAAGCTTTC GGATCGAGAG
CGCTCGCGTT TGCGTGCTAT TGTGTCGAAG ATTGCACCAA AAGATATGGG CGTTATTATT
CGTACTGCTG CTGACGGCGC AAGCGAGGAG TCGATTGAAA AAGATTTAGA TTCGCTTGTT
AAGCAGTGGG AGCGCATTGA AGCTAAGCGC GAAGAATACT TGCATGGAAA GCGCCCTAAG
CTGTTACAGG GTGAGCCTGA TGTGGCAATT CGTGTTGTGC GCGATATTTT CAACGATGAC
TTCCAAAAGC TTATTGTTGA AGGCGAGGGA GTGCGTCAGC GAATTGATGA ATATCTTGAG
ATGATGGCTC CAGATTTGCG CAGCAAGGTT GAATATTGGG ATCCTGCTGA GCATGAAGGC
AAGGACGTTT TCGATAAGTG GCAGATTGAT AGTCAACTTC GTAAGGGTAT GGAGCGTCAG
GTTTATTTGC CGGCTGGTGG CTCTATCGTG ATTGACCGTA CGGAAGCAAT GACTACGATC
GATGTGAATA CTGGACGATT TATTGGGCGT GGTAAGTCTT TGGAAGAAAC GGTTACTCGT
TGTAATTTGG AGGCTTCTGA AGAGATTGCG CGCCAGCTTC GTTTGCGCGA TATTGGTGGC
ATGGTCATGA TCGATTATGT CGATATGGTT ATGCCAGCGA ACCGCGATTT AGTTTTGCGT
AGGCTTTTGG AATGCTTGGC TAGGGATCGC ACGAAGCATC AAGTTGCTGA AGTTACGTCT
CTTGGTCTTG TGCAAATGAC GCGTAAACGC GTTGGTCAAG GCTTAGTTGA AGCGTTCTCT
GAAGAGTGTC CAACGTGCAA GGGACGTGGG TTTATTTTGC ACGATGAGCC TACTGTTTCT
TCGGAATATG CAGATCCTTA CGCGCTTAAG GGCGGAGATC CTTTTGTTAA AACAAACAAG
TATGGTCACG GAAGCGCAAC GGAACAGCCT GCGCCTGTTA AGCAGCAGGG TTCTAGTGCG
GAAGTGAGAG CTAAGTTAGC GCGTATAGCT GCCGCGTCTG TAATTACTGA CGCAGCTCAT
AAATCTGAAG AAGAAACGCA GTATAGCAAT TTTTTGAATT CTGATGAAGA GTCTGTATCT
TCTAAGGCGG ATGAAACTAG CGAAGTTGTG GATGCTGGCG AAACCGCGGA AACTAGTGAA
GTGGATAAAA CTTCTGAAGA ATTAGTTGAA AATAATTTAG AGGATACTAC GCAAGAGATT
GCTGAGTATA CTGCGGAGCA ACCTACGCAA GAAACTATGC AGGAAACTGC GCAAGATAAT
ACGCAGGAAA TTATGCAAGA AACTACAGAA AGTGATGATT GA
 
Protein sequence
MPRIIRERFN ESEAENSRTV SKTATSAEMS GIMQSTDFAE SNIAAGDTLQ QSSGVTRRRR 
GRRVTRGAGA AGEAMELKVE DSSTKDSEPH VVKSRPITSL LFQEPVLPKV VNNRSEENSS
SRDDLRDNSR DVLNDDLRDD LRDISRGRRG GRNSRDSRDS RDSHDSRDSS ERKSRSERRE
NSSEIKRDSG RGNSRETSRE TSHSMYSQDS GYAKNFQDSQ DDRDEKTRKS QRSRRSRRLN
AVERRAAAEV EQIEEDLAYD NIVYAPIDTR IDETLDISDI LSAENASNRR SNRRNMSNNI
SNDDDARNEN GAVGSRDGSN IRDNRDSRNS SDEGDNRGIN DDYNANNGND DDEIMITRRR
RRRRSSRADR NESRNESRDF DIDISNDIDD DANSRTKRAR IDEVSQEDRD LIPDPEELRE
QDEEETLYTR RRRRRRSKSS DDANSASSQE ESSDYTNANE DDLTSRRSRK QQYIDEITDI
EGSTRLEAKR QRRRDNRRER SRQNLLMEQD FLARRENVER LMVVRERDRH TQISVIEDNI
LVEHYVSDIQ EVATVGNIYL GRIQNVLPSM EAAFVDIGQA RNGVLYAGEV NWDMARLEGQ
PRRIELAFKS GDPVLVQVTK DPIGHKGARL TSQVTLAGRF LVLVPSGGMT GVSRKLSDRE
RSRLRAIVSK IAPKDMGVII RTAADGASEE SIEKDLDSLV KQWERIEAKR EEYLHGKRPK
LLQGEPDVAI RVVRDIFNDD FQKLIVEGEG VRQRIDEYLE MMAPDLRSKV EYWDPAEHEG
KDVFDKWQID SQLRKGMERQ VYLPAGGSIV IDRTEAMTTI DVNTGRFIGR GKSLEETVTR
CNLEASEEIA RQLRLRDIGG MVMIDYVDMV MPANRDLVLR RLLECLARDR TKHQVAEVTS
LGLVQMTRKR VGQGLVEAFS EECPTCKGRG FILHDEPTVS SEYADPYALK GGDPFVKTNK
YGHGSATEQP APVKQQGSSA EVRAKLARIA AASVITDAAH KSEEETQYSN FLNSDEESVS
SKADETSEVV DAGETAETSE VDKTSEELVE NNLEDTTQEI AEYTAEQPTQ ETMQETAQDN
TQEIMQETTE SDD
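
The protein sequence above follows from the gene sequence under translation table 11, where the GTG start codon is read as methionine. The sketch below illustrates this on the first and last lines of the gene sequence. It is a plain-Python illustration rather than the database's own method: the function name `translate_cds` is hypothetical, and `START_CODONS` lists only three of the alternative start codons of table 11, which is an assumption made for brevity.

```python
from itertools import product

# Codon table in TCAG order. Table 11 uses the same amino acid
# assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the table 11 start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "GTGCCCAGAA TAATAAGAGA ACGCTTCAAT GAATCTGAAG CTGAAAATAG TAGAACTGTG"
print(translate_cds(first_line))  # MPRIIRERFNESEAENSRTV

# Last line of the gene sequence above, ending in the TGA stop codon.
last_line = "ACGCAGGAAA TTATGCAAGA AACTACAGAA AGTGATGATT GA"
print(translate_cds(last_line))   # TQEIMQETTESDD
```

The two outputs match the first 20 and the last 13 residues of the protein sequence in the record.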