Gene HMPREF0424_0871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0871 
Symbol 
ID8709435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp984221 
End bp986119 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content38% 
IMG OID646482971 
Producthypothetical protein 
Protein accessionYP_003374088 
Protein GI283783334 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAGG CGGTCGCAGA TGGACTAACT GTTTCTATTA AGTATCATCC TCGTATTGCC 
AAGGTGTTAC TTGATAAGAC CAAAGCTAAA GAAATTGAAA ATTACTATAA AAAGTGTGCC
GATGATGGTG CAACGTATGA TGATATTGAA GCCAGTAAAC GTGCAATGAG TTCTATGGAA
GTTATCCTTG GCGAGCCTTC TCGCTTGGAG AGGCTTGCTA TAGATATTCA TGACCATTAT
ATTTCATCTT GTGATAGTGA CCCAGATAGA ATTCAGAAAG CAATGATTGT CTGCTCTAGC
AGAAAAATTG CATATTCACT TCTTTTGAAA TTCAAGGATA AATATCCGGA ATGGTTTGAA
GAAAAGAAAA CTCCTGACGG AGTTACTGCA ACAGATGAAG AACTTAAAGA ATTAAAGCCT
ATGCCATTTA TGGCTATGGT ATCGAGTGTT GGAAGTAATG ATGAAGCTGA GATGTATAAC
TATCTTGGTG GAGTTAAAAA TGATAAACGT TGCGAAGAAC TGGATGCAGC CTTTAAACAG
GAAAAGTCTA ATTTCCATAT TGTTATCGTA GTTGATATGT GGATTACGGG ATTTGATGTT
CCGTCACTTA CATATTTATA TAACGATAAG CCTTTGAAGA AGCACTTGCT GATTCAAACT
ATCAGTCGTG TAAACAGAAA ATATCCCGGC AAAGAATATG GTATGGTTAT CGACTATATT
GGCATTCGCG ATAATATGCG TGAGGCCATG AAGGTTTACG GTGGTGATAC ATCAGTTGCT
CCCACATCGG ATGACGTTGA GCAGGCTACC TCTGCATTCA GGGAGGAGAT TGAAATACTT
AAAGCACTGT TTACAGATTA TGACTTAGCT CCATTCCTTA ATACGAAATG TGACCCTGTT
GAAAGATATA AATTACTTTC CAAAGCGGCT GAGTATGTCT TTACATCAGC ACAGGTCTTG
CAGACAGAAG GAAACGGAAA AACTAATCAG GTCTCATTTA AGACATATTT CTTAAAGTCT
GTTAAGAGAA TGAGAAGTGC TTTTGATATT TGCCAACCTT CTGGAAATCT GGGCGAAGAA
GAATCTGCGC TTGCACAGTG CTTTATGGCT ATAGCAGGCT TCGTTCGTAA AATGAGTGGA
ACCAGCGATG TTGATGCTGA AACTATGAAT CTTGTGGTAT CAAAGATGGT TGAAGAGGCA
TTAAAATACA ATAAGGTAGA AAGTGTTCTT GAAAGTGGCG AGGAAGAAGA TATCTTCTCT
CCTGAATACT TTGAAAAATT ATCAGATGTA GAAATGCCTG CTACAAAGTT AGAATTACTT
ATCAAAATGC TCCGTAAGCA AATAAAGGAA TATGGCAGGG TTAACCAATT AGCAGCGAAA
TCTTTCCAAG AAATGATTGA AAAGACTATT GATGAGTACC ATGAAAGACG TAAGCACCTT
ACTGCTGAAG AAGCCGGGGC AACACAGGAA GCTGCCTCAG AAGATATTAT TAAGGCTGCA
ACTGAACAAG CTTTAGTGAT ACTTCGTCAG ATGAACGAAA ATCGTGAGAG TTTCCGTAAA
ATAGGATTAA CCTTTGAAGA AAAGGCATTC TATGATATTT TGATTGCTCT TCGTGACCAG
TATAAGTTTG AATATGGCAT TGACAAAGAA GTTAACGGTG TTGTGATAAA CGATAAATGC
AAGACACTTG CTAAAAAAGT TAAGGAAATA ATAGATACCG AGTCTTCATT TGTGGACTGG
CTCAACAATC AGAATGTGCG TAATCAGTTA AAATTGAAGA TAAAGATTTG TTTGGTTAAG
AACGGCTATC CGCCACAGTA TAGTCCTGAA GTGTTCACAA AGGTGATGGA GCAGGTAGAA
AATTTTGAGG AGCATTCTGA AACAACTTCA GAAAGTTAG
 
Protein sequence
MDQAVADGLT VSIKYHPRIA KVLLDKTKAK EIENYYKKCA DDGATYDDIE ASKRAMSSME 
VILGEPSRLE RLAIDIHDHY ISSCDSDPDR IQKAMIVCSS RKIAYSLLLK FKDKYPEWFE
EKKTPDGVTA TDEELKELKP MPFMAMVSSV GSNDEAEMYN YLGGVKNDKR CEELDAAFKQ
EKSNFHIVIV VDMWITGFDV PSLTYLYNDK PLKKHLLIQT ISRVNRKYPG KEYGMVIDYI
GIRDNMREAM KVYGGDTSVA PTSDDVEQAT SAFREEIEIL KALFTDYDLA PFLNTKCDPV
ERYKLLSKAA EYVFTSAQVL QTEGNGKTNQ VSFKTYFLKS VKRMRSAFDI CQPSGNLGEE
ESALAQCFMA IAGFVRKMSG TSDVDAETMN LVVSKMVEEA LKYNKVESVL ESGEEEDIFS
PEYFEKLSDV EMPATKLELL IKMLRKQIKE YGRVNQLAAK SFQEMIEKTI DEYHERRKHL
TAEEAGATQE AASEDIIKAA TEQALVILRQ MNENRESFRK IGLTFEEKAF YDILIALRDQ
YKFEYGIDKE VNGVVINDKC KTLAKKVKEI IDTESSFVDW LNNQNVRNQL KLKIKICLVK
NGYPPQYSPE VFTKVMEQVE NFEEHSETTS ES