Gene HMPREF0424_0425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0425 
Symbol 
ID8709814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp462642 
End bp465593 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content41% 
IMG OID646482540 
Producthypothetical protein 
Protein accessionYP_003373672 
Protein GI283782918 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000374386 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTCCGC AGCGTCGTCG TAACAGTCTT TCTAAGATTA TGCGCGTAAT TTGCGGTTTC 
ACGTCCTGCG TGCTGACTCT AGCAACTCTC GCGATTGTGC CTATCACTCC GTCGAGTGCT
TCAACCGAAG AAACAACGCC TAAGACCTCG AAGAGTGCTG AGATTAATGG TAATCAAGAG
CTTAAGAATA AAGACGAACA TAAGTCTGAT AAAAAAGCTG AGCAGCAACC TGTTAATAAT
CTTGAACAAC ATACTTCCGC AACGGCTAAT ACAAATCCTA AGAACGAAGT TTCCAATAAA
GATAAAGAAG CTGATCAAAA CAGTAAAAAT AAAAAAGCTA AAAACGCTGC TGAAGCCGCG
CAAACTGGTT CGAACGGTGA CAGAAGTTCT ACCAACACTA CCGAAAACGG AGCGCAATTC
TATAATCCTG AGTATTTACG TACTCTGAAT ATTGTTGCTG GAAATCTTTC TACAAGTGCA
AACAATAAGT CGGGCGCTCC TAGGAGTAAG CCAATTCACG GCAATAGGAA AATTAAGCGT
GGATATCTTC CTGTTGGTAC GCGGTTTGAA ATTAAGCCAG ATGCAAATGA TAAAGAAGAC
GCGTACGAGT GGGCTAATTT TGAAGGCGAG CAATGCGCTG TAGCCAATGA TGCGTCAGCC
AATAATAAAC AGTGCAAGGA AACGAAAGCT ATTCCATCCG ATAATTCAAA AAGTTATGGC
GTTATAACTT TCCGTCCAAA TAGGTGGACT AAAGCAGGTC GTTACAAAGT GCATGTAATT
GTGCACTATC CAGATGGAAA ATCTACTTCC GATGATGCAA ATGCTGGAAA TAGCAAAGGT
GGTAAGCCTT CGCCTGTTTA TGCAAATGTA GTTGTTACGC GATTTGATCC TCACGATAGT
GATTTACAAC TTTCAATAAT TAGCAAAGAC AAAGAAGATT TGAAATATGG ACAATCTGAT
AACTCAGATT TAGTGTTATT GGCTGGTCAA GAGGTTAAGA AAACCACTTT TGATGCTTCT
GCGCATTTTG GCATAGGCAA TATTAATCAG CGTGTGATTT GCTATAAGAA AGATAAGAAT
GGAAATCCTG TTGATGGAAA ATACGAGTCT GGTGGAATTA ACGGGCTTGA ATTAAAACAA
GATGCTAACG TAACAGTTTG GAAGCATGCC AGTTACGATC AGCAAAAGAA GTGCTTTGAT
GATCCTGCTA ATGGCTGCAG TGTAGATGAT TTGCTTTATG ACGATTACGT GTACAACGAG
TATGTTAAAA CACATCCGAA TTTTGAACCT AAGCGAGTAA ATGAGCGTAC AGTAGGACAG
TTTAAGGGAA CTGTAAAGAA AACCGGTGAT TTTGTGTGCA AAGTTTATGC TTTAAGAAAT
TCTGTTAAAG ATGACGGCGC GCAAGACAAG CAAGATGATG CTTTAGTAAA GAAGTTTGAT
CAAATAGCTG CAGAAAAACA CAATAAGATT GATGAAATTG ATTCTGCGCT AAAGTCAGAA
AATTTGTTTA AGTCAAGTAA AGGCATTACT TGGGAAGCTA AGACTTTGAA CATTACTGTT
CGCAAAATGT CTTACTATTA TCAGCCTAAT TATGGTGATG GAATTAATAC TCTTCCAGGT
CAATATGCGA CTTCTATTGT GCCGCTTAAC CAGTGTGGAG TTGGCGAAAA TTGCAGTAAG
AAGTTAGCTA GAACTCGTAA TCTTCCTGAC GGAACTTGGT TTGAAATAAA GCCATATAAA
AATGATTCTG AGCATCCAGT TCCTGATTGG GCCTCATTTA TTGATGAAGA TAACATTGAT
CAAAGTAACA AACCTACCAA AGCTGGGAGT GAAGTCGGAG ACGATAGTAG TGATGCTTCC
GGTGCAGTGT ACGGAAAAAT TACTGTAAGA ATGAGCACTT GGATTAAAAC AGGGCATTAC
AATGTGCCAG TGGTTGCGCA TTATCCTGAT GGCTCCTCTT CAGAGGATGA AGATTCTAGT
AACGAAGGTA AGCCTATCTA TTTGAAAGTC TCGGTCAATA ATTCTCCACG GATTAATAAT
GATGATTTAA AGTTACGTGT GACGACCGAA AAAGCTTCTG CTGATGGTGA GTCTGACTCG
CAAAACGACG ATTATGGTGA TGTAGATCCA GACCAGGGCA TCACAATGAT GCGCGGAATG
CGATTTTTGA ATCCGTATAT TGATGCGTGG TCATTGCGTG AAGTGGGTAA GAAAATTTCG
CTGAAAGTTT TGTGCACTAA GGTTAGCAAA GATAGCAAAG CTAGCAAAGA CGGCTCTGTT
AACGCTAATA GCTCTAGTGG TGTGGGTGTT TGGTCTAGTA GCATTAATGG CTTGACTGCT
CCTACGGAGA ACCAAATTCA TAATTGGGAT CATATTAAAA CTGTTGCTGA ACTTGAAAAG
TGCAAGAATA ATTCAGCTTC GTGTGATGCA AGTCGTACCC TATTTAGACG AGACGTTGAA
GCTAGCGATT GGGATGAAGA AAATCCATTC TATGCAGTCG AACGCACGGA TTCGATTATT
GGCGGTGCGC CTAAAGAAAC GGGCGATTAT CAATGCGTTG TGTATGCGTT GAAGCCGACT
GCGCTTGCTG CTTATACAAA TAAAGTGGGT AGCGCAACTT CCGTACAAAA TGGCGATACT
CTTCTTAACG GCGTCGCTGG TCTCGAAAAA GGCAAGGACT GGACGATGAC CGCGGTCAAA
ATTCACGTTG TTGAGCCTTT TAAACTGCCA AAGACGGGAT TCGCCGGCTG GAACATGATT
CTGAGCGTTG CTACAACGAT TTTCACAAGT CTTATGGTTC TTGCGTTTGC TCTCGACCAA
ACACAGTGGG GGCGCGCATT TATGAAAAAT TTTGTGTACC AAAATTCTGC TCAAAAGGTC
AGTGAAATAG CTGTTCAAAA GGGCACTGAA GTATCTGCTC GCAAGGGCAC TGAAATAAAG
GAGAAGAAAT GA
 
Protein sequence
MFPQRRRNSL SKIMRVICGF TSCVLTLATL AIVPITPSSA STEETTPKTS KSAEINGNQE 
LKNKDEHKSD KKAEQQPVNN LEQHTSATAN TNPKNEVSNK DKEADQNSKN KKAKNAAEAA
QTGSNGDRSS TNTTENGAQF YNPEYLRTLN IVAGNLSTSA NNKSGAPRSK PIHGNRKIKR
GYLPVGTRFE IKPDANDKED AYEWANFEGE QCAVANDASA NNKQCKETKA IPSDNSKSYG
VITFRPNRWT KAGRYKVHVI VHYPDGKSTS DDANAGNSKG GKPSPVYANV VVTRFDPHDS
DLQLSIISKD KEDLKYGQSD NSDLVLLAGQ EVKKTTFDAS AHFGIGNINQ RVICYKKDKN
GNPVDGKYES GGINGLELKQ DANVTVWKHA SYDQQKKCFD DPANGCSVDD LLYDDYVYNE
YVKTHPNFEP KRVNERTVGQ FKGTVKKTGD FVCKVYALRN SVKDDGAQDK QDDALVKKFD
QIAAEKHNKI DEIDSALKSE NLFKSSKGIT WEAKTLNITV RKMSYYYQPN YGDGINTLPG
QYATSIVPLN QCGVGENCSK KLARTRNLPD GTWFEIKPYK NDSEHPVPDW ASFIDEDNID
QSNKPTKAGS EVGDDSSDAS GAVYGKITVR MSTWIKTGHY NVPVVAHYPD GSSSEDEDSS
NEGKPIYLKV SVNNSPRINN DDLKLRVTTE KASADGESDS QNDDYGDVDP DQGITMMRGM
RFLNPYIDAW SLREVGKKIS LKVLCTKVSK DSKASKDGSV NANSSSGVGV WSSSINGLTA
PTENQIHNWD HIKTVAELEK CKNNSASCDA SRTLFRRDVE ASDWDEENPF YAVERTDSII
GGAPKETGDY QCVVYALKPT ALAAYTNKVG SATSVQNGDT LLNGVAGLEK GKDWTMTAVK
IHVVEPFKLP KTGFAGWNMI LSVATTIFTS LMVLAFALDQ TQWGRAFMKN FVYQNSAQKV
SEIAVQKGTE VSARKGTEIK EKK