Gene Information |
Locus tag | HMPREF0424_0425 |
Symbol | |
ID | 8709814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 462642 |
End bp | 465593 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 646482540 |
Product | hypothetical protein |
Protein accession | YP_003373672 |
Protein GI | 283782918 |
COG category | |
COG ID | |
TIGRFAM ID | |
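The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 462642, End bp 465593.
length = gene_length_bp(462642, 465593)
print(length)                     # 2952, matching "Gene Length"
print(protein_length_aa(length))  # 983, matching "Protein Length"
```

Under these assumptions, a mismatch between the computed and listed lengths would point to a spliced gene, a partial CDS, or a different coordinate convention.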
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000374386 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTCCGC AGCGTCGTCG TAACAGTCTT TCTAAGATTA TGCGCGTAAT TTGCGGTTTC ACGTCCTGCG TGCTGACTCT AGCAACTCTC GCGATTGTGC CTATCACTCC GTCGAGTGCT TCAACCGAAG AAACAACGCC TAAGACCTCG AAGAGTGCTG AGATTAATGG TAATCAAGAG CTTAAGAATA AAGACGAACA TAAGTCTGAT AAAAAAGCTG AGCAGCAACC TGTTAATAAT CTTGAACAAC ATACTTCCGC AACGGCTAAT ACAAATCCTA AGAACGAAGT TTCCAATAAA GATAAAGAAG CTGATCAAAA CAGTAAAAAT AAAAAAGCTA AAAACGCTGC TGAAGCCGCG CAAACTGGTT CGAACGGTGA CAGAAGTTCT ACCAACACTA CCGAAAACGG AGCGCAATTC TATAATCCTG AGTATTTACG TACTCTGAAT ATTGTTGCTG GAAATCTTTC TACAAGTGCA AACAATAAGT CGGGCGCTCC TAGGAGTAAG CCAATTCACG GCAATAGGAA AATTAAGCGT GGATATCTTC CTGTTGGTAC GCGGTTTGAA ATTAAGCCAG ATGCAAATGA TAAAGAAGAC GCGTACGAGT GGGCTAATTT TGAAGGCGAG CAATGCGCTG TAGCCAATGA TGCGTCAGCC AATAATAAAC AGTGCAAGGA AACGAAAGCT ATTCCATCCG ATAATTCAAA AAGTTATGGC GTTATAACTT TCCGTCCAAA TAGGTGGACT AAAGCAGGTC GTTACAAAGT GCATGTAATT GTGCACTATC CAGATGGAAA ATCTACTTCC GATGATGCAA ATGCTGGAAA TAGCAAAGGT GGTAAGCCTT CGCCTGTTTA TGCAAATGTA GTTGTTACGC GATTTGATCC TCACGATAGT GATTTACAAC TTTCAATAAT TAGCAAAGAC AAAGAAGATT TGAAATATGG ACAATCTGAT AACTCAGATT TAGTGTTATT GGCTGGTCAA GAGGTTAAGA AAACCACTTT TGATGCTTCT GCGCATTTTG GCATAGGCAA TATTAATCAG CGTGTGATTT GCTATAAGAA AGATAAGAAT GGAAATCCTG TTGATGGAAA ATACGAGTCT GGTGGAATTA ACGGGCTTGA ATTAAAACAA GATGCTAACG TAACAGTTTG GAAGCATGCC AGTTACGATC AGCAAAAGAA GTGCTTTGAT GATCCTGCTA ATGGCTGCAG TGTAGATGAT TTGCTTTATG ACGATTACGT GTACAACGAG TATGTTAAAA CACATCCGAA TTTTGAACCT AAGCGAGTAA ATGAGCGTAC AGTAGGACAG TTTAAGGGAA CTGTAAAGAA AACCGGTGAT TTTGTGTGCA AAGTTTATGC TTTAAGAAAT TCTGTTAAAG ATGACGGCGC GCAAGACAAG CAAGATGATG CTTTAGTAAA GAAGTTTGAT CAAATAGCTG CAGAAAAACA CAATAAGATT GATGAAATTG ATTCTGCGCT AAAGTCAGAA AATTTGTTTA AGTCAAGTAA AGGCATTACT TGGGAAGCTA AGACTTTGAA CATTACTGTT CGCAAAATGT CTTACTATTA TCAGCCTAAT TATGGTGATG GAATTAATAC TCTTCCAGGT CAATATGCGA CTTCTATTGT GCCGCTTAAC CAGTGTGGAG TTGGCGAAAA TTGCAGTAAG AAGTTAGCTA GAACTCGTAA TCTTCCTGAC GGAACTTGGT TTGAAATAAA GCCATATAAA AATGATTCTG AGCATCCAGT TCCTGATTGG GCCTCATTTA TTGATGAAGA TAACATTGAT 
CAAAGTAACA AACCTACCAA AGCTGGGAGT GAAGTCGGAG ACGATAGTAG TGATGCTTCC GGTGCAGTGT ACGGAAAAAT TACTGTAAGA ATGAGCACTT GGATTAAAAC AGGGCATTAC AATGTGCCAG TGGTTGCGCA TTATCCTGAT GGCTCCTCTT CAGAGGATGA AGATTCTAGT AACGAAGGTA AGCCTATCTA TTTGAAAGTC TCGGTCAATA ATTCTCCACG GATTAATAAT GATGATTTAA AGTTACGTGT GACGACCGAA AAAGCTTCTG CTGATGGTGA GTCTGACTCG CAAAACGACG ATTATGGTGA TGTAGATCCA GACCAGGGCA TCACAATGAT GCGCGGAATG CGATTTTTGA ATCCGTATAT TGATGCGTGG TCATTGCGTG AAGTGGGTAA GAAAATTTCG CTGAAAGTTT TGTGCACTAA GGTTAGCAAA GATAGCAAAG CTAGCAAAGA CGGCTCTGTT AACGCTAATA GCTCTAGTGG TGTGGGTGTT TGGTCTAGTA GCATTAATGG CTTGACTGCT CCTACGGAGA ACCAAATTCA TAATTGGGAT CATATTAAAA CTGTTGCTGA ACTTGAAAAG TGCAAGAATA ATTCAGCTTC GTGTGATGCA AGTCGTACCC TATTTAGACG AGACGTTGAA GCTAGCGATT GGGATGAAGA AAATCCATTC TATGCAGTCG AACGCACGGA TTCGATTATT GGCGGTGCGC CTAAAGAAAC GGGCGATTAT CAATGCGTTG TGTATGCGTT GAAGCCGACT GCGCTTGCTG CTTATACAAA TAAAGTGGGT AGCGCAACTT CCGTACAAAA TGGCGATACT CTTCTTAACG GCGTCGCTGG TCTCGAAAAA GGCAAGGACT GGACGATGAC CGCGGTCAAA ATTCACGTTG TTGAGCCTTT TAAACTGCCA AAGACGGGAT TCGCCGGCTG GAACATGATT CTGAGCGTTG CTACAACGAT TTTCACAAGT CTTATGGTTC TTGCGTTTGC TCTCGACCAA ACACAGTGGG GGCGCGCATT TATGAAAAAT TTTGTGTACC AAAATTCTGC TCAAAAGGTC AGTGAAATAG CTGTTCAAAA GGGCACTGAA GTATCTGCTC GCAAGGGCAC TGAAATAAAG GAGAAGAAAT GA
|
Protein sequence | MFPQRRRNSL SKIMRVICGF TSCVLTLATL AIVPITPSSA STEETTPKTS KSAEINGNQE LKNKDEHKSD KKAEQQPVNN LEQHTSATAN TNPKNEVSNK DKEADQNSKN KKAKNAAEAA QTGSNGDRSS TNTTENGAQF YNPEYLRTLN IVAGNLSTSA NNKSGAPRSK PIHGNRKIKR GYLPVGTRFE IKPDANDKED AYEWANFEGE QCAVANDASA NNKQCKETKA IPSDNSKSYG VITFRPNRWT KAGRYKVHVI VHYPDGKSTS DDANAGNSKG GKPSPVYANV VVTRFDPHDS DLQLSIISKD KEDLKYGQSD NSDLVLLAGQ EVKKTTFDAS AHFGIGNINQ RVICYKKDKN GNPVDGKYES GGINGLELKQ DANVTVWKHA SYDQQKKCFD DPANGCSVDD LLYDDYVYNE YVKTHPNFEP KRVNERTVGQ FKGTVKKTGD FVCKVYALRN SVKDDGAQDK QDDALVKKFD QIAAEKHNKI DEIDSALKSE NLFKSSKGIT WEAKTLNITV RKMSYYYQPN YGDGINTLPG QYATSIVPLN QCGVGENCSK KLARTRNLPD GTWFEIKPYK NDSEHPVPDW ASFIDEDNID QSNKPTKAGS EVGDDSSDAS GAVYGKITVR MSTWIKTGHY NVPVVAHYPD GSSSEDEDSS NEGKPIYLKV SVNNSPRINN DDLKLRVTTE KASADGESDS QNDDYGDVDP DQGITMMRGM RFLNPYIDAW SLREVGKKIS LKVLCTKVSK DSKASKDGSV NANSSSGVGV WSSSINGLTA PTENQIHNWD HIKTVAELEK CKNNSASCDA SRTLFRRDVE ASDWDEENPF YAVERTDSII GGAPKETGDY QCVVYALKPT ALAAYTNKVG SATSVQNGDT LLNGVAGLEK GKDWTMTAVK IHVVEPFKLP KTGFAGWNMI LSVATTIFTS LMVLAFALDQ TQWGRAFMKN FVYQNSAQKV SEIAVQKGTE VSARKGTEIK EKK