Gene Information |
Locus tag | HMPREF0424_0653 |
Symbol | |
ID | 8709977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 732491 |
End bp | 735541 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 646482759 |
Product | hypothetical protein |
Protein accession | YP_003373883 |
Protein GI | 283783129 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.523938 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTTA ATCTGAATTT TGCAAAATTT ACTTATACTG CAAAAAAGTG GAGGTGGATT CTTGCGATAG TAGTGTTAAT TGTCGCGTTT GCTGTAGCTG CAATATTTGC TCTCGCAAGG TTCATAACTG ATTGCATGTG GTATTCGCAG CTTGGATTTG AAAGTGTTAT TTGGATTCAG CTTGCAGCAA AAGTGGGCGT TTGGGTGCTT TATGCAATAT TAATGGCTGC TTTTGGATAC TTGTCTGCAG CTTTGGCAAT AAAAGCGCGT CCAAGCAATG CAGATGGAAC TTATATTCGA ATTAAAGGCA ATGTTATTGA TTTAAAGCAG GGAATTAGTT CAAAGCTTGC TTTGCATGTT GCAGGAATTT TTTCATTGAT TGTAGGCGCA GTATTTGGAT TTAAATTCTA CAATCATTGG GCGCAAATTT TGTTAATGTT TAACGCCCAA CCTTTTGGCA TTAAAGATCC GCAATTTGGA ATTGACAATG GCTTCTATGT ATTTGTGCTT CCAGGATTAA GGTTATTTAT AAGCGCATTT GTGCTTTTAT TGGTAATATC TCTTGTTTTC TCTATTATTA CGCACATTCT TATGGGAGCA ATTCGTTTTA GTATGCCTAT CGATGGTAAA GGCATATTAA GTATTACAAA ACGAGCACGT AGACAAATTT CTGTATGGTT TATGCTTGTG ATTTTGACAT GGGCACTTCA ACAGATTTTA GACATGTTCG CTTTAGTAAC TCTTGATGGA TCTCGAATTA CAGGCGGCTC ATACACTGAC ATGAATGCTG GTGTTCCTGG AAGCATAGCA ATGGCAGTGA TAACAGTGTT AGTTGGTTCT GTTATTGCAT TTTGGTTTAT GAAATCTCAT GCTTTATCTG TAGAAGCACC AATTTCAGTG CGTTTTAAAA CAGCAATTAA GGCATGGAAA ACACCAATTA TTGCTTTAGC TTTTATGATT GTTTGCTCAA TTGTTATATC TTTCGCTTGG CCGGCACTTG TGCAGCGATT TAAAGTTGCT CCTAATGCTC AGGAATTGGA AGCTACTTAT ATTCAGCGTA ATATCGATGC AACAAAGTTT GCTTACGGGC TAAATAACGT AAAGAAAGAA TCGTATAACG CTACTTCTAA AGGTAAATCA GGTGCATTAT CAAAAGAAGC AGAGTCAACT GCGCAAATTC GTTTGTTAGA TCCACAAGTT ACGTCTCCAA CGTTCCGTCA ATTGCAGCAA TCGAAGCAAT ATTACACTTT CGCAGATACG CTTTCAGTTG ATAAATACGA TATTGACGGT GTAAGTCAAG ATACGGTTAT TGCTGCGCGC GAGCTTGATT TAGCTGGCAA CGATAATCGA AACTGGGTTA ATGATCATAC AGTTTATACG CATGGCTATG GAATTGTTGC AGCTTACGGA AATAAAGTAA CAACAGATGG ACAGCCACAA TTTATGGAGT ATGGTATTCC TGCTCAAGGC AAACTTACTA AGTCACAAAA GTATGAACCG CGTATTTACT TCTCACCAAA TGCTCCTGAG TATTCTATCG TTGGTTCTCC AAAAGGGACA TCTCCATGGG AGTTTGATTA TCCGACTGGC TCTCAAGGTG CTTTAACTAC TTTCAAAGGC AATGGTGGTC CAAGTTTAGG AAACTTCTTC TCTAAACTTT TGCATGCAAT TAGATTTGAG TCTGATCAGA TTCTTTTTTC TGACCGTGTG ACTTCTGAAT CTCAAATTTT GTATGATCGT GATCCTAAGA CACGCGTGTC TAAGGTTGCT CCTTACTTAA CTTTGGATGG TCGAGTTTAT 
CCTGCAGTAG TAGATGGTCG CGTTAAGTGG ATTGTTGATG GCTATACTAC TTCTGATTCT TACCCTTATT CTCAAATGAC TGATTTTGGT AAGGTAACTC AAGATTCCAC AACAACAACT TCTCATTCTA TTAAAGGGTT GACAAATCAG CGCGCAAACT ACATTCGTAA TTCTGTGAAA GCTACAGTTG ATGCTTATGA TGGATCTGTT GATTTGTATG TGTGGGATAC GAAAGACCCT GTGATTAAAG CATGGCGTTC AATTTTCCCA GGTCATTATC ATGACATTTC TAAGATTTCT GGCGATTTGA TGAAGCATTT GCGTTACCCA GAAAGCTTGT TTAAAGTTCA ACGTCACTTA TTAGCAAAGT ATCACGTCGA TACAGCAAGC CAATTCTTCT CTGGAGAAGA CTTCTGGCAG ATTCCTGTTG ATCCAACTGA ATCTCAAAAA GCACAAAAAG AAGATATTTT GCAACCTCCA TACTATTTGA CTTTGCAAAC TGGAGACGCA AAGAAACCAG TATTTTCGCT TGTTTCTACT TATATTCCGG CAGGTAAAAG TACTCGTGAA ATATTAACTG GTTTCTTGTC TGTTGATTCA GATGCGGGGG ATACTCCTGG AAAGATTGGT CCAAATTACG GTAAGATTCG TTTGCAAGAA TTACCTAAGA TATCTAATGT TCCAGGACCT GGTCAAGCTC AGAATAATTT CAACGCTAAT GCAAATGTTT CTAAAGAATT GAATTTGCTT GAATCCGGTT CTACTAAAGT TAAGCGCGGC AATTTGATTA CATTGCCTCT TGGCGGTGGG CTGGTTTATG TAGAGCCTGT TTATGTGCAA TCTAGCGGTT CTACTAGCTA TCCTTTGCTT AAGAAGGTGT TAGTTGCGTT TGGAGATCAA GTTGGATTTG CAGATACGTT AGATGAAGCA CTTAATCAAG TATTTGGCGG AAACTCGGGC GCTTTGGCTG GAGATGCTTC TAATAATTCT TCTTCAAACA ATGCTGCTGA AAATAATGAA TCTAAGGATG CTGATTCTAA AGAAGGTACT TCTAAGGAAA ATGAGTCTAC GAATTCAGAA AAATCTCATT CTATGAGCCA AAAAGCTAAA GAAGCACTTA AACGTGCGGC TCAAGCATTA AAAGATTCTG ATTCTGCTAT GCGATCCGGA AACTGGGAAG CTTACGGAAA GGCACAAAAA GAGTTGAGCG ATGCTATTAA TGAAGCAATG AAAGAAGAAT CAGGTAAGTA A
|
Protein sequence | MKVNLNFAKF TYTAKKWRWI LAIVVLIVAF AVAAIFALAR FITDCMWYSQ LGFESVIWIQ LAAKVGVWVL YAILMAAFGY LSAALAIKAR PSNADGTYIR IKGNVIDLKQ GISSKLALHV AGIFSLIVGA VFGFKFYNHW AQILLMFNAQ PFGIKDPQFG IDNGFYVFVL PGLRLFISAF VLLLVISLVF SIITHILMGA IRFSMPIDGK GILSITKRAR RQISVWFMLV ILTWALQQIL DMFALVTLDG SRITGGSYTD MNAGVPGSIA MAVITVLVGS VIAFWFMKSH ALSVEAPISV RFKTAIKAWK TPIIALAFMI VCSIVISFAW PALVQRFKVA PNAQELEATY IQRNIDATKF AYGLNNVKKE SYNATSKGKS GALSKEAEST AQIRLLDPQV TSPTFRQLQQ SKQYYTFADT LSVDKYDIDG VSQDTVIAAR ELDLAGNDNR NWVNDHTVYT HGYGIVAAYG NKVTTDGQPQ FMEYGIPAQG KLTKSQKYEP RIYFSPNAPE YSIVGSPKGT SPWEFDYPTG SQGALTTFKG NGGPSLGNFF SKLLHAIRFE SDQILFSDRV TSESQILYDR DPKTRVSKVA PYLTLDGRVY PAVVDGRVKW IVDGYTTSDS YPYSQMTDFG KVTQDSTTTT SHSIKGLTNQ RANYIRNSVK ATVDAYDGSV DLYVWDTKDP VIKAWRSIFP GHYHDISKIS GDLMKHLRYP ESLFKVQRHL LAKYHVDTAS QFFSGEDFWQ IPVDPTESQK AQKEDILQPP YYLTLQTGDA KKPVFSLVST YIPAGKSTRE ILTGFLSVDS DAGDTPGKIG PNYGKIRLQE LPKISNVPGP GQAQNNFNAN ANVSKELNLL ESGSTKVKRG NLITLPLGGG LVYVEPVYVQ SSGSTSYPLL KKVLVAFGDQ VGFADTLDEA LNQVFGGNSG ALAGDASNNS SSNNAAENNE SKDADSKEGT SKENESTNSE KSHSMSQKAK EALKRAAQAL KDSDSAMRSG NWEAYGKAQK ELSDAINEAM KEESGK
|
| |
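The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed above (3051 bp, 1016 aa). The function names are hypothetical and are not part of any database API. The GC helper simply ignores the whitespace that groups the sequence into blocks of ten.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """GC content in percent; whitespace between sequence blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    length = gene_length(732491, 735541)
    print(length)                  # 3051
    print(protein_length(length))  # 1016
```

To compare against the GC content field, pass the full gene sequence string to `gc_percent` and round the result.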