Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_0486 |
Symbol | |
ID | 8708938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 530279 |
End bp | 531907 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 646482600 |
Product | extracellular solute-binding protein |
Protein accession | YP_003373727 |
Protein GI | 283782973 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.410451 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAATA AGGCTTTGGC TTTCGTTGCT GCTGCCTGCT CGCTGGCAAT GTTACTTGGT GGCTGTGGCT CTTCCGCTAA GAATGCACAA ACAAACGGTG GCAAGGTTAT TATTACCGTT TCAAACTCTG AGCCACAAAA CGAATTAGTC CCTGGAAATA TTAACGAAAA TGCTGGCGCT CGTCCTGCCA TGTTGGTTAA TTCCACGTTG GTAACTTTCG ATGAAAAAGG TAATCCTGTG AATGAGGATG CTGAAAGCAT CACTCCAAAT GCTGATGCTA CCCAGTACAC CGTGAAGGTT AAGAAAGGAA AGAAGTTCAG CGATGGCACT CCTATTACAG CCGAAAGCTT TGTGAAAGCT TGGAGCTTCG TCGCTAACGC TAAGAATGCT CAGAAGTGTG CTTCCTTCTT CCAAACCATT AAGGGTTATG CTGATTTGCA GAAGGATGGT ACTAAGGGTG ACGAACAGCT TTCTGGTTTG AAGGTTGTTG ATGAAAATAC TTTTACCGTT GATTTGGAAC AGCCAGATTC TGTATTCCCA ATTAAGGTTG GCTACTTGGC ATTTGCTCCA CTTCCGGAAT CCTTCTACAA GGATCCAAAG GCTTATGGCG AAAAGCCTGT TTCTTCTGGT CCATACTTGT TCAAGTCTTG GGATCACAAC AAGCAGATTG AAGTTGTGAA GAACCCAGAT TACGATGGTC CACGTAAGGC TCAGAATGAT GGCGTAACTT TCAAGGTTTA TACAGATGGT AACGCTGCAT ACCGCGATGT GCAGGCTGGA AACCTCGATA TGACTGATAA TATTCCAGAT ACCCAAACTA AGACTTTCCA GAAAGATACA ACTGTTAAGG CTTACAACCG TCCAGGTTCT GTAATCCAAC AGTTCACTAT TCCTTCAAGT CTCCCACACT TCGATGTAAA GACTGAAGAA GGTAAGCTTC GTCGTCAAGC TATTTCTATG GCTATAGACC GTAAGGTGAT TATCAACAAG ATTCTCAATG GCACTGCTTC TCCTGCGAAC GAATTTACTT CTCCATTGAC TCCAGGCTAT AAGGCTGATC TTAAGGGTCA TGAGAACGTA GAATTTAATG CGAAGAAAGC TAAGGAACTT TGGGCTAAGG CAGATAAGAT TTCTAAGTAT GATGGATCTC TTACTTTCTC CTACAATGCC GATGGAAATG CAAAGTCTGT GTTTGATGCT GTTGTGAATT CCCTCAAGAA TAATCTCGGA ATCAAGGCTG AGACCACTCC AATTCCTACA TTCCAGGAAT TCCGTAATGC TTGCGCTAAG CGTCAGATTA AGGGTGCATG GCGTGCAGGT TGGATACCAG ATTATCCAAG TGCAGAAAAT TACTTGACTC AGGAATTTGC TTCCGTTGCA GCTGATGGCA ATGGTTCCAA CGAAGGTGAT TATAAGAATC CAAAGTTCGA CGATTTACTT AAGAAAGCTG CATCTGCTAA GCCAGGAGAG GCTATTAAGC TCTATCAGCA AGCAAACGAA ATCCTTCTTG AGGATCTTCC ATCTGTCCCA TTGTTCTACT CGAACGCTAA GGCTGTGATG GTTCCAACTT TGAAGGGCTT CACAATGGAT TGGCAGAATA TGCCGCTGTA CTATCAGCTT CACAAATAA
|
Protein sequence | MKNKALAFVA AACSLAMLLG GCGSSAKNAQ TNGGKVIITV SNSEPQNELV PGNINENAGA RPAMLVNSTL VTFDEKGNPV NEDAESITPN ADATQYTVKV KKGKKFSDGT PITAESFVKA WSFVANAKNA QKCASFFQTI KGYADLQKDG TKGDEQLSGL KVVDENTFTV DLEQPDSVFP IKVGYLAFAP LPESFYKDPK AYGEKPVSSG PYLFKSWDHN KQIEVVKNPD YDGPRKAQND GVTFKVYTDG NAAYRDVQAG NLDMTDNIPD TQTKTFQKDT TVKAYNRPGS VIQQFTIPSS LPHFDVKTEE GKLRRQAISM AIDRKVIINK ILNGTASPAN EFTSPLTPGY KADLKGHENV EFNAKKAKEL WAKADKISKY DGSLTFSYNA DGNAKSVFDA VVNSLKNNLG IKAETTPIPT FQEFRNACAK RQIKGAWRAG WIPDYPSAEN YLTQEFASVA ADGNGSNEGD YKNPKFDDLL KKAASAKPGE AIKLYQQANE ILLEDLPSVP LFYSNAKAVM VPTLKGFTMD WQNMPLYYQL HK
|
| |