Gene Information |
Locus tag | HMPREF0424_1058 |
Symbol | |
ID | 8709360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | - |
Start bp | 1204566 |
End bp | 1206257 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 646483150 |
Product | extracellular solute-binding protein |
Protein accession | YP_003374261 |
Protein GI | 283783507 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
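The coordinate and length fields above can be cross-checked against one another. The sketch below is only a consistency check under stated assumptions: that Start bp and End bp are 1-based inclusive positions, and that the CDS span includes the stop codon. The function name check_cds_lengths is hypothetical and not part of any database API.

```python
# Minimal consistency check for the record's coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS span includes the terminal stop codon.

def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return a dict of simple checks on CDS coordinates (hypothetical helper)."""
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    codons, remainder = divmod(span, 3)   # a CDS should be a whole number of codons
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # One codon is the stop codon, so the protein is one residue shorter.
        "protein_matches": codons - 1 == protein_length_aa,
    }

# Values copied from the Gene Information fields of this record.
result = check_cds_lengths(1204566, 1206257, 1692, 563)
print(result)
```

Under those assumptions, all three boolean checks hold for this record: 1206257 - 1204566 + 1 = 1692 bp, which is 564 codons, or 563 residues plus a stop codon.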
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.271211 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAACA GCAAACCTGA GGAAAGCGTG CACGAAGAAA ACGGCGCGAC TGAAAAATTC TCCGGTAAAA CTAACGATGA AGCTAACAAA AGCACTTCTT TCGCTGCGAG CACTGACTCT TCACTGAATG AAGCTGAAAA ATATGCTTAT TCAGGCTCGA AACGCAAAAT GGGCAAGATA TGGCTATTTG GCTTAGCTAT TGTTTTGGTA GTAGTGTTTA GTGCAATTTT TGCAACTCTT ACTACATCTA ATAAACCAAA TAAATCAAAC TCAGTATCGG AAATTTCGAA CAAAACTATA TCTATAGGTC TAAAGCTAGC TCCTACGAAT CTTGATATCC GTAATCAATC AGGTTCATCG CTTGACCAGT TGCTTATTGG CAATGTGTAT GAAGGCTTAG TTGCTCGCGA TGCTACTAAT CAGGTAGTTC CTTCGTTAGC TCAAAGCTGG GAAGTAAGTA AAGATGGTTT GCGTTACACA TTCCACATTC GTAAGGGTGC TGTATTTTCT AATGGTGATA AACTTACTGC GCATGATGTG GAATGGTCGT TTAACGAGCT TATTGCTAAA AAATATCGCG GCTCTAACAT GGTTGGAAAA GTTGAGTCTG CTAAAGCAAA AGATGATTAC ACGTTTGAAA TCACTCTTAA AGAGCCAAAT GCTAAGCTTT TGTGGGCACT TTGCAGTCGT TCAGGTTTAG TTTTTGATAA AAGCGCAAAG TATGATGCAA AAACTCAAGC AGTCGGTTCT GGACCGTATC TAATTGACAA ATTTGTGCCA AATGATCGTG TTGTTTTAAA AGCAAATCCG CGTTATAAAG GTATTCATCG TGCGCAAACT AATAAGATAG TAGTTCGCTA TTTTGTTGAC GATAACGCAG CAGTTGACGC ACTTTCTTCC GGTTCTGTGC AAGCATTAGC TCCTATTTCT GGTCAACTTG CTAAGCCTTT TAAAGATGAT TCCAAGCGTT ACGTTGTCAG TGCAGGTAAT GGCACAGATA AATTTGTTCT AGCTATGAAT ATGAAAGGCG AACGTACTAA AGATAAGCGC GTGCGCAAAG CTATTCGTTA CGCGATTGAT CACAAGCAAA TTATTGCATC TCGTGGCGGC ACGGATTTAG CTCTTGGTGG TCCAATTCCT TCTCTTGACC CTGGTTATGA GGATTTAACT AAGCTGTATC CGCATGATTT GCAGCGCGCT AAGGAGTTAA TGAAGGAAGT CGGATTTAAC GAATCGCATC CTATGGATTT GACGCTTACT TATCCGAATA TTTACGGCAC GCAAATTGGC GATCAGCTGC GCTCGCAATT GAAGCCTATT GGAATCAATT TGAAAGTTAA TATTGTTGAG TTTACTACTT GGCTTCAAGA TGTTTATAAG AATAAGCAGT ATGATTTGTC GATGGTTGAT CACAATGAGA GTCACGATTT TGGGCAGTGG GCGGATCCTA CGTATTACTA CGGTTATGAC AATAAGCAAG TCCAGGATTT GTATGCTAAA GCTATGCTTT GTGCGGATCC TAAAGAGTCA GATAAATTGC TTGCTCAAGC TGCGCGAATT ATTAGCGAGG ATGCTCCTGC TGATTGGCTG TTTAACTATC GCGTTGTAAC TGCGAAAGTG AAGAATCTTG AAGGAATGTC TTTTGATATG AATCAAGAGA TTTTGCCGCT ATACAACCTG CGACTTAGTT GA
|
Protein sequence | MSNSKPEESV HEENGATEKF SGKTNDEANK STSFAASTDS SLNEAEKYAY SGSKRKMGKI WLFGLAIVLV VVFSAIFATL TTSNKPNKSN SVSEISNKTI SIGLKLAPTN LDIRNQSGSS LDQLLIGNVY EGLVARDATN QVVPSLAQSW EVSKDGLRYT FHIRKGAVFS NGDKLTAHDV EWSFNELIAK KYRGSNMVGK VESAKAKDDY TFEITLKEPN AKLLWALCSR SGLVFDKSAK YDAKTQAVGS GPYLIDKFVP NDRVVLKANP RYKGIHRAQT NKIVVRYFVD DNAAVDALSS GSVQALAPIS GQLAKPFKDD SKRYVVSAGN GTDKFVLAMN MKGERTKDKR VRKAIRYAID HKQIIASRGG TDLALGGPIP SLDPGYEDLT KLYPHDLQRA KELMKEVGFN ESHPMDLTLT YPNIYGTQIG DQLRSQLKPI GINLKVNIVE FTTWLQDVYK NKQYDLSMVD HNESHDFGQW ADPTYYYGYD NKQVQDLYAK AMLCADPKES DKLLAQAARI ISEDAPADWL FNYRVVTAKV KNLEGMSFDM NQEILPLYNL RLS