Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_1251 |
Symbol | |
ID | 8709818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | - |
Start bp | 1493983 |
End bp | 1495725 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 646483339 |
Product | extracellular solute-binding protein |
Protein accession | YP_003374441 |
Protein GI | 283783687 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00147859 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAATA CAAATGCAGG TAAACTGACT GTATTTGCCG CAGCTGCACT TTCTGTTGCA ATGCTGCTCG GCGCTTGCGG CGGTGCTACT AACAATGCTG GTAAAGCAGG TATGACTGAA GAGCCAGCTG CTGGTGTTGA CACTTCTTAC ACTGGCGAAC TTCCAATGCC AGATGTCAAC AAGCGTTACG ATAATCCACA GTCACGTGAC AACGTTAAGG ACGGCGGCAC TTACACCGTT GCTCTTCCTG TTCTCGGTCC AAACTGGAAT TACGCTTCTA ACGACGGTAA CTCTGGTTAC ATGAATACTT TGTGGGGCTT CTACCAGCCA AATCTTATGG CTTATGACAC CGTTAAGGGT GAAAAGCTTA AGTACAACCC TGACTACATC ACTTCCGTTA AGAAGGTTAG CGATAAGCCA CTTGTTGTTC AGTACAACTT GAACCCAAAG GCCAAGTGGA ATGATGGCAC TGATTTTGAC TACACTGCAT TTAAGGCTAC TTGGCAGGCT TTGAACGGAA AGAACAAGGA TTACTCCGTC CCAAGTACTG AAGGCTATGA CTGCATCAAG AGTGTTGAAC AAGGTTCCAC TCCAAAGCAG GTTGTAGTGA CTTATGAGAA GCCATGCGCA ACATGGGAAA TGCTTTTCGC TCCACTGGTT CATCCAAAGG CCGGTGACGT TAAGACCTTC AACCAGGGTT GGGTAAACAA TCCTCACAAC GAGTGGGGTG CAGGTCCATT CCAGATTGAA TCCGCAACCG AAAATCAGGT TGTTTTCACC CGCAATCCAA AGTGGTGGGG TAAGAAGGCA AAGCTCGATA AGGTTGTTGT AAAGCGTATG GAAGATACTG CAGCTTTGAA CGCTTTCCAG AACGGTGAAA TCGATGCTGT AACCGATAGC ATTTCTGCTA AGGACGCAAT CAAGTCTGCT CGTAGCGTTA AGGGTGCTCA GCTGCGTTAC GGCTACAGCA CCAAGGTTCG CGTGCTTAAC TTCAACGCTA AGTCCAAGCC ACTTAACGAG CTTGCAGTTC GTAAGGCTGT TGTTCAGGCA TTCGATGTTG CTACCTACAA CAAGATTCAG TTCCAGGGCA TGAACTGGAA GAGCGAGCAG CCAGGTTCTG AGCTTCTCTC CATGTTCCAG GCTGGCTACA AGAACAATCT TCCTGCTGAC GGCAAGTACA ACACCGAAAA CGCTAAGAAG ACCCTTGAAG CTGCTGGTTA CAAGATGGGC AAGGATGGCT ACTACGCCAA GAATGGCAAG ACTCTTGAGA TTTCCTTCAC CTTCTTCGGC GATGATTCTA CTCAGGCAGC TCTTGCTAAC GCATTCCAGG CCATGATGAA GAAGGCTGGA ATTAAGTGCA AGACTGTGAA CCATGCTGTT GCTAAGTTCT CCGAGGTTGT TGCTTCGCAC GAGTACCAGG TGCTTCCATT GGCATGGCAG TCCACATCTC CATTGAGCTT CTTGTCTGCA GCAAGCCAGG TTTACAAGTC TGATAGCGAT TCCAACCTCG GTCTCGTTGG CAATAAGAAG ATTGATGCTA TGCTCAACAA GATTGGCAAG ACCTACGATT ACAAGGAACA GACTGATTAT GCAAACAAGG CAGAGTCTGC AGCTCTCGCA CTCTACGGAA CTCTTCCAGT TTCTGCTCCT CCTATTTACC AGGTGTTCAA GAAGGGCTTT GCTAACAACG GCCCTGCTGG TTACGCAAGC ACCTACCCAG AGGATATGGG CTGGCAGAAG TAA
|
Protein sequence | MKNTNAGKLT VFAAAALSVA MLLGACGGAT NNAGKAGMTE EPAAGVDTSY TGELPMPDVN KRYDNPQSRD NVKDGGTYTV ALPVLGPNWN YASNDGNSGY MNTLWGFYQP NLMAYDTVKG EKLKYNPDYI TSVKKVSDKP LVVQYNLNPK AKWNDGTDFD YTAFKATWQA LNGKNKDYSV PSTEGYDCIK SVEQGSTPKQ VVVTYEKPCA TWEMLFAPLV HPKAGDVKTF NQGWVNNPHN EWGAGPFQIE SATENQVVFT RNPKWWGKKA KLDKVVVKRM EDTAALNAFQ NGEIDAVTDS ISAKDAIKSA RSVKGAQLRY GYSTKVRVLN FNAKSKPLNE LAVRKAVVQA FDVATYNKIQ FQGMNWKSEQ PGSELLSMFQ AGYKNNLPAD GKYNTENAKK TLEAAGYKMG KDGYYAKNGK TLEISFTFFG DDSTQAALAN AFQAMMKKAG IKCKTVNHAV AKFSEVVASH EYQVLPLAWQ STSPLSFLSA ASQVYKSDSD SNLGLVGNKK IDAMLNKIGK TYDYKEQTDY ANKAESAALA LYGTLPVSAP PIYQVFKKGF ANNGPAGYAS TYPEDMGWQK
|
| |