Gene HMPREF0424_1251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1251 
Symbol 
ID8709818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1493983 
End bp1495725 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content48% 
IMG OID646483339 
Productextracellular solute-binding protein 
Protein accessionYP_003374441 
Protein GI283783687 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00147859 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAATA CAAATGCAGG TAAACTGACT GTATTTGCCG CAGCTGCACT TTCTGTTGCA 
ATGCTGCTCG GCGCTTGCGG CGGTGCTACT AACAATGCTG GTAAAGCAGG TATGACTGAA
GAGCCAGCTG CTGGTGTTGA CACTTCTTAC ACTGGCGAAC TTCCAATGCC AGATGTCAAC
AAGCGTTACG ATAATCCACA GTCACGTGAC AACGTTAAGG ACGGCGGCAC TTACACCGTT
GCTCTTCCTG TTCTCGGTCC AAACTGGAAT TACGCTTCTA ACGACGGTAA CTCTGGTTAC
ATGAATACTT TGTGGGGCTT CTACCAGCCA AATCTTATGG CTTATGACAC CGTTAAGGGT
GAAAAGCTTA AGTACAACCC TGACTACATC ACTTCCGTTA AGAAGGTTAG CGATAAGCCA
CTTGTTGTTC AGTACAACTT GAACCCAAAG GCCAAGTGGA ATGATGGCAC TGATTTTGAC
TACACTGCAT TTAAGGCTAC TTGGCAGGCT TTGAACGGAA AGAACAAGGA TTACTCCGTC
CCAAGTACTG AAGGCTATGA CTGCATCAAG AGTGTTGAAC AAGGTTCCAC TCCAAAGCAG
GTTGTAGTGA CTTATGAGAA GCCATGCGCA ACATGGGAAA TGCTTTTCGC TCCACTGGTT
CATCCAAAGG CCGGTGACGT TAAGACCTTC AACCAGGGTT GGGTAAACAA TCCTCACAAC
GAGTGGGGTG CAGGTCCATT CCAGATTGAA TCCGCAACCG AAAATCAGGT TGTTTTCACC
CGCAATCCAA AGTGGTGGGG TAAGAAGGCA AAGCTCGATA AGGTTGTTGT AAAGCGTATG
GAAGATACTG CAGCTTTGAA CGCTTTCCAG AACGGTGAAA TCGATGCTGT AACCGATAGC
ATTTCTGCTA AGGACGCAAT CAAGTCTGCT CGTAGCGTTA AGGGTGCTCA GCTGCGTTAC
GGCTACAGCA CCAAGGTTCG CGTGCTTAAC TTCAACGCTA AGTCCAAGCC ACTTAACGAG
CTTGCAGTTC GTAAGGCTGT TGTTCAGGCA TTCGATGTTG CTACCTACAA CAAGATTCAG
TTCCAGGGCA TGAACTGGAA GAGCGAGCAG CCAGGTTCTG AGCTTCTCTC CATGTTCCAG
GCTGGCTACA AGAACAATCT TCCTGCTGAC GGCAAGTACA ACACCGAAAA CGCTAAGAAG
ACCCTTGAAG CTGCTGGTTA CAAGATGGGC AAGGATGGCT ACTACGCCAA GAATGGCAAG
ACTCTTGAGA TTTCCTTCAC CTTCTTCGGC GATGATTCTA CTCAGGCAGC TCTTGCTAAC
GCATTCCAGG CCATGATGAA GAAGGCTGGA ATTAAGTGCA AGACTGTGAA CCATGCTGTT
GCTAAGTTCT CCGAGGTTGT TGCTTCGCAC GAGTACCAGG TGCTTCCATT GGCATGGCAG
TCCACATCTC CATTGAGCTT CTTGTCTGCA GCAAGCCAGG TTTACAAGTC TGATAGCGAT
TCCAACCTCG GTCTCGTTGG CAATAAGAAG ATTGATGCTA TGCTCAACAA GATTGGCAAG
ACCTACGATT ACAAGGAACA GACTGATTAT GCAAACAAGG CAGAGTCTGC AGCTCTCGCA
CTCTACGGAA CTCTTCCAGT TTCTGCTCCT CCTATTTACC AGGTGTTCAA GAAGGGCTTT
GCTAACAACG GCCCTGCTGG TTACGCAAGC ACCTACCCAG AGGATATGGG CTGGCAGAAG
TAA
 
Protein sequence
MKNTNAGKLT VFAAAALSVA MLLGACGGAT NNAGKAGMTE EPAAGVDTSY TGELPMPDVN 
KRYDNPQSRD NVKDGGTYTV ALPVLGPNWN YASNDGNSGY MNTLWGFYQP NLMAYDTVKG
EKLKYNPDYI TSVKKVSDKP LVVQYNLNPK AKWNDGTDFD YTAFKATWQA LNGKNKDYSV
PSTEGYDCIK SVEQGSTPKQ VVVTYEKPCA TWEMLFAPLV HPKAGDVKTF NQGWVNNPHN
EWGAGPFQIE SATENQVVFT RNPKWWGKKA KLDKVVVKRM EDTAALNAFQ NGEIDAVTDS
ISAKDAIKSA RSVKGAQLRY GYSTKVRVLN FNAKSKPLNE LAVRKAVVQA FDVATYNKIQ
FQGMNWKSEQ PGSELLSMFQ AGYKNNLPAD GKYNTENAKK TLEAAGYKMG KDGYYAKNGK
TLEISFTFFG DDSTQAALAN AFQAMMKKAG IKCKTVNHAV AKFSEVVASH EYQVLPLAWQ
STSPLSFLSA ASQVYKSDSD SNLGLVGNKK IDAMLNKIGK TYDYKEQTDY ANKAESAALA
LYGTLPVSAP PIYQVFKKGF ANNGPAGYAS TYPEDMGWQK