Gene HMPREF0424_1306 details

Gene Information

Locus tag: HMPREF0424_1306
Symbol:
ID: 8709046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 1561922
End bp: 1563337
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 41%
IMG OID: 646483393
Product: extracellular solute-binding protein
Protein accession: YP_003374494
Protein GI: 283783740
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
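
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon while the protein length does not. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon is not translated


if __name__ == "__main__":
    length = gene_length(1561922, 1563337)
    print(length, protein_length(length))
```

With the coordinates listed for this gene, the sketch gives 1416 bp and 471 aa, which agrees with the Gene Length and Protein Length fields.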


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAGTC GTTCGGTTAA GTCGATTATT GCGCTAGTGG TTTGCGCTGT TTTGGCGGCT 
TGTGCGTTTA CTGCTTTTGG TTTTTCTCAG TCTTTTGGCG CAAATGCCGA TAGGTCTGCT
AATGGGAAGG TTATTTTCTT CAACTTGAAG CCTGAAGCTA CTGCTATGTG GGAAGAAGTT
GCTAAAGAAT ACACAAAACA GACTGGTGTT ACAGTAAGCA TAGTTACGCC AACTGGCGGT
CAGTATGAAA AGAGACTTAA AACCGAAGTT GAGAAATATA AAGATAATCC AGATAGCATG
CCTACGTTAT TCCAGATTAA CGGTCCTGTT GGCTACGCTA ACTGGAAGGA TTACACTGCT
GATTTAACTA ATTCCCCTCT TTATACGCAA CTTAGCAATA AAAATCTTGC TTTTATGGAT
AACGGTAAGC CGGTTGCTGT GCCATATGTT ACTGAAAATT ACGGCTTGAT TTACAACAAG
GCTTTGCTTG CTAGGTATGC GGATCTTAAA GATGCAAAGT TGCAGCCAGA AGAAAACGGC
AAATCTTTTG AAGATCAAAT AACTAATTTC GCTGCGTTGA AGGCTGTTGC AGACGATTTG
CAAAAGCGCA AGGATGAGCT TGGTATTGAC GGTGCTTTTG CGTCCGCAGG TTTTGATTTA
AGCTCCGACT GGCGTTTTAA GACGCATCTT GCAAATATGC CTTTGTATTA TCAATTCGTT
AAGAATAACG TTAATGGTCA GCCTCAGTCA ATTAATTACG ACTATGTTGA TAATTTCAGG
AAGATTTTTG ACCTTTATAT TTCTGATTCC ACAACTCCTG TAAGCCAGTT AAGTACTAAA
ACTGGCGAGG ATGCTGTTTA TGAGTTTGCT CTTGGTAAAG CTGTGTTCTA CCAGAATGGT
ACTTGGGCTT GGGGTGATTT TTCAAAAGCT GGCATAAAGC CTGACGATGT TGGCATGCTA
CCAATTTATA TTGGTGCTAA GGGCGAAGAG AAACAAGGCA TGGCTACCGG TTCGGAAAAT
TATTGGTGCA TTAATAAAAA GGCTTCAAGG CGGAATCAGG AAGCAACTAA TGCCTTCCTT
AATTGGTTAC TTACTACAAA TTATGGTAAG GATGCTTTGG CTAATAAAAT GGGCTTTGCT
ACTCCTTTCA AAGGTTTCCC TAAAGCAGAA AATCCTCTAG TTCAGGCTGC TGTTGATTAC
ACTGCTAAGC CTGGCAAATT GCCTGTTAAG TGGGTGTTTG TTACTATGCC ATCTAACGGT
TGGAAAGACG ATGTTGGCTC TGATCTGCTT ACATATGCTC AAGCAACAAA AGACGGTATG
ACTTCTATTG GCGCTAATAA GGCATGGGCT ACTCTTAAAT CCGCGTTTGT TGACGGCTGG
GCTAAGGAAT ATAAGATTGC TCATGGGCAG AGTTAG
 
Protein sequence
MMSRSVKSII ALVVCAVLAA CAFTAFGFSQ SFGANADRSA NGKVIFFNLK PEATAMWEEV 
AKEYTKQTGV TVSIVTPTGG QYEKRLKTEV EKYKDNPDSM PTLFQINGPV GYANWKDYTA
DLTNSPLYTQ LSNKNLAFMD NGKPVAVPYV TENYGLIYNK ALLARYADLK DAKLQPEENG
KSFEDQITNF AALKAVADDL QKRKDELGID GAFASAGFDL SSDWRFKTHL ANMPLYYQFV
KNNVNGQPQS INYDYVDNFR KIFDLYISDS TTPVSQLSTK TGEDAVYEFA LGKAVFYQNG
TWAWGDFSKA GIKPDDVGML PIYIGAKGEE KQGMATGSEN YWCINKKASR RNQEATNAFL
NWLLTTNYGK DALANKMGFA TPFKGFPKAE NPLVQAAVDY TAKPGKLPVK WVFVTMPSNG
WKDDVGSDLL TYAQATKDGM TSIGANKAWA TLKSAFVDGW AKEYKIAHGQ S
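
The sequence blocks above can be sanity-checked against the record's GC content and CDS type. The following is a hedged Python sketch with hypothetical helper names; it assumes the sequence is pasted as shown, in space-separated blocks of 10 bases. It checks only for an ATG start codon, which is what this gene begins with, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and ends with a stop codon."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] == "ATG" and s[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Toy input for illustration only; paste the full gene sequence to check this record.
    toy = "ATGAAAGGCC TAG"
    print(gc_percent(toy), looks_like_complete_cds(toy))
```

Running `gc_percent` on the full gene sequence and rounding to a whole number gives a value to compare with the GC content field.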