Gene HMPREF0424_1058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1058 
Symbol 
ID8709360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1204566 
End bp1206257 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content40% 
IMG OID646483150 
Productextracellular solute-binding protein 
Protein accessionYP_003374261 
Protein GI283783507 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.271211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAACA GCAAACCTGA GGAAAGCGTG CACGAAGAAA ACGGCGCGAC TGAAAAATTC 
TCCGGTAAAA CTAACGATGA AGCTAACAAA AGCACTTCTT TCGCTGCGAG CACTGACTCT
TCACTGAATG AAGCTGAAAA ATATGCTTAT TCAGGCTCGA AACGCAAAAT GGGCAAGATA
TGGCTATTTG GCTTAGCTAT TGTTTTGGTA GTAGTGTTTA GTGCAATTTT TGCAACTCTT
ACTACATCTA ATAAACCAAA TAAATCAAAC TCAGTATCGG AAATTTCGAA CAAAACTATA
TCTATAGGTC TAAAGCTAGC TCCTACGAAT CTTGATATCC GTAATCAATC AGGTTCATCG
CTTGACCAGT TGCTTATTGG CAATGTGTAT GAAGGCTTAG TTGCTCGCGA TGCTACTAAT
CAGGTAGTTC CTTCGTTAGC TCAAAGCTGG GAAGTAAGTA AAGATGGTTT GCGTTACACA
TTCCACATTC GTAAGGGTGC TGTATTTTCT AATGGTGATA AACTTACTGC GCATGATGTG
GAATGGTCGT TTAACGAGCT TATTGCTAAA AAATATCGCG GCTCTAACAT GGTTGGAAAA
GTTGAGTCTG CTAAAGCAAA AGATGATTAC ACGTTTGAAA TCACTCTTAA AGAGCCAAAT
GCTAAGCTTT TGTGGGCACT TTGCAGTCGT TCAGGTTTAG TTTTTGATAA AAGCGCAAAG
TATGATGCAA AAACTCAAGC AGTCGGTTCT GGACCGTATC TAATTGACAA ATTTGTGCCA
AATGATCGTG TTGTTTTAAA AGCAAATCCG CGTTATAAAG GTATTCATCG TGCGCAAACT
AATAAGATAG TAGTTCGCTA TTTTGTTGAC GATAACGCAG CAGTTGACGC ACTTTCTTCC
GGTTCTGTGC AAGCATTAGC TCCTATTTCT GGTCAACTTG CTAAGCCTTT TAAAGATGAT
TCCAAGCGTT ACGTTGTCAG TGCAGGTAAT GGCACAGATA AATTTGTTCT AGCTATGAAT
ATGAAAGGCG AACGTACTAA AGATAAGCGC GTGCGCAAAG CTATTCGTTA CGCGATTGAT
CACAAGCAAA TTATTGCATC TCGTGGCGGC ACGGATTTAG CTCTTGGTGG TCCAATTCCT
TCTCTTGACC CTGGTTATGA GGATTTAACT AAGCTGTATC CGCATGATTT GCAGCGCGCT
AAGGAGTTAA TGAAGGAAGT CGGATTTAAC GAATCGCATC CTATGGATTT GACGCTTACT
TATCCGAATA TTTACGGCAC GCAAATTGGC GATCAGCTGC GCTCGCAATT GAAGCCTATT
GGAATCAATT TGAAAGTTAA TATTGTTGAG TTTACTACTT GGCTTCAAGA TGTTTATAAG
AATAAGCAGT ATGATTTGTC GATGGTTGAT CACAATGAGA GTCACGATTT TGGGCAGTGG
GCGGATCCTA CGTATTACTA CGGTTATGAC AATAAGCAAG TCCAGGATTT GTATGCTAAA
GCTATGCTTT GTGCGGATCC TAAAGAGTCA GATAAATTGC TTGCTCAAGC TGCGCGAATT
ATTAGCGAGG ATGCTCCTGC TGATTGGCTG TTTAACTATC GCGTTGTAAC TGCGAAAGTG
AAGAATCTTG AAGGAATGTC TTTTGATATG AATCAAGAGA TTTTGCCGCT ATACAACCTG
CGACTTAGTT GA
 
Protein sequence
MSNSKPEESV HEENGATEKF SGKTNDEANK STSFAASTDS SLNEAEKYAY SGSKRKMGKI 
WLFGLAIVLV VVFSAIFATL TTSNKPNKSN SVSEISNKTI SIGLKLAPTN LDIRNQSGSS
LDQLLIGNVY EGLVARDATN QVVPSLAQSW EVSKDGLRYT FHIRKGAVFS NGDKLTAHDV
EWSFNELIAK KYRGSNMVGK VESAKAKDDY TFEITLKEPN AKLLWALCSR SGLVFDKSAK
YDAKTQAVGS GPYLIDKFVP NDRVVLKANP RYKGIHRAQT NKIVVRYFVD DNAAVDALSS
GSVQALAPIS GQLAKPFKDD SKRYVVSAGN GTDKFVLAMN MKGERTKDKR VRKAIRYAID
HKQIIASRGG TDLALGGPIP SLDPGYEDLT KLYPHDLQRA KELMKEVGFN ESHPMDLTLT
YPNIYGTQIG DQLRSQLKPI GINLKVNIVE FTTWLQDVYK NKQYDLSMVD HNESHDFGQW
ADPTYYYGYD NKQVQDLYAK AMLCADPKES DKLLAQAARI ISEDAPADWL FNYRVVTAKV
KNLEGMSFDM NQEILPLYNL RLS