Gene HMPREF0424_0191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0191 
Symbol 
ID8709898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp212445 
End bp213932 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content41% 
IMG OID646482310 
Productextracellular solute-binding protein 
Protein accessionYP_003373455 
Protein GI283782701 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTGTGT CTGTAATAAT GAGCGGCATT TTGTGGTGTG GATGGCAAAT GTACCAGGGT 
CATTCGCCTT TTGCTGCAAT AATGCATCCA GCTAGTAACT CAGTGCTAAA AGTTGGTCTA
CGTACTGCTC CTGAATCTCT TGATATTAGA AACGACGACA GTGACGCTTT ACAGCAAGCT
TTAATTGGTA ATGTTTACGA AACTCTTGTT AAACGTGGCG ATGATAATAG TTTGCAACCA
GGTCTCGCGA AGTCATGGGA TATTTCGAAA GACGGCTTAA CATATCGTTT TAATTTGCGT
CAAGGTGTGC ATTTCTCTAA CGGTAGTGAG ATGACTTCAA ATTCCGTATT ACAATCTTTG
AAACAAGGCA TCACAAATAA TTATCCTGGT TATAGTGCAC TTACAAATAT TAAAACTGTT
AATAATCCGG ATGATTACAC TTTAGTAATT ACACTTAATA ATCCAGATGC TTTACTTCTA
CGTCGTCTTG CTGGACGTGT TGGCATTGTC TATGACACAA AATCGATGAT CGACTATGCA
AGTGCTGCTC TTGGCACAGG TCCTTTCACT GTAAGTGACT ACAATAAAGG CAATTCTTTA
GTATTACTTC GCAACGATAA ATATTGGGGA ACCCCTGCTT CTTGTGCAAG TATTACTTTG
CAATATTTCA ATAGCGATAC TGCTCTAGCT GAGGCAATGG AAAAGGGCAA TATTCAAATG
GCAGTTCCGC TTGAAGGTAA CGAAAATAAG CGCCTTGCTG CTGTTGCTAA TACGCAAATG
GTAGAGGGTC AGAGCACAAG AGTGCGATTT ATTGCAATAA ATACAACAGT TTCGTCGATT
TTCTGCGATG AGCAGGCTCG TAAAGCTGCG AGATACGCTT TGAATGCGCA AACTGTACTT
GCAGCTGATG GCAATGGTGG TGTTCCAGTA GGAGGACCTA TTGATCCTCT TTCAACTGGT
TATGAAGATT TAAATGGATT GTATCCTTAT AATCCAGGCA AAGCTGCTCC TTTGTTCCAT
TACTTTAGCG CAAGCTATTT AGGAACTATT AATTTTCTTG TTCCACAGGG CGAAGGTGGA
GTTGGGGGCG AGCTTTCAAA ACAGATTAGT TCCGTGAGCG GTTTTAAGGT CAATCTTGAA
GAAGTTGATC AACAAACTAT GCGTAAGCGT ATTAGTGAAG GTAAGTATGA TCTTGCGCTT
ACTACGAGCA ATCGTACTTT AGATGAAGGC ATGTTTGCAG AAAGCGGTTC TCCGTTCGTT
TTGCAAGACG CGCGTGCTCA ACAAGCTTGG ACCGATGCGG TTCATTCTAA GAATGCAAAC
GAATATGAGA CAAATGCTCG TGCATATGCG CGCGAAGTAA GCAATAATGC TGCAGCTCAC
TGGTTATATG CTCGAAAGAG CATTATGGCT GTAAAGTCTA ATGTAAGCGG CTATACGAAA
AATATGACGG ATCAGCTTCT GCCATTGCAG AGCATAGTGG TGAAATAG
 
Protein sequence
MVVSVIMSGI LWCGWQMYQG HSPFAAIMHP ASNSVLKVGL RTAPESLDIR NDDSDALQQA 
LIGNVYETLV KRGDDNSLQP GLAKSWDISK DGLTYRFNLR QGVHFSNGSE MTSNSVLQSL
KQGITNNYPG YSALTNIKTV NNPDDYTLVI TLNNPDALLL RRLAGRVGIV YDTKSMIDYA
SAALGTGPFT VSDYNKGNSL VLLRNDKYWG TPASCASITL QYFNSDTALA EAMEKGNIQM
AVPLEGNENK RLAAVANTQM VEGQSTRVRF IAINTTVSSI FCDEQARKAA RYALNAQTVL
AADGNGGVPV GGPIDPLSTG YEDLNGLYPY NPGKAAPLFH YFSASYLGTI NFLVPQGEGG
VGGELSKQIS SVSGFKVNLE EVDQQTMRKR ISEGKYDLAL TTSNRTLDEG MFAESGSPFV
LQDARAQQAW TDAVHSKNAN EYETNARAYA REVSNNAAAH WLYARKSIMA VKSNVSGYTK
NMTDQLLPLQ SIVVK