Gene HMPREF0424_1291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1291 
Symbol 
ID8709886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1541622 
End bp1543007 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content44% 
IMG OID646483378 
Productextracellular solute-binding protein 
Protein accessionYP_003374479 
Protein GI283783725 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATCA GAAAAATGTT TATTTCATGT GTCGCCGTTT TAGGCATGCT TGCTAGTGTT 
AGCGCATGCG GCAGCAATTC TGCCGGTGTC ACCGACACGA AAAAAGACAA AGGTTTGCCG
GTAATGGGCG AGGCATTGAA ATATGATCCC AACCACCTTG TTAATCAAGG CAAGCCAATT
GCTTTGGAAT ATTGGAGCTG GGGAGACAGT AGTACCGACC CAATCTACGA TATGATCAAG
CAGTACACGA AAATCTACCC TAACGTAACG TTCAAAACAA AGAACGTTGC TTGGGACGAT
TATTGGACGA AACTGCCGTT AGTACTCAAA GGTAAAAGCG GTCCGGCTCT GTTCAATATT
CACAATTCCA AGGATGAGCT TATACGCCCA TACTCTGCCG ACTACAACGT TGACAAGGCA
GCAATGGAAG CTGACTACAG CTCCTCTGTT GCACACGAAG ATGCCAACGG AAAAGTAAAG
TATATTGACT CGGTTATCAA TACCGGCAAT ATTTACTACA ATAAAAAGCT TTGGAAAGAA
GCCGGTTTAA CAGAAGCAGA TATTCCTACG ACTTGGGAGG AGTTACGCGA GGTGGCAAAG
AAACTCACCA AGTGGAACGG TTCTAAGATG GTACAAGCCG GTTTCAACGT TAATGGTGAC
GCATACGCTG CAATCAACCA AGGCTTGAAC TATCAACGTG GCGAACTTGC GTTCGATAAA
AGTGGCAAAA AGCCTAACTT TGACAACAAA ATTACGCGCG AAAACATGCA ATTCCTAAAG
AATTTGTACG ACAAAGACAA AGTTATTGCA ACAGACTTCG GTACTGATTA CACGCAAAGT
TTTGGCAACG GACAATCCGG AATGGTATAC GCTTGGGGCT GGCTTGAAGG CCTACTGAAA
GAAAAATATC CGAATGTAGA ATACGGTATT TTCCCAACTC CAACATTCAC GAAAGAAACG
CCATTCGCTT ATGATCGCTA CAACGGCGAA TCTACTCCAG GAATCAACGC TCACCAAAGC
AAAGAACAGC AAGCCGTGGC ACAAGATTTT ATCAAGTTCA TTCTCGCTAA CGACGCTTTT
ATTCGCTCTG CAGTGAAACA TCTGAACTCG TTCCCAGCGA AGAAATCGCT ACAGAACGAT
CCAGAGATTC TCAAGGCCCC GGTAATGGCG GCTATTCAGC CTCGCGTGAA TAGGTTGATT
TGGCCAGGCA TCACTCCTTC TACGGTGGAA ACCAGCAGCA AGGCAGCGTT CCAAAACGTT
ATGCAAAATG GTCAATCTAT TGATTCCGCA GTAAAAGAAG CACAGGCGAC CATGGTAAAA
GACATGAAGA ATTCCAACTT TAAGTCTGCA GAAAGCAAGT ATGAATTCTT CAAAGAACAC
AAGTAA
 
Protein sequence
MNIRKMFISC VAVLGMLASV SACGSNSAGV TDTKKDKGLP VMGEALKYDP NHLVNQGKPI 
ALEYWSWGDS STDPIYDMIK QYTKIYPNVT FKTKNVAWDD YWTKLPLVLK GKSGPALFNI
HNSKDELIRP YSADYNVDKA AMEADYSSSV AHEDANGKVK YIDSVINTGN IYYNKKLWKE
AGLTEADIPT TWEELREVAK KLTKWNGSKM VQAGFNVNGD AYAAINQGLN YQRGELAFDK
SGKKPNFDNK ITRENMQFLK NLYDKDKVIA TDFGTDYTQS FGNGQSGMVY AWGWLEGLLK
EKYPNVEYGI FPTPTFTKET PFAYDRYNGE STPGINAHQS KEQQAVAQDF IKFILANDAF
IRSAVKHLNS FPAKKSLQND PEILKAPVMA AIQPRVNRLI WPGITPSTVE TSSKAAFQNV
MQNGQSIDSA VKEAQATMVK DMKNSNFKSA ESKYEFFKEH K