Gene HMPREF0424_1336 details


Gene Information

Locus tag: HMPREF0424_1336
Symbol:
ID: 8709024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 1597789
End bp: 1598727
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 45%
IMG OID: 646483421
Product: periplasmic binding protein and sugar binding domain of the LacI family protein
Protein accession: YP_003374519
Protein GI: 283783765
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
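
The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length agrees with that reading) and that the final codon is a stop codon not counted in the protein length. Variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1597789
end_bp = 1598727

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 939 bp
print(protein_length, "aa")  # 312 aa
```

Both values match the Gene Length and Protein Length fields of the record.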


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.454554
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTATC CAATTATGCG TCGAGGCGTA AAGATTGCTT GCGCGGTTGC AGCAGCATCT 
GCACTTATTG TCAGCATGAG TGCATGTGGT AGTACTAATG GTGGCGATAA AGTCGCATTG
CTAGTTTCTA CTTTGAACAA TCCGTTCTTT GTGGATTTGC GTGATGGTGC CCAAGCTGAA
GCAGAGAAAC TTGGTGTTGA TTTGCAAGTT TCCGACGCTC AGAATGATTC TGCAAAGCAG
CAGGATCAAG CGCAGAATGC TCAATCACAA GGTGCAAAAG CTGTTATTAT TAACCCTGTA
GATTCTGATG CTGCTGGTCC TGCAGTTGCC CCGTTGCTAA GCGCGAACTT GCCTGTGATT
TCTGTTGACC GTTCTGTAAC TGGTGAAAAA GTTACTTCGC ATATTGCTTC CGATAACGTT
GCTGGTGGTG CTCAGGCTGC TGACACGCTA GCTAATAGTA TGGGCGAAAA GGGTGAAGTT
TTGATTTTGC AAGGTATTCC TGGAGCGGCT TCTACTCGCG ACCGTGGCAA AGGATTCAAA
GATCGTATTA AGAAATATTC AAATATTAAA GTTGTAGCAG AACAAACTGC TAACTTTGAC
CGTGCTGAAG CATTAAATGT GGCTACTAAT TTGTTGCAAT CGCACCCGAA TGCAACCGGC
ATTTACGCCG AAAATGACGA GATGGCATTA GGTGCCATTC AAGCATTAGG AGCTAAAGCT
GGTAAAGAAA TTAAAGTAGT CGGTTTCGAT GGCACTGTTG ACGGTATGAA AGCTATTAAG
GCCGGCGCAA TGGCTGGAAC AATTGCTCAG CAGCCAAAGG AACTTGGTCG TTCCGCCGTT
GCTGCTGCTG TAAAAGCTAT TAAAGGGCAA AGTGTTCCAA AGACTGAACC AATTACTGTG
AAAACAGTGA CAATAAAGAA TGTGGGTGAT TTTGAGTAA
 
Protein sequence
MNYPIMRRGV KIACAVAAAS ALIVSMSACG STNGGDKVAL LVSTLNNPFF VDLRDGAQAE 
AEKLGVDLQV SDAQNDSAKQ QDQAQNAQSQ GAKAVIINPV DSDAAGPAVA PLLSANLPVI
SVDRSVTGEK VTSHIASDNV AGGAQAADTL ANSMGEKGEV LILQGIPGAA STRDRGKGFK
DRIKKYSNIK VVAEQTANFD RAEALNVATN LLQSHPNATG IYAENDEMAL GAIQALGAKA
GKEIKVVGFD GTVDGMKAIK AGAMAGTIAQ QPKELGRSAV AAAVKAIKGQ SVPKTEPITV
KTVTIKNVGD FE
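
The relationship between the two sequences above can be checked by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own tooling: it builds the codon table by hand rather than using a library, and the `translate` helper is hypothetical. Translation table 11 assigns the same amino acids to codons as the standard table (it differs in which start codons are permitted), so the standard assignments are used here. Only the first and last lines of the gene sequence are translated, each of which starts on a codon boundary.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks shown in the record
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAATTATC CAATTATGCG TCGAGGCGTA AAGATTGCTT GCGCGGTTGC AGCAGCATCT"
last_line = "AAAACAGTGA CAATAAAGAA TGTGGGTGAT TTTGAGTAA"

print(translate(first_line))  # MNYPIMRRGVKIACAVAAAS
print(translate(last_line))   # KTVTIKNVGDFE
```

The two results correspond to the start and end of the listed protein sequence; the trailing TAA is the stop codon and produces no residue.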