Gene HMPREF0424_1304 details

Gene Information

Locus tag: HMPREF0424_1304
Symbol:
ID: 8709405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 1559146
End bp: 1560159
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 41%
IMG OID: 646483391
Product: periplasmic binding protein and sugar binding domain of the LacI family protein
Protein accession: YP_003374492
Protein GI: 283783738
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
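
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record, but both are consistent with the listed values. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(1559146, 1560159)
print(length)                     # 1014
print(protein_length_aa(length))  # 337
```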


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGCAA ATATTCAAGA TGTAGCGAAT AAAGCTCATG TGTCAGTTTC TACTGTGTCG 
CGTTCATTTA CACGACCAAA CTTAGTATCT GCATCCACTC GCAATAAAGT ATTGGCCATT
GCTGAACAAT TGAATTTTTC TATTTCGAAA TCTGCAGCAG CACTAAAATC TGGTAAAACC
TTGCGCATCG CTTTATTGAT TAGCGATCAA ATTCGTCTAT GGTTTTCTGC ATCTATTATT
CAAGGATTGA ATCAAGTATT CCACACAGCT GGATACGATT TGTCAATTTT TCAGATTTCC
AGCAGCAAAG AGCGTAGCGA ATTTTTCACA ATGCTCCCAA CTAGACGTAA TGCTGACGCT
GTTGTTGTGT GCTCCTTTGA CGTAAATACC GACGAAATCG CACAACTTAA ATCAACAGGT
GTACCAATTG TTGGCATAAA CTGCTTATAT CCACAAAAAT GCGATTTTGA CGCGACTATC
AATATTGACG ATGACCAAGG TGCAAGACTT ATGGCACGCC ACTTAATCGG TTTAGGGCAT
CGCAATATTG CATATGTGCG CACAACTCGT GACGTTTCAC TACATTTCAG CGTATTGCAG
CGTTACCATT CTTTTATTGA TGAATGCCAA AATAGCGGGA TCACACCTAC AGAAATTGTG
GCACCTGCTA ATTCTGATCG CATAAGCGCC ATAGTTTCTA TGCTTCTTGG AAGTTCAATT
ATGCCAACAG CAATTGCATG CCAAGAAGAT GGTATTGCAA TACCGCTTAT GTTTCAGCTT
GCACGCAGCG GGTATTCTAC TCCAAAAGAT GTGTCAATTA TTGGTTTTGA CGATAGTTTT
TACGCGCATG AAACTGGCTT AACTACTATT AGACAGGATC CAGTTGATAT TGCTTCTACT
GCGGCAAATA TCACGCTTGC GCTTATAAAC GCAGAGGAAG TTGAAGATCC ATATCGCATT
GTTCCAGCGC AACTTATTGT GCGTTCAAGC ACAGCTGCAT TATTAAAAGA GTAG
 
Protein sequence
MSANIQDVAN KAHVSVSTVS RSFTRPNLVS ASTRNKVLAI AEQLNFSISK SAAALKSGKT 
LRIALLISDQ IRLWFSASII QGLNQVFHTA GYDLSIFQIS SSKERSEFFT MLPTRRNADA
VVVCSFDVNT DEIAQLKSTG VPIVGINCLY PQKCDFDATI NIDDDQGARL MARHLIGLGH
RNIAYVRTTR DVSLHFSVLQ RYHSFIDECQ NSGITPTEIV APANSDRISA IVSMLLGSSI
MPTAIACQED GIAIPLMFQL ARSGYSTPKD VSIIGFDDSF YAHETGLTTI RQDPVDIAST
AANITLALIN AEEVEDPYRI VPAQLIVRSS TAALLKE
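
The gene and protein sequences above can be checked against each other by translating the DNA with translation table 11. The following is a minimal sketch, not the database's own method: it builds the codon table from the compact NCBI-style amino acid string (table 11 uses the same codon-to-amino-acid assignments as the standard code), strips the spaces and line breaks used in the display, and stops at the first stop codon. It also includes a simple GC percentage helper. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence shown above.
first_row = "ATGTCGGCAA ATATTCAAGA TGTAGCGAAT AAAGCTCATG TGTCAGTTTC TACTGTGTCG"
print(translate(first_row))  # MSANIQDVANKAHVSVSTVS
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (spaces removed) checks the whole record, and `gc_percent` on the same input can be compared with the GC content field.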