Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_1158 |
Symbol | thiE |
ID | 8709453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 1344672 |
End bp | 1346249 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 646483248 |
Product | thiamine-phosphate diphosphorylase |
Protein accession | YP_003374356 |
Protein GI | 283783602 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2145] Hydroxyethylthiazole kinase, sugar kinase family |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase [TIGR00694] hydroxyethylthiazole kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00817075 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAACAT GCAACACTGA AAAAACAACT AACTGCAACG AAGCAAACGA ATACTGCACA AGCTCAACAA CGCAAGTCGA CACATGCTTC TCTGTACGCG CACGCCGCCG CATGTTTCTT AACGACGTGC GTCAATGCAT CGAACGCGTG CGCACACAGC AGCCACTCAC CCATTGCATC ACGAACGTAA TCGTGCAAGA CATTACAGCC AACGCACTAC TCGCTGCAGG AGCGTCGCCG ATTATGGTAA CCGACCCAGA AGAAGCGCAT GCACTTGCGC AAATTGCCAC AGGCGTGCTC ATAAACGTTG GCACATTCCA TCAGCCAGAA ACTTCCGAAT ACATGCGCGC AGCTGTTGAA GGATGCGAAA AAGCCGATAC TCCGTGGGTG CTTGACCCGG TTGGAATCGG CGTTCCAGCA CTTGCTCCAC GCGCGCGCTT TATACACGAA ATTATTAAGC ATCACCCTAC CGTAATTCGC GCAAATGCTT CCGAAATAAT GGCACTTGCA GGCAAAGAAA GCAACGGCAA AGGTGTGGAT TCTCACGATA ACGTGAACGA TGCTCTTCAA GCCGCGCGCG AATTAGCAAA AAAGTACGGC TCAGTAGTGG CAATTTCTGG CGAAAAAGAC GCGATTTACG CTCACGGTTG CTTAGCTCGC GTAACCGGCG GACATAAAAC TATGACTAAG GTAGTTGGAA CCGGTTGCGC TTTGGGAGCG TTGGTTGCAG CATACGTTGG CGCAAACCCG GAACGTCCGC TCGCAGCAAC TGTTGCAGCA CACGTGCACG CAGCAGCGGC TGGAACTTGG GCAGCTCGCC AAACCACAGC TCCCGGCACT TTTCGCACAC TGTGGATGGA TGCGCTTTCA ACTCTTAGTG TAAACGACAT GTTTAGTTTG ACAAATATTG AGTTTACGGT GGAACCTGTT GATTGGACAC TGTATTTGGT AACGGATCCG CGCATGGGAA ATCGCCCGGA AGAAGAAGTT GCAGTAGAGT CAGTTGAAGG CGGAGTTACA GTTGTGCAAT TGCGCGATAA GTATTCCGAC GACGCTGAAA TTTCAGCAAA GGCGAAGAAG TTGCGTCATG CGCTTATTGA TTCCGGCCAT GGCGACGTGC CTGTGTTCAT AGACGATCAT GTTGATTGCG CAGCTCAGCT TGGTTTCAAC TTGCACGTTG GGCAAAAGGA TACGCCGTTT GTTGAGGCGC GAAAAGCGAT GCCGGCAGAG TGGATGGTTG GGCTTTCTTG CGCGCGCCCG GATTTGATGG AGAAAGCTTA CCGCGAATGC AAAGAAAATG ACGTTCCGCT GCCAGATGTA ATTGGCATTG GTGCCGCTTT TGAAACGCAT ACAAAAGCGC ACGACGTGCC GCCGCTGGGA GTTGACGGAG TGAACGAAGT GGCGAAAGTT GCGCACTCTA TGGGCGTAAA AACGTTAGCA ATTGGCGGAA TTCACGAGAA CACAGTGTTC CCTATTCGAG GTCTTGAACT TGACGGAGTT TGCACAGTGT CGGCGCTTAT GTGCGCTGAG GATGCTGGGA AAGTAGCGCG CGAGCTTAAA AGCGTGATTA CTGAGTAA
|
Protein sequence | MTTCNTEKTT NCNEANEYCT SSTTQVDTCF SVRARRRMFL NDVRQCIERV RTQQPLTHCI TNVIVQDITA NALLAAGASP IMVTDPEEAH ALAQIATGVL INVGTFHQPE TSEYMRAAVE GCEKADTPWV LDPVGIGVPA LAPRARFIHE IIKHHPTVIR ANASEIMALA GKESNGKGVD SHDNVNDALQ AARELAKKYG SVVAISGEKD AIYAHGCLAR VTGGHKTMTK VVGTGCALGA LVAAYVGANP ERPLAATVAA HVHAAAAGTW AARQTTAPGT FRTLWMDALS TLSVNDMFSL TNIEFTVEPV DWTLYLVTDP RMGNRPEEEV AVESVEGGVT VVQLRDKYSD DAEISAKAKK LRHALIDSGH GDVPVFIDDH VDCAAQLGFN LHVGQKDTPF VEARKAMPAE WMVGLSCARP DLMEKAYREC KENDVPLPDV IGIGAAFETH TKAHDVPPLG VDGVNEVAKV AHSMGVKTLA IGGIHENTVF PIRGLELDGV CTVSALMCAE DAGKVARELK SVITE
|
| |