Gene HMPREF0424_1158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1158 
SymbolthiE 
ID8709453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1344672 
End bp1346249 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content51% 
IMG OID646483248 
Productthiamine-phosphate diphosphorylase 
Protein accessionYP_003374356 
Protein GI283783602 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2145] Hydroxyethylthiazole kinase, sugar kinase family 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase
[TIGR00694] hydroxyethylthiazole kinase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00817075 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAACAT GCAACACTGA AAAAACAACT AACTGCAACG AAGCAAACGA ATACTGCACA 
AGCTCAACAA CGCAAGTCGA CACATGCTTC TCTGTACGCG CACGCCGCCG CATGTTTCTT
AACGACGTGC GTCAATGCAT CGAACGCGTG CGCACACAGC AGCCACTCAC CCATTGCATC
ACGAACGTAA TCGTGCAAGA CATTACAGCC AACGCACTAC TCGCTGCAGG AGCGTCGCCG
ATTATGGTAA CCGACCCAGA AGAAGCGCAT GCACTTGCGC AAATTGCCAC AGGCGTGCTC
ATAAACGTTG GCACATTCCA TCAGCCAGAA ACTTCCGAAT ACATGCGCGC AGCTGTTGAA
GGATGCGAAA AAGCCGATAC TCCGTGGGTG CTTGACCCGG TTGGAATCGG CGTTCCAGCA
CTTGCTCCAC GCGCGCGCTT TATACACGAA ATTATTAAGC ATCACCCTAC CGTAATTCGC
GCAAATGCTT CCGAAATAAT GGCACTTGCA GGCAAAGAAA GCAACGGCAA AGGTGTGGAT
TCTCACGATA ACGTGAACGA TGCTCTTCAA GCCGCGCGCG AATTAGCAAA AAAGTACGGC
TCAGTAGTGG CAATTTCTGG CGAAAAAGAC GCGATTTACG CTCACGGTTG CTTAGCTCGC
GTAACCGGCG GACATAAAAC TATGACTAAG GTAGTTGGAA CCGGTTGCGC TTTGGGAGCG
TTGGTTGCAG CATACGTTGG CGCAAACCCG GAACGTCCGC TCGCAGCAAC TGTTGCAGCA
CACGTGCACG CAGCAGCGGC TGGAACTTGG GCAGCTCGCC AAACCACAGC TCCCGGCACT
TTTCGCACAC TGTGGATGGA TGCGCTTTCA ACTCTTAGTG TAAACGACAT GTTTAGTTTG
ACAAATATTG AGTTTACGGT GGAACCTGTT GATTGGACAC TGTATTTGGT AACGGATCCG
CGCATGGGAA ATCGCCCGGA AGAAGAAGTT GCAGTAGAGT CAGTTGAAGG CGGAGTTACA
GTTGTGCAAT TGCGCGATAA GTATTCCGAC GACGCTGAAA TTTCAGCAAA GGCGAAGAAG
TTGCGTCATG CGCTTATTGA TTCCGGCCAT GGCGACGTGC CTGTGTTCAT AGACGATCAT
GTTGATTGCG CAGCTCAGCT TGGTTTCAAC TTGCACGTTG GGCAAAAGGA TACGCCGTTT
GTTGAGGCGC GAAAAGCGAT GCCGGCAGAG TGGATGGTTG GGCTTTCTTG CGCGCGCCCG
GATTTGATGG AGAAAGCTTA CCGCGAATGC AAAGAAAATG ACGTTCCGCT GCCAGATGTA
ATTGGCATTG GTGCCGCTTT TGAAACGCAT ACAAAAGCGC ACGACGTGCC GCCGCTGGGA
GTTGACGGAG TGAACGAAGT GGCGAAAGTT GCGCACTCTA TGGGCGTAAA AACGTTAGCA
ATTGGCGGAA TTCACGAGAA CACAGTGTTC CCTATTCGAG GTCTTGAACT TGACGGAGTT
TGCACAGTGT CGGCGCTTAT GTGCGCTGAG GATGCTGGGA AAGTAGCGCG CGAGCTTAAA
AGCGTGATTA CTGAGTAA
 
Protein sequence
MTTCNTEKTT NCNEANEYCT SSTTQVDTCF SVRARRRMFL NDVRQCIERV RTQQPLTHCI 
TNVIVQDITA NALLAAGASP IMVTDPEEAH ALAQIATGVL INVGTFHQPE TSEYMRAAVE
GCEKADTPWV LDPVGIGVPA LAPRARFIHE IIKHHPTVIR ANASEIMALA GKESNGKGVD
SHDNVNDALQ AARELAKKYG SVVAISGEKD AIYAHGCLAR VTGGHKTMTK VVGTGCALGA
LVAAYVGANP ERPLAATVAA HVHAAAAGTW AARQTTAPGT FRTLWMDALS TLSVNDMFSL
TNIEFTVEPV DWTLYLVTDP RMGNRPEEEV AVESVEGGVT VVQLRDKYSD DAEISAKAKK
LRHALIDSGH GDVPVFIDDH VDCAAQLGFN LHVGQKDTPF VEARKAMPAE WMVGLSCARP
DLMEKAYREC KENDVPLPDV IGIGAAFETH TKAHDVPPLG VDGVNEVAKV AHSMGVKTLA
IGGIHENTVF PIRGLELDGV CTVSALMCAE DAGKVARELK SVITE