Gene HMPREF0424_0892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0892 
SymbolthiC 
ID8709229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1012629 
End bp1015286 
Gene Length2658 bp 
Protein Length885 aa 
Translation table11 
GC content47% 
IMG OID646482991 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003374107 
Protein GI283783353 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC
[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCGA ATTATCCGTA CGCATCAATG CGTAATCAAT TTAATCTAAG CGCCTGCTTT 
ATTGCAGACC CACAAGCTTG TAATAATCGT CCGCTTACCG ATATTGTCGA TGATGCTTTG
CGCGCTGGAG CGACTTTTAT TCGTCTTCAT TGCAACAATG AAAACGCTAA AGAAATCACT
ACTATTGCAC GCGATATTGC ACAAATCATC GAAGACAATA ACAAATGCGA TTCTGTAACT
TTTGTTATCG ACGAGCGTGT TGATGTCGTT TGGCAAGCTC GCAATCAAAG CATCAAAGTT
GACGGAGTGC ATTTAGCACA AAGCGACATG GAGCCTAGAG AGGCTCGCGC TCTTCTTGGC
GAAGATGCTG TAATCGGCTT ATCTGTTGAA ACCGAAAGCT TAGTTAAAGT AATAAACGAA
CTTCCAGATG GATGCATTGA TTACATTTGC GTAACAGCAA TGCGCAATCC TGAAGAAGGA
TGCGAAAGCA CTACTGCCGC TTACGAATTG GAAGCAAATC ACACAACGCT GGACGAAGCG
AAAATTAATA CGATTTGTTC CGCAAGTGAT TTTCCAGTTC TAGTTGGCGG AAGAACTGCA
CTCGACGATA TCGATACAAT CGCTCACACC AAGGCTGCAG GATGGTTCGT TTCTGAAGCA
TTGTATTCTT CAGAAACACC AGAATCAACT ATGCGTGAAT TTGTTGAACA TTGGAAAGCT
GTGCGAGGTG AAGAAAAGCA CGGCTACGCT AAGCGAGTGA TAGTTGCAGA AAATTCTGAA
TCAAAATCTT CAGAAACTCA AGAAAAGAAG CCAACTTTTA TTAATGCGAA AGAAGCAAAG
GATGCTGCGA AATTAGCTAA ACAACAGCGA GTTGACATTG CAGCTCGCGG ATGCACTCAG
CGCGATAAAG CTCATATTCG CAAAACAACT CCAATTCATT TTGAGTATGA ATATGGTTCT
TATGATTTGG AAGTTCCTTA TACGGAAATT AAGCTTTCGG ATACTCCTGG CGTAGGTCCT
AACCCGCCTT TTAAGGATTA CAATACAGAA GGTCCAAAGT GCGATCCGAA GGAAGGTTTG
GCTCCGCTTC GCCTTGACTG GATTCGCGAC CGCGGTGACG TTGTGGAATA TGAGGGTCGC
AGGCGCAATC TTCAAGACGA CGGCAAGCGC GCAATTAAGC GAGGCAAAGC TTCTAAAGAA
TGGCGTGGAC GCACGCATAA GCCAATGAAG GGCGCGGATC ATCCGATTAC ACAAATGTGG
TACGCTCGCC ACGGAATCAC TACTCCGGAA ATGCAATATG TTGCAACGCG CGAAAATTGC
GATGTAGAGC TGGTTCGCGA AGAAGTTGCA GCCGGACGTG CTGTAATTCC TTGCAATATT
AACCATCCTG AAGCTGAACC TATGATTATT GGCTCGCGCT TCTTGACTAA GCTCAACGCA
AACATGGGTA ATTCTGCTGT TACGTCATCT ATCGACGAAG AAGTAGAAAA GCTTACGTGG
GCCACGAAGT GGGGTGCGGA TACCGTTATG GATCTTTCCA CCGGTAACGA TATTCACACA
ACGCGCGAAT GGATTTTGCG CAACTCCCCT GTGCCAATTG GAACAGTGCC AATGTATCAG
GCTTTGGAAA AGGTTGAGGA TGATGCTTCT AAGCTCAGCT GGGAGCTTTT CCGCGACACT
GTTATTGAGC AGTGCGAGCA GGGCGTTGAC TACATGACTA TTCACGCTGG CGTGCTTCTT
CGCTACGTGC CGCTTACTGC AAACCGCGTA ACCGGTATTG TTTCTCGTGG TGGCTCAATT
ATGGCTGAAT GGTGCTTGCA ACATCATCAA GAGAGCTTCT TGTATACGCA CTTTGAAGAA
TTATGCGAGA TTTTCGCAAA GTACGATGTT GCATTCTCTT TGGGTGATGG TTTGCGTCCA
GGTAGCTTGG CTGATGCTAA CGATGCGGCT CAGCTTTCCG AGCTTATGAC GCTTGGCGAG
CTTACGAAGA TCGCTTGGAA GCATGACGTA CAGGTGATGA TTGAAGGTCC TGGTCACGTG
CCATTCGACA CTGTGCGTAT GAATATTGAG ATGGAAAAGG CAATTTGCCA GAATGCTCCA
TTCTATACGC TTGGTCCTTT GACTACGGAT ACCGCACCTG GCTATGACCA CATTACTTCC
GCAATTGGTG GCGTGGAGAT TGCGCGATAC GGCACCGCAA TGCTTTGCTA TGTGACTCCT
AAGGAACATT TGGGGTTGCC TAACAAGGAT GACGTGAAGC AAGGCGTGAT TGCGTATAAG
ATCGCTTGCC ACGCAGCTGA TCTTGCTAAG CATCATCCAC ATGCTATGGA TCGCGACAAC
GCAATCAGTA AGGCTCGCTT TGAGTTCCGC TGGTTGGATC AGTTCAACTT AAGCTATGAT
CCAGATACCG CAATCGCCTT CCACGACGAA ACACTTCCTG CAGAACCAGC AAAAATGGCG
CACTTCTGCT CGATGTGCGG ACCAAAGTTC TGCTCGATGG CTATTTCGCA AAATATTCGT
AAGCGTTTTG GCGGAGCAGC TCAGCAGGAG CAGCTCGTTG AAGAAGCACG CAGTCAGGCA
ATTGCCGATG GTATGAAAGA GATGAGCAAA AAGTTCCAAG AATCCGGCTC ATCGTTGTAT
CAAAGCGTGA AAGCATAA
 
Protein sequence
MTSNYPYASM RNQFNLSACF IADPQACNNR PLTDIVDDAL RAGATFIRLH CNNENAKEIT 
TIARDIAQII EDNNKCDSVT FVIDERVDVV WQARNQSIKV DGVHLAQSDM EPREARALLG
EDAVIGLSVE TESLVKVINE LPDGCIDYIC VTAMRNPEEG CESTTAAYEL EANHTTLDEA
KINTICSASD FPVLVGGRTA LDDIDTIAHT KAAGWFVSEA LYSSETPEST MREFVEHWKA
VRGEEKHGYA KRVIVAENSE SKSSETQEKK PTFINAKEAK DAAKLAKQQR VDIAARGCTQ
RDKAHIRKTT PIHFEYEYGS YDLEVPYTEI KLSDTPGVGP NPPFKDYNTE GPKCDPKEGL
APLRLDWIRD RGDVVEYEGR RRNLQDDGKR AIKRGKASKE WRGRTHKPMK GADHPITQMW
YARHGITTPE MQYVATRENC DVELVREEVA AGRAVIPCNI NHPEAEPMII GSRFLTKLNA
NMGNSAVTSS IDEEVEKLTW ATKWGADTVM DLSTGNDIHT TREWILRNSP VPIGTVPMYQ
ALEKVEDDAS KLSWELFRDT VIEQCEQGVD YMTIHAGVLL RYVPLTANRV TGIVSRGGSI
MAEWCLQHHQ ESFLYTHFEE LCEIFAKYDV AFSLGDGLRP GSLADANDAA QLSELMTLGE
LTKIAWKHDV QVMIEGPGHV PFDTVRMNIE MEKAICQNAP FYTLGPLTTD TAPGYDHITS
AIGGVEIARY GTAMLCYVTP KEHLGLPNKD DVKQGVIAYK IACHAADLAK HHPHAMDRDN
AISKARFEFR WLDQFNLSYD PDTAIAFHDE TLPAEPAKMA HFCSMCGPKF CSMAISQNIR
KRFGGAAQQE QLVEEARSQA IADGMKEMSK KFQESGSSLY QSVKA