Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_0892 |
Symbol | thiC |
ID | 8709229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 1012629 |
End bp | 1015286 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 646482991 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003374107 |
Protein GI | 283783353 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCGA ATTATCCGTA CGCATCAATG CGTAATCAAT TTAATCTAAG CGCCTGCTTT ATTGCAGACC CACAAGCTTG TAATAATCGT CCGCTTACCG ATATTGTCGA TGATGCTTTG CGCGCTGGAG CGACTTTTAT TCGTCTTCAT TGCAACAATG AAAACGCTAA AGAAATCACT ACTATTGCAC GCGATATTGC ACAAATCATC GAAGACAATA ACAAATGCGA TTCTGTAACT TTTGTTATCG ACGAGCGTGT TGATGTCGTT TGGCAAGCTC GCAATCAAAG CATCAAAGTT GACGGAGTGC ATTTAGCACA AAGCGACATG GAGCCTAGAG AGGCTCGCGC TCTTCTTGGC GAAGATGCTG TAATCGGCTT ATCTGTTGAA ACCGAAAGCT TAGTTAAAGT AATAAACGAA CTTCCAGATG GATGCATTGA TTACATTTGC GTAACAGCAA TGCGCAATCC TGAAGAAGGA TGCGAAAGCA CTACTGCCGC TTACGAATTG GAAGCAAATC ACACAACGCT GGACGAAGCG AAAATTAATA CGATTTGTTC CGCAAGTGAT TTTCCAGTTC TAGTTGGCGG AAGAACTGCA CTCGACGATA TCGATACAAT CGCTCACACC AAGGCTGCAG GATGGTTCGT TTCTGAAGCA TTGTATTCTT CAGAAACACC AGAATCAACT ATGCGTGAAT TTGTTGAACA TTGGAAAGCT GTGCGAGGTG AAGAAAAGCA CGGCTACGCT AAGCGAGTGA TAGTTGCAGA AAATTCTGAA TCAAAATCTT CAGAAACTCA AGAAAAGAAG CCAACTTTTA TTAATGCGAA AGAAGCAAAG GATGCTGCGA AATTAGCTAA ACAACAGCGA GTTGACATTG CAGCTCGCGG ATGCACTCAG CGCGATAAAG CTCATATTCG CAAAACAACT CCAATTCATT TTGAGTATGA ATATGGTTCT TATGATTTGG AAGTTCCTTA TACGGAAATT AAGCTTTCGG ATACTCCTGG CGTAGGTCCT AACCCGCCTT TTAAGGATTA CAATACAGAA GGTCCAAAGT GCGATCCGAA GGAAGGTTTG GCTCCGCTTC GCCTTGACTG GATTCGCGAC CGCGGTGACG TTGTGGAATA TGAGGGTCGC AGGCGCAATC TTCAAGACGA CGGCAAGCGC GCAATTAAGC GAGGCAAAGC TTCTAAAGAA TGGCGTGGAC GCACGCATAA GCCAATGAAG GGCGCGGATC ATCCGATTAC ACAAATGTGG TACGCTCGCC ACGGAATCAC TACTCCGGAA ATGCAATATG TTGCAACGCG CGAAAATTGC GATGTAGAGC TGGTTCGCGA AGAAGTTGCA GCCGGACGTG CTGTAATTCC TTGCAATATT AACCATCCTG AAGCTGAACC TATGATTATT GGCTCGCGCT TCTTGACTAA GCTCAACGCA AACATGGGTA ATTCTGCTGT TACGTCATCT ATCGACGAAG AAGTAGAAAA GCTTACGTGG GCCACGAAGT GGGGTGCGGA TACCGTTATG GATCTTTCCA CCGGTAACGA TATTCACACA ACGCGCGAAT GGATTTTGCG CAACTCCCCT GTGCCAATTG GAACAGTGCC AATGTATCAG GCTTTGGAAA AGGTTGAGGA TGATGCTTCT AAGCTCAGCT GGGAGCTTTT CCGCGACACT GTTATTGAGC AGTGCGAGCA GGGCGTTGAC TACATGACTA TTCACGCTGG CGTGCTTCTT CGCTACGTGC CGCTTACTGC AAACCGCGTA ACCGGTATTG TTTCTCGTGG TGGCTCAATT ATGGCTGAAT GGTGCTTGCA ACATCATCAA GAGAGCTTCT TGTATACGCA CTTTGAAGAA TTATGCGAGA TTTTCGCAAA GTACGATGTT GCATTCTCTT TGGGTGATGG TTTGCGTCCA GGTAGCTTGG CTGATGCTAA CGATGCGGCT CAGCTTTCCG AGCTTATGAC GCTTGGCGAG CTTACGAAGA TCGCTTGGAA GCATGACGTA CAGGTGATGA TTGAAGGTCC TGGTCACGTG CCATTCGACA CTGTGCGTAT GAATATTGAG ATGGAAAAGG CAATTTGCCA GAATGCTCCA TTCTATACGC TTGGTCCTTT GACTACGGAT ACCGCACCTG GCTATGACCA CATTACTTCC GCAATTGGTG GCGTGGAGAT TGCGCGATAC GGCACCGCAA TGCTTTGCTA TGTGACTCCT AAGGAACATT TGGGGTTGCC TAACAAGGAT GACGTGAAGC AAGGCGTGAT TGCGTATAAG ATCGCTTGCC ACGCAGCTGA TCTTGCTAAG CATCATCCAC ATGCTATGGA TCGCGACAAC GCAATCAGTA AGGCTCGCTT TGAGTTCCGC TGGTTGGATC AGTTCAACTT AAGCTATGAT CCAGATACCG CAATCGCCTT CCACGACGAA ACACTTCCTG CAGAACCAGC AAAAATGGCG CACTTCTGCT CGATGTGCGG ACCAAAGTTC TGCTCGATGG CTATTTCGCA AAATATTCGT AAGCGTTTTG GCGGAGCAGC TCAGCAGGAG CAGCTCGTTG AAGAAGCACG CAGTCAGGCA ATTGCCGATG GTATGAAAGA GATGAGCAAA AAGTTCCAAG AATCCGGCTC ATCGTTGTAT CAAAGCGTGA AAGCATAA
|
Protein sequence | MTSNYPYASM RNQFNLSACF IADPQACNNR PLTDIVDDAL RAGATFIRLH CNNENAKEIT TIARDIAQII EDNNKCDSVT FVIDERVDVV WQARNQSIKV DGVHLAQSDM EPREARALLG EDAVIGLSVE TESLVKVINE LPDGCIDYIC VTAMRNPEEG CESTTAAYEL EANHTTLDEA KINTICSASD FPVLVGGRTA LDDIDTIAHT KAAGWFVSEA LYSSETPEST MREFVEHWKA VRGEEKHGYA KRVIVAENSE SKSSETQEKK PTFINAKEAK DAAKLAKQQR VDIAARGCTQ RDKAHIRKTT PIHFEYEYGS YDLEVPYTEI KLSDTPGVGP NPPFKDYNTE GPKCDPKEGL APLRLDWIRD RGDVVEYEGR RRNLQDDGKR AIKRGKASKE WRGRTHKPMK GADHPITQMW YARHGITTPE MQYVATRENC DVELVREEVA AGRAVIPCNI NHPEAEPMII GSRFLTKLNA NMGNSAVTSS IDEEVEKLTW ATKWGADTVM DLSTGNDIHT TREWILRNSP VPIGTVPMYQ ALEKVEDDAS KLSWELFRDT VIEQCEQGVD YMTIHAGVLL RYVPLTANRV TGIVSRGGSI MAEWCLQHHQ ESFLYTHFEE LCEIFAKYDV AFSLGDGLRP GSLADANDAA QLSELMTLGE LTKIAWKHDV QVMIEGPGHV PFDTVRMNIE MEKAICQNAP FYTLGPLTTD TAPGYDHITS AIGGVEIARY GTAMLCYVTP KEHLGLPNKD DVKQGVIAYK IACHAADLAK HHPHAMDRDN AISKARFEFR WLDQFNLSYD PDTAIAFHDE TLPAEPAKMA HFCSMCGPKF CSMAISQNIR KRFGGAAQQE QLVEEARSQA IADGMKEMSK KFQESGSSLY QSVKA
|
| |