Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG2110 |
Symbol | thiC |
ID | 2551489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | - |
Start bp | 2219539 |
End bp | 2221302 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637150688 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | NP_906170 |
Protein GI | 34541691 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.675546 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAT TCAAAGTGAC TACCGGTCCT CTGCCCGGCA GTGAGAAAAT CTATGTCGAA GGAGAGCGTT TCCCCTTCTT GCGCGTACCG ATGCGGCGCA TCCGAATGTC CGACACCATA TTGGAGAACG GCGAACGTGA GAAAAACGAA GATGTAGTCG TATACGATAC CAGTGGCCCC TATACCGATA CCTCCTATGA GGTGAATCTG CACCGAGGCG TACCGAAGAT ACGCGAGCAG TGGATAGAGG ATCGGGGCGA TACGGTGCGG CTCGAAGGGC TCAGCTCCGA ATACGGACGG ATAAGGCAGT CGGACGCTTC GCTCGAAAAG CTGCGTTACG AGCATGTGTG CACGCGTCCC CGTGCCGCCA AGGACGGCTG TGCCACGCAG CTCTACTACG CCCGTCAGGG GATCGTGACG CCGGAGATGG AGTTCGTGGC CATCCGCGAA AATCAGTTGA TCGATCAGGT CAGGACGCGC TATCGCGCTG AGGAGGGTGA GCCGCTCGGA GCTGTTATTC CGCGCAAGAT CACGCCCGAA TTTGTACGCG ACGAGATTGC CGCCGGACGG GCTATCCTTC CGGCCAATAT CAATCATCCG GAAAGTGAGC CGATGATCAT CGGGCGCAAT TTCCTCGTCA AGATCAATGC GAACATAGGC AATTCGCCCA TCAGCAGTAC CATCGAGGAA GAGGTGGAAA AGGCCGTCTG GGCCATACGC TGGGGTGCCG ATACGGTCAT GGATCTCTCC ACGGGGGATC ATATCCATGA GACGCGCGAG TGGATCATCC GCAATTCGCC CGTGCCCATC GGCACTGTGC CCCTCTACCA GACGCTGGAG AAGGTGCAGG GCGATGTGAC GAAGCTCAAC TGGGAGATAT TCCGCGATAC GCTCATCGAG CAGGCCGAGC AGGGTGTGGA CTACTTCACC ATCCACGCCG GTCTGCGTTG GCACCACGTG CCTCTGACCT TGCGCCGCCT CACGGGGATC GTCTCCCGCG GTGGTTCCAT CATCGCCAAC TGGTGCACCA CCCACAAGCG CGAAAGTTTC ATCTACGAGC ATTTCGAAGA GATCTGCCAA ATCCTCGCAC GCTACGACGT AGCCATATCT CTCGGCGATG GCTTGCGCCC GGGCTGCATC CACGACGCCA ACGATGCTGC GCAGATAGCT GAGCTGAAGA CGCTGGGCGA ACTTACCGAG ATCGCTTGGA AGTATAACGT GCAAACCATT ATCGAAGGAC CGGGACACGT GCCCATGCAC AAGATCCGCG AGAATATGGA GATTCAACTC GAAGCCTGCC ATGGCGCACC CTTCTACACT CTCGGCCCGT TGGTCAGCGA CGTGGCGTCC GGCTACGACC ATATCACATC GGCTATCGGC GCGGCACAGA TCGGATGGTT CGGCACAGCC ATGCTCTGCT ATGTGACGCA AAAGGAGCAT TTGGGTCTGC CCAACCGCGA AGATGTACGT GAAGGTGTAG TAACCTATAG ACTGGCTGCT CATGCCGCCG ACTTGGCCAA AGGACACCCC ACGGCGTACT GGCGCGACTA TATGATGAGC AAGGCGCGGT TCGAATTCCG CTGGAAGGAT CAGTTCCATC TCTCGCTTGA TCCAGAGAAA GCGATCCAAT TCCACGATGC CACGCTGCCG GACGAAGGCC ACAAAGAGGC GCATTTCTGC TCCATGTGCG GCGAACACTT CTGCTCCATG CGTGCCAATA AGAACTTCCG CAAGTTGCTA AACGAAGAGG CAGTCTCCAA ATGA
|
Protein sequence | MKEFKVTTGP LPGSEKIYVE GERFPFLRVP MRRIRMSDTI LENGEREKNE DVVVYDTSGP YTDTSYEVNL HRGVPKIREQ WIEDRGDTVR LEGLSSEYGR IRQSDASLEK LRYEHVCTRP RAAKDGCATQ LYYARQGIVT PEMEFVAIRE NQLIDQVRTR YRAEEGEPLG AVIPRKITPE FVRDEIAAGR AILPANINHP ESEPMIIGRN FLVKINANIG NSPISSTIEE EVEKAVWAIR WGADTVMDLS TGDHIHETRE WIIRNSPVPI GTVPLYQTLE KVQGDVTKLN WEIFRDTLIE QAEQGVDYFT IHAGLRWHHV PLTLRRLTGI VSRGGSIIAN WCTTHKRESF IYEHFEEICQ ILARYDVAIS LGDGLRPGCI HDANDAAQIA ELKTLGELTE IAWKYNVQTI IEGPGHVPMH KIRENMEIQL EACHGAPFYT LGPLVSDVAS GYDHITSAIG AAQIGWFGTA MLCYVTQKEH LGLPNREDVR EGVVTYRLAA HAADLAKGHP TAYWRDYMMS KARFEFRWKD QFHLSLDPEK AIQFHDATLP DEGHKEAHFC SMCGEHFCSM RANKNFRKLL NEEAVSK
|
| |