Gene PG2110 details


Gene Information

Locus tag: PG2110
Symbol: thiC
ID: 2551489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 2219539
End bp: 2221302
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 57%
IMG OID: 637150688
Product: thiamine biosynthesis protein ThiC
Protein accession: NP_906170
Protein GI: 34541691
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
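
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2219539
END_BP = 2221302


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1764
print(protein_length(length_bp))  # 587
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.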


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.675546
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAAT TCAAAGTGAC TACCGGTCCT CTGCCCGGCA GTGAGAAAAT CTATGTCGAA 
GGAGAGCGTT TCCCCTTCTT GCGCGTACCG ATGCGGCGCA TCCGAATGTC CGACACCATA
TTGGAGAACG GCGAACGTGA GAAAAACGAA GATGTAGTCG TATACGATAC CAGTGGCCCC
TATACCGATA CCTCCTATGA GGTGAATCTG CACCGAGGCG TACCGAAGAT ACGCGAGCAG
TGGATAGAGG ATCGGGGCGA TACGGTGCGG CTCGAAGGGC TCAGCTCCGA ATACGGACGG
ATAAGGCAGT CGGACGCTTC GCTCGAAAAG CTGCGTTACG AGCATGTGTG CACGCGTCCC
CGTGCCGCCA AGGACGGCTG TGCCACGCAG CTCTACTACG CCCGTCAGGG GATCGTGACG
CCGGAGATGG AGTTCGTGGC CATCCGCGAA AATCAGTTGA TCGATCAGGT CAGGACGCGC
TATCGCGCTG AGGAGGGTGA GCCGCTCGGA GCTGTTATTC CGCGCAAGAT CACGCCCGAA
TTTGTACGCG ACGAGATTGC CGCCGGACGG GCTATCCTTC CGGCCAATAT CAATCATCCG
GAAAGTGAGC CGATGATCAT CGGGCGCAAT TTCCTCGTCA AGATCAATGC GAACATAGGC
AATTCGCCCA TCAGCAGTAC CATCGAGGAA GAGGTGGAAA AGGCCGTCTG GGCCATACGC
TGGGGTGCCG ATACGGTCAT GGATCTCTCC ACGGGGGATC ATATCCATGA GACGCGCGAG
TGGATCATCC GCAATTCGCC CGTGCCCATC GGCACTGTGC CCCTCTACCA GACGCTGGAG
AAGGTGCAGG GCGATGTGAC GAAGCTCAAC TGGGAGATAT TCCGCGATAC GCTCATCGAG
CAGGCCGAGC AGGGTGTGGA CTACTTCACC ATCCACGCCG GTCTGCGTTG GCACCACGTG
CCTCTGACCT TGCGCCGCCT CACGGGGATC GTCTCCCGCG GTGGTTCCAT CATCGCCAAC
TGGTGCACCA CCCACAAGCG CGAAAGTTTC ATCTACGAGC ATTTCGAAGA GATCTGCCAA
ATCCTCGCAC GCTACGACGT AGCCATATCT CTCGGCGATG GCTTGCGCCC GGGCTGCATC
CACGACGCCA ACGATGCTGC GCAGATAGCT GAGCTGAAGA CGCTGGGCGA ACTTACCGAG
ATCGCTTGGA AGTATAACGT GCAAACCATT ATCGAAGGAC CGGGACACGT GCCCATGCAC
AAGATCCGCG AGAATATGGA GATTCAACTC GAAGCCTGCC ATGGCGCACC CTTCTACACT
CTCGGCCCGT TGGTCAGCGA CGTGGCGTCC GGCTACGACC ATATCACATC GGCTATCGGC
GCGGCACAGA TCGGATGGTT CGGCACAGCC ATGCTCTGCT ATGTGACGCA AAAGGAGCAT
TTGGGTCTGC CCAACCGCGA AGATGTACGT GAAGGTGTAG TAACCTATAG ACTGGCTGCT
CATGCCGCCG ACTTGGCCAA AGGACACCCC ACGGCGTACT GGCGCGACTA TATGATGAGC
AAGGCGCGGT TCGAATTCCG CTGGAAGGAT CAGTTCCATC TCTCGCTTGA TCCAGAGAAA
GCGATCCAAT TCCACGATGC CACGCTGCCG GACGAAGGCC ACAAAGAGGC GCATTTCTGC
TCCATGTGCG GCGAACACTT CTGCTCCATG CGTGCCAATA AGAACTTCCG CAAGTTGCTA
AACGAAGAGG CAGTCTCCAA ATGA
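
The GC content listed in the gene information is the percentage of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from a sequence laid out in 10-base blocks as above. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(text):
    """Strip the spaces and line breaks used for display blocks."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGAAAGAAT TCAAAGTGAC"
print(gc_percent(first_blocks))  # 30.0
```

Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.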
 
Protein sequence
MKEFKVTTGP LPGSEKIYVE GERFPFLRVP MRRIRMSDTI LENGEREKNE DVVVYDTSGP 
YTDTSYEVNL HRGVPKIREQ WIEDRGDTVR LEGLSSEYGR IRQSDASLEK LRYEHVCTRP
RAAKDGCATQ LYYARQGIVT PEMEFVAIRE NQLIDQVRTR YRAEEGEPLG AVIPRKITPE
FVRDEIAAGR AILPANINHP ESEPMIIGRN FLVKINANIG NSPISSTIEE EVEKAVWAIR
WGADTVMDLS TGDHIHETRE WIIRNSPVPI GTVPLYQTLE KVQGDVTKLN WEIFRDTLIE
QAEQGVDYFT IHAGLRWHHV PLTLRRLTGI VSRGGSIIAN WCTTHKRESF IYEHFEEICQ
ILARYDVAIS LGDGLRPGCI HDANDAAQIA ELKTLGELTE IAWKYNVQTI IEGPGHVPMH
KIRENMEIQL EACHGAPFYT LGPLVSDVAS GYDHITSAIG AAQIGWFGTA MLCYVTQKEH
LGLPNREDVR EGVVTYRLAA HAADLAKGHP TAYWRDYMMS KARFEFRWKD QFHLSLDPEK
AIQFHDATLP DEGHKEAHFC SMCGEHFCSM RANKNFRKLL NEEAVSK
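
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a hand-rolled illustration rather than the tool that produced this record. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code for internal codons (the two tables differ in which codons may act as start codons). The function names are hypothetical, and the inputs are the first three and last three blocks of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAAGAAT TCAAAGTGAC TACCGGTCCT"))  # MKEFKVTTGP
print(translate("AACGAAGAGG CAGTCTCCAA ATGA"))        # NEEAVSK
```

The two outputs match the first and last residues of the protein sequence listed above. The final `TGA` codon is a stop codon and contributes no residue.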