Gene Pden_2944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_2944 
Symbol 
ID4581511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008687 
Strand
Start bp111383 
End bp113182 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content68% 
IMG OID639770271 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_916724 
Protein GI119385669 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.858401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA CCATCCCGCA AATCACCACC GGCCCGCTGC CCGGATCGCG CAAGATCCAT 
GTGCCGGGCA GCCTGCACGA CATCCGCGTG CCCATGCGCG AGATCGCCGT CTCGAACGAG
CCGCCGCTGG TCGTCTATGA CAGCTCCGGC CCCTATACCG ACGCGGCGGT GCAGGCCGAC
ATCGCCCGCG GCCTGCCGGA CCTGCGCGGC GACTGGCAGC TGCGGCGCGG CGACGTGGCG
CCCTATCCCG GCCGGCAGGT CACCGCCGCC GACAACGGCT TTGCCGAGGG CGCGCGGCTG
ACCCCCGCCT TTCCGCTGCG CCGCGATCCC CTGCGCGCGG CGGCGGGCCG GGCGGTGACG
CAGCTGGCCT ATGCCCGCGC CGGCATCGTC ACCCCCGAGA TGGAGTTCGC GGCCATCCGC
GAGAACGAGG GGCGGCTGGT CGCCCATGCC CGCGACGGCG CGCCGATGGG GGCCGAGCTG
CCCGATCTGG TGACGCCGGA ATTCGTGCGG GCCGAGATCG CCGCCGGCCG CGCCATCATC
CCGGCCAACA TCAACCACCG CGAGTTGGAG CCGATGATCA TCGGCCGCAA TTTCAAGGTC
AAGATCAACG CCAATATCGG CAACTCCGCC GTCACCTCCA GCATGGAGGA GGAGGTCGAG
AAGATGGTCT GGGCGATCCG CTGGGGCGCG GACACGGTGA TGGACCTGTC CACCGGCCGC
AACATCCACA ACATCCGCGA CTGGATCATC CGCAACGCGC CGGTGCCCAT CGGCACCGTG
CCGCTGTATC AGGCGCTGGA GAAGGTCGGC GGCGTGGCCG AGGATCTGAG CTGGGAGGTG
TTTCGGGACA CGCTGGTCGA ACAGGCCGAG CAGGGCGTGG ACTATTTCAC CATCCATGCC
GGGGTGCGGC TGCACATGAT CCCGCTGACC GCGCGGCGGG TGACGGGGAT CGTCAGCCGC
GGCGGCTCGA TCATGGCGAA ATGGTGCCTG CACCACCACC GCGAGAGCTT CCTGTATGAG
CGCTTCGACG AGATCTGCGA GATCATGCAG GCCTATGACG TCAGCTTCAG CCTGGGCGAC
GGGCTGCGTC CCGGCTCGAT CGCCGATGCC AATGACGAGG CGCAATGCGC CGAACTGCGC
ACCCTGGGCG AGCTGACGAA GATCGCCTGG GCCCGGGATT GCCAGGTGAT GATCGAGGGG
CCGGGCCATG TGCCGATGCA CAAGATCAAG GCCAATATGG AGGAGCAGCT GCGGCATTGC
CACGAGGCGC CGTTCTATAC GCTTGGCCCG CTGACCACCG ATATCGCACC GGGCTACGAC
CACATCACCT CGGCCATCGG GGCGGCGATG ATCGGCTGGT TCGGCACGGC GATGCTGTGC
TATGTCACGC CCAAGGAGCA TCTGGGCCTG CCCGACCGCG ACGACGTCAA GACCGGCGTC
ATCACCTACA AGCTGGCCGC CCATGCCGCC GATCTGGCCA AGGGCCATCC CGGGGCGCAG
CGGCGCGACG ATGCGCTGTC GCGCGCGCGG TTCGAGTTCC GCTGGCAGGA CCAGTTCAAC
CTGGGCCTGG ACCCGGACAC CGCGCAGGCC ATGCATGACG AGACCCTGCC GAAAGAGGCG
CACAAGCTGG CGCATTTCTG TTCGATGTGC GGGCCGAAGT TCTGCTCGAT GCGGATCTCG
CACGACATCC GGGCCGAGGC TGAAAAGCAG GCCGGCATGG CGCGCATGGC CGAGAAGTTC
CGCGAGGGCG GGGCGCTTTA CCTGCCGGTT GCGGAAAGCG TGGCGGAGGC GGCCGATTGA
 
Protein sequence
MSQTIPQITT GPLPGSRKIH VPGSLHDIRV PMREIAVSNE PPLVVYDSSG PYTDAAVQAD 
IARGLPDLRG DWQLRRGDVA PYPGRQVTAA DNGFAEGARL TPAFPLRRDP LRAAAGRAVT
QLAYARAGIV TPEMEFAAIR ENEGRLVAHA RDGAPMGAEL PDLVTPEFVR AEIAAGRAII
PANINHRELE PMIIGRNFKV KINANIGNSA VTSSMEEEVE KMVWAIRWGA DTVMDLSTGR
NIHNIRDWII RNAPVPIGTV PLYQALEKVG GVAEDLSWEV FRDTLVEQAE QGVDYFTIHA
GVRLHMIPLT ARRVTGIVSR GGSIMAKWCL HHHRESFLYE RFDEICEIMQ AYDVSFSLGD
GLRPGSIADA NDEAQCAELR TLGELTKIAW ARDCQVMIEG PGHVPMHKIK ANMEEQLRHC
HEAPFYTLGP LTTDIAPGYD HITSAIGAAM IGWFGTAMLC YVTPKEHLGL PDRDDVKTGV
ITYKLAAHAA DLAKGHPGAQ RRDDALSRAR FEFRWQDQFN LGLDPDTAQA MHDETLPKEA
HKLAHFCSMC GPKFCSMRIS HDIRAEAEKQ AGMARMAEKF REGGALYLPV AESVAEAAD