Gene Ppro_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpro_1034 
Symbol 
ID4572774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelobacter propionicus DSM 2379 
KingdomBacteria 
Replicon accessionNC_008609 
Strand
Start bp1083519 
End bp1085297 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content63% 
IMG OID639755074 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_900718 
Protein GI118579468 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAGA ACAGGAACTG CCCCAGGGGC ACGAACAGCG CGGAGAGCGC GGTAACGGGG 
GAGTTTCCCC ATTCCCGGAG GGCCTATCTG ACCGGTTCGC GGCCTGACCT GCGGGTGCCG
ATGAGGGAAA TCCTCCTCAC CGACTCTCCA GGGGGGGATG GAGCGGCTAA GCCCGCTTCG
ATCCATGTCT ACGACACAGC CGGCCCCTAT GGCGATGCCA CGCTTTGCTG CGATATCCGC
TCAGGCTTGG CCGGCCTGCG CGAGAACTGG ATTGCCGAAC GCTACGACAC TGAAACGTAT
TCGGATCCGG CACTTCTGTC GGAAAAGAGC AACTGTGCCG GCCGCGGGTC TGGCGCTGGC
GCCTTTCCCG TCAGCCGCGC CCCCCGCCGC TCCCGCTCTG GCGGCAACGT GACCCAGATG
CACTATGCCC GCCAGGGCAT TGTCACCCCC GAGATGGAGT TTGTCGCCAT TCGGGAAAAC
CAGCGCCGGG AGGGAGCGGA TACGCAACAG CAGACTCCTC ATCCCGTTCG TGAGGGGGGC
ATGTCACGCC CCGCAGTAAT CACCCCGGAA TACGTCAGGT CCGAAGTTGC CCGCGGCAGG
GCCATTATTC CGCTCAATAT CAACCACCCG GAGGCTGAGC CGATGATCAT CGGCCGTAAC
TTCCTGGTTA AGGTCAACGC CAATATCGGC AATTCCGCCC TGGCCTCCTC TGTCATGGAC
GAGGTGGAAA AGATGATCTG GGCCATCCGC TGGGGTGCGG ATACGGTGAT GGACCTCTCC
ACCGGCGCCC ACATCCACGA GACGCGCGAG TGGATACTGC GCAACAGCCC GGTTCCCATT
GGCACGGTGC CCATCTACCA GGCCCTGGAG AAGGTAGCCG GCGTGGCCGA GGATCTCAGC
TGGGACGTCT TCCGCGATAC CCTGGTGGAG CAGGCGGAGC AGGGGGTGGA CTACTTCACC
ATCCATGCCG GGGTGAGGCT GGAGCATCTG CCGCTGACCT CGCGACGCCT GACCGGCATC
GTGTCCCGCG GCGGCTCGAT CATGGCCAAA TGGTGTCATG CCCACAGATG CGAAAGTTTC
CTGTACACCC GCTTCGAGGA GATCTGCGAG ATCATGAAGG CCTACGACGT CAGCTTCTCC
CTGGGGGATG GCATGCGTCC CGGCTCGCTG CACGATGCCA ATGACGAGGC CCAGTTCGCC
GAACTGAAGA CCCTGGGCGA GCTGACCCGG CTGGCCTGGA AGCATGACGT GCAGACCATG
ATAGAGGGGC CGGGGCATGT GCCCCTGCAC CTGATCAGGG AAAACATGGA GCTGCAGCTG
GAGCAGTGCC ACGAAGCGCC GTTCTATACC CTGGGGCCGC TGGTCACCGA TGTGGCGCCG
GGATACGACC ACATCACCTC GGCCATCGGC GGGGCCATGA TCGGCTGGCT GGGCACCTCC
ATGCTCTGCT ACGTCACCCG CAAGGAGCAC CTGGGCCTGC CGGACAAAGA TGATGTCAAG
GAGGGGATTG TCACCTTCAA GATCGCCGCC CATGCCGCCG ACCTGGCAAA GGGGCATCCC
GGCGCCCGAC TGCGGGACGA TGCCCTCTCC AAGGCGCGCT TTGAATTCCG CTGGAAGGAC
CAGTTCAACC TGGCCCTCGA CCCGGAAATG CCGCAACGCC TGCACGACCT GACCCTGCCC
GCCGAGGCTG ACAAGGCATC CCATTTCTGC TCCATGTGCG GCCCGGATTT CTGCGCCATG
AGAATCACGC GCAATATCCG TGAACAGGCG GGAGCGTAA
 
Protein sequence
MNENRNCPRG TNSAESAVTG EFPHSRRAYL TGSRPDLRVP MREILLTDSP GGDGAAKPAS 
IHVYDTAGPY GDATLCCDIR SGLAGLRENW IAERYDTETY SDPALLSEKS NCAGRGSGAG
AFPVSRAPRR SRSGGNVTQM HYARQGIVTP EMEFVAIREN QRREGADTQQ QTPHPVREGG
MSRPAVITPE YVRSEVARGR AIIPLNINHP EAEPMIIGRN FLVKVNANIG NSALASSVMD
EVEKMIWAIR WGADTVMDLS TGAHIHETRE WILRNSPVPI GTVPIYQALE KVAGVAEDLS
WDVFRDTLVE QAEQGVDYFT IHAGVRLEHL PLTSRRLTGI VSRGGSIMAK WCHAHRCESF
LYTRFEEICE IMKAYDVSFS LGDGMRPGSL HDANDEAQFA ELKTLGELTR LAWKHDVQTM
IEGPGHVPLH LIRENMELQL EQCHEAPFYT LGPLVTDVAP GYDHITSAIG GAMIGWLGTS
MLCYVTRKEH LGLPDKDDVK EGIVTFKIAA HAADLAKGHP GARLRDDALS KARFEFRWKD
QFNLALDPEM PQRLHDLTLP AEADKASHFC SMCGPDFCAM RITRNIREQA GA