Gene CPR_1011 details


Gene Information

Locus tag: CPR_1011
Symbol: dhaT
ID: 4206140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1151232
End bp: 1152389
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 35%
IMG OID: 642565568
Product: 1,3-propanediol dehydrogenase
Protein accession: YP_698334
Protein GI: 110803773
COG category: [C] Energy production and conversion
COG ID: [COG1454] Alcohol dehydrogenase, class IV
TIGRFAM ID:
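
The start, end, and length values in this record are linked by simple arithmetic. The sketch below shows that relationship under two assumptions the record does not state explicitly: coordinates are 1-based and inclusive, and the gene length includes the stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1151232, 1152389)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```

Both results agree with the Gene Length and Protein Length fields listed in the record.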


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000128902
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATGT ACGATTATTT AGTACCAAGT GTAAACTTTA TGGGAGCTAA CTCAATATCA 
GTAGTTGGTG AAAGATGTAA AATATTAGGT GGAAAGAAAG CTTTAATAGT TACAGATAAA
TTCTTAAGAG GATTAAAAGG GGGAGCAGTT GAATTAACTG AAAAATACCT AAAAGAAGCA
GGAATCGAAG TTGCTTATTA TGATGGAGTT GAACCAAATC CAAAAGATAC AAATGTTAAA
GATGGTTTAA AAATATTCAA AGACGAAAAC TGTGATATGA TAGTTACAGT TGGTGGAGGA
AGCTCACATG ACTGTGGTAA AGGAATAGGT ATAGCTGCAA CTCACGAAGG AGATCTTTAT
GACTATGCTG GAATAGAAAC TTTAACAAAT CCACTTCCTC CAATAGTAGC AGTAAACACT
ACAGCTGGAA CTGCAAGTGA AGTAACTAGA CACTGTGTTA TAACAAACAC TAAAACTAAA
GTTAAATTCG TTATAGTAAG CTGGAGAAAC TTACCTTTAG TTTCAATCAA TGACCCAATG
TTAATGGTTG GAAAACCAGC AGGATTAACA GCTGCAACAG GAATGGACGC TTTAACTCAT
GCTGTAGAAG CATATGTATC AAAAGATGCT AACCCTGTAA CAGATGCTGC TGCAATACAA
GCTATAAAAT TAATATCAAG CAATTTAAGA CAAGCTGTTG CTTTAGGAGA AAACTTAGTA
GCTAGAGAAA ACATGGCTTA CGGTTCATTA TTAGCTGGTA TGGCATTTAA CAATGCTAAC
TTAGGATATG TACATGCTAT GGCTCACCAA TTAGGCGGAT TATATGATAT GCCTCACGGA
GTAGCTAACG CTATGTTATT ACCACACGTA TGTAAATACA ACTTAATATC TAACCCACAA
AAATTTGCTG ATATAGCTGA ATTCATGGGA GAAAACATAG AAGGATTATC AGTAATGGAT
GCTGCTCAAA AAGCTATAGA TGCAATGTTC AGATTATCAA CTGATATCGG AATACCAGCA
AAATTAAGAG ACATGGGAGT AAAAGAAGAA GACTTCGGAT ACATGGCTGA AATGGCTCTT
AAAGATGGTA ATGCATTCAG TAACCCAAGA AAAGGTAACG AAAGAGACAT CGTTGAAATA
TTCAAAGCTG CATTCTAA
 
Protein sequence
MRMYDYLVPS VNFMGANSIS VVGERCKILG GKKALIVTDK FLRGLKGGAV ELTEKYLKEA 
GIEVAYYDGV EPNPKDTNVK DGLKIFKDEN CDMIVTVGGG SSHDCGKGIG IAATHEGDLY
DYAGIETLTN PLPPIVAVNT TAGTASEVTR HCVITNTKTK VKFVIVSWRN LPLVSINDPM
LMVGKPAGLT AATGMDALTH AVEAYVSKDA NPVTDAAAIQ AIKLISSNLR QAVALGENLV
ARENMAYGSL LAGMAFNNAN LGYVHAMAHQ LGGLYDMPHG VANAMLLPHV CKYNLISNPQ
KFADIAEFMG ENIEGLSVMD AAQKAIDAMF RLSTDIGIPA KLRDMGVKEE DFGYMAEMAL
KDGNAFSNPR KGNERDIVEI FKAAF
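
The gene and protein sequences can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method. It assumes the sequences are pasted as whitespace-separated blocks, as shown here, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs only in the start codons it permits; this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_blocks = "ATGAGAATGT ACGATTATTT AGTACCAAGT"
print(translate(first_blocks))  # MRMYDYLVPS
```

Passing the full gene sequence to `translate` and `gc_percent` should allow a comparison with the listed protein sequence and the GC content field.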