Gene CPR_1004 details


Gene Information

Locus tag: CPR_1004
Symbol: dhaB
ID: 4206440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1145256
End bp: 1146920
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 35%
IMG OID: 642565561
Product: glycerol dehydratase, alpha subunit
Protein accession: YP_698327
Protein GI: 110802661
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4909] Propanediol dehydratase, large subunit
TIGRFAM ID:
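
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1145256, 1146920)
print(length)                     # 1665
print(protein_length_aa(length))  # 554
```

Both values match the Gene Length and Protein Length fields listed for CPR_1004.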


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.872763
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAATCTA AAAGATTCCA AGTATTATCA GAACGTCCTG TAAACCAAGA TGGACTTATA 
GGAGAGTGGG CTGATGAAGG CTTAATAGCT TTAGATAGTC CAAATGATCC AAAATCATCA
ATAAAAATAG AAAATGGAAT AATTACTGAA TTAGACGGTA GATCAAGAGA TGAGTTTGAT
ATGATAGATA AATTTATAGC AGAGTACGCT ATAAATATAG AAGACGCAGA AGCATCTATG
AAACTTTCAT CTAAAGAAAT AGCAAGAAGA TTAGTTGATA TAAATGTTAG TAGAGATGAA
ATAGTAAAAA TCACTACTTC AATAACACCA ATGAAGGCTG TAGAAGTTAT TCAAGAAATG
AACGTTGTTG AAATGATGAT GGCTCTTCAA AAAATGAGAG CAAGAAGAAC ACCTGCTAAC
CAATGTCACG TTACTAACGT AAAAGACAAC CCAGTTCAAA TAGCAGCAGA TGCTGCAGAG
GCTGCTTTAA GAGGATTTGC AGAGCAAGAA ACTACAGTAG GTATAGTTAG ATATGCACCT
TTTAATGCAT TAGCTATCTT AGTAGGTTCA CAAGTAGGTA GAGGAGGAGT TTTAACTCAA
TGTGCAGTTG AGGAAGCTAC TGAACTTGAC CTAGGAATGA GAGGACTTAC AAGTTATGCA
GAAACAGTTT CAGTTTATGG AACAGAATCA GTATTTACAG ATGGAGATGA TACTCCATGG
TCAAAAGCAT TCTTAGCATC AGCTTATGCT TCAAGAGGAC TTAAGATGAG ATTTACATCA
GGTTCAGGTT CAGAAGCATT AATGGGATAC TCAGAAGGTA GATCAATGCT TTACTTAGAA
TCAAGATGTA TATATATAAC TAAGGGAGCT GGAGTTCAAG GATTACAAAA TGGTGCAGTT
AGTTGTATAG GTATGACAGG AGCAGTTCCA TCAGGAATAA GAGCAGTTCT TGGAGAAAAC
TTAATAGCTG CAATGCTTGA TATAGAGGTT GCATCAGCAA ATGACCAAAC ATTCTCACAC
TCAGACATAA GAAGAACAGC AAGAATGTTA ATGCAAATGC TTCCAGGAAC AGACTTCATA
TTCTCAGGAT ATAGTGCAGT TCCAAACTAC GATAACATGT TTGCTGGATC AAACTTTGAT
GCAGAAGACT TTGATGACTA CAACATACTT CAAAGAGACT TAAAAGTTGA CGGTGGATTA
AGACCAGTTA CAGAAGAAGA AACTATAAAG GTTAGAAATA AAGCTGCCAA ATGCATACAA
ATAATCTTTA GAGAATTAGG ATTCCCAGAA GTTACTGATG AAGAAGTAGA AGCTGCAACT
TACTGTCACG GAAGTAAGGA AATGCCAAAC AGAAATGTAG TTGAAGATTT AAAGGCTGCA
GAAGAAATGT TAGAAAGAAG AATAACAGGA TTAGATATAA TAAAAGCTTT AAGCAAAAAT
GGTATGGAAG ATATAGCAAA CAATTTATTA AACATGCTTA AGCAAAGAGT TACTGGAGAT
TATCTTCAAA CTTCAGCAAT CTTAGATAAA GATTTCAATG TTATAAGTGC TGTTAATGAT
GTAAATGACT ATATGGGACC TGGAACAGGA TATAGACTAG ATGGTCAAAG ATGGGAAGAA
ATCAAAAAAG TTCCTACAGT AATGAGACCA GAGGATATAG AGTAG
 
Protein sequence
MKSKRFQVLS ERPVNQDGLI GEWADEGLIA LDSPNDPKSS IKIENGIITE LDGRSRDEFD 
MIDKFIAEYA INIEDAEASM KLSSKEIARR LVDINVSRDE IVKITTSITP MKAVEVIQEM
NVVEMMMALQ KMRARRTPAN QCHVTNVKDN PVQIAADAAE AALRGFAEQE TTVGIVRYAP
FNALAILVGS QVGRGGVLTQ CAVEEATELD LGMRGLTSYA ETVSVYGTES VFTDGDDTPW
SKAFLASAYA SRGLKMRFTS GSGSEALMGY SEGRSMLYLE SRCIYITKGA GVQGLQNGAV
SCIGMTGAVP SGIRAVLGEN LIAAMLDIEV ASANDQTFSH SDIRRTARML MQMLPGTDFI
FSGYSAVPNY DNMFAGSNFD AEDFDDYNIL QRDLKVDGGL RPVTEEETIK VRNKAAKCIQ
IIFRELGFPE VTDEEVEAAT YCHGSKEMPN RNVVEDLKAA EEMLERRITG LDIIKALSKN
GMEDIANNLL NMLKQRVTGD YLQTSAILDK DFNVISAVND VNDYMGPGTG YRLDGQRWEE
IKKVPTVMRP EDIE
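
The gene and protein sequences above can be cross-checked against each other with a short script. The sketch below is illustrative only: it builds the standard codon assignments (which translation table 11 shares for elongation), treats an initiating ATG, GTG, or TTG as methionine (an assumption covering the common bacterial start codons, not the full table 11 start set), and stops at the first stop codon. The helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        # An alternative start codon is read as methionine at position 0 only.
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence; GTG is the start codon.
print(translate_cds("GTGAAATCTA AAAGATTCCA AGTATTATCA"))  # MKSKRFQVLS
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence (after removing its spaces) gives a quick consistency check, and `gc_percent` on the same input can be compared with the GC content field.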