Gene CPF_1173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1173 
Symbol 
ID4203123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1338644 
End bp1340308 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content35% 
IMG OID638082054 
Productcoenzyme B12-dependent glycerol dehydratase, large subunit 
Protein accessionYP_695619 
Protein GI110801202 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4909] Propanediol dehydratase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAATCTA AAAGATTCCA AGTATTATCA GAACGTCCTG TAAACCAAGA TGGACTTATA 
GGAGAGTGGG CTGATGAAGG CTTAATAGCT TTAGATAGTC CAAATGATCC AAAATCATCA
ATAAAAATAG AAAATGGAAT AATTACTGAA TTAGACGGCA GATCAAGAGA TGAGTTTGAT
ATGATAGATA AATTTATAGC AGAGTACGCT ATAAATATAG AAGACGCAGA AGCATCTATG
AAACTTTCAT CTAAAGAAAT AGCAAGAAGA TTAGTTGATA TAAATGTTAG TAGAGATGAA
ATAGTAAAAA TCACTACTTC AATAACACCA ATGAAGGCTG TAGAAGTTAT TCAAGAAATG
AACGTTGTTG AAATGATGAT GGCTCTTCAA AAAATGAGAG CAAGAAGAAC ACCTGCTAAC
CAATGTCACG TTACTAACGT AAAAGACAAC CCAGTTCAAA TAGCAGCAGA TGCTGCAGAG
GCTGCTTTAA GAGGATTTGC AGAGCAAGAA ACTACAGTAG GTATAGTTAG ATATGCACCT
TTTAATGCAT TAGCTATCTT AGTAGGTTCA CAAGTAGGTA GAGGAGGAGT TTTAACTCAA
TGTGCAGTTG AGGAAGCTAC TGAACTTGAC CTAGGAATGA GAGGACTTAC AAGTTATGCA
GAAACAGTTT CAGTTTATGG AACAGAATCA GTATTTACAG ATGGAGATGA TACTCCATGG
TCAAAAGCAT TCTTAGCATC AGCTTATGCT TCAAGAGGAC TTAAGATGAG ATTTACATCA
GGTTCAGGTT CAGAAGCATT AATGGGATAC TCAGAAGGTA GATCAATGCT TTACTTAGAA
TCAAGATGTA TATATATAAC TAAGGGAGCT GGAGTTCAAG GCTTACAAAA TGGTGCAGTT
AGTTGTATAG GTATGACAGG AGCAGTTCCA TCAGGAATAA GAGCAGTTCT TGGAGAAAAC
TTAATAGCTG CAATGCTTGA TATAGAGGTT GCATCAGCAA ATGACCAAAC ATTCTCACAT
TCAGACATAA GAAGAACAGC AAGAATGTTA ATGCAAATGC TTCCAGGAAC AGACTTCATA
TTCTCAGGAT ATAGTGCAGT TCCAAACTAC GATAACATGT TTGCTGGATC AAACTTTGAT
GCAGAAGACT TTGATGATTA CAACATACTT CAAAGAGACT TAAAAGTTGA CGGTGGATTA
AGACCAGTTA CAGAAGAAGA AACTATAAAG GTTAGAAATA AAGCAGCTAA ATGCATACAA
ATAATCTTTA GAGAATTAGG ATTCCCAGAA GTTACTGATG AAGAAGTAGA AGCTGCAACT
TACTGTCATG GAAGTAAGGA AATGCCAAAC AGAAATGTAG TTGAAGACTT AAAAGCTGCA
GAAGAAATGT TAGAAAGAAG AATAACAGGA TTAGATATAA TAAAAGCTTT AAGCAAAAAT
GGTATGGAAG ATATAGCAAA CAACTTATTA AACATGCTTA AGCAAAGAGT TACTGGAGAT
TATCTTCAAA CTTCAGCAAT TTTAGATAAA GATTTCAATG TTATAAGTGC TGTTAATGAT
GTAAATGACT ATATGGGACC TGGAACAGGA TATAGACTAG ATGGTCAAAG ATGGGAAGAA
ATCAAAAAAG TTCCTACAGT AATGAGACCA GAGGATATAG AGTAG
 
Protein sequence
MKSKRFQVLS ERPVNQDGLI GEWADEGLIA LDSPNDPKSS IKIENGIITE LDGRSRDEFD 
MIDKFIAEYA INIEDAEASM KLSSKEIARR LVDINVSRDE IVKITTSITP MKAVEVIQEM
NVVEMMMALQ KMRARRTPAN QCHVTNVKDN PVQIAADAAE AALRGFAEQE TTVGIVRYAP
FNALAILVGS QVGRGGVLTQ CAVEEATELD LGMRGLTSYA ETVSVYGTES VFTDGDDTPW
SKAFLASAYA SRGLKMRFTS GSGSEALMGY SEGRSMLYLE SRCIYITKGA GVQGLQNGAV
SCIGMTGAVP SGIRAVLGEN LIAAMLDIEV ASANDQTFSH SDIRRTARML MQMLPGTDFI
FSGYSAVPNY DNMFAGSNFD AEDFDDYNIL QRDLKVDGGL RPVTEEETIK VRNKAAKCIQ
IIFRELGFPE VTDEEVEAAT YCHGSKEMPN RNVVEDLKAA EEMLERRITG LDIIKALSKN
GMEDIANNLL NMLKQRVTGD YLQTSAILDK DFNVISAVND VNDYMGPGTG YRLDGQRWEE
IKKVPTVMRP EDIE