Gene CPR_1181 details

Gene Information

Locus tag: CPR_1181
Symbol: tdcB
ID: 4205420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1330239
End bp: 1331447
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 30%
IMG OID: 642565737
Product: threonine dehydratase
Protein accession: YP_698503
Protein GI: 110802155
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01127] threonine dehydratase, medium form
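
The coordinates and lengths listed above agree with each other, which can be checked with a short calculation. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive (an assumption, though one consistent with the listed gene length); the variable names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 1330239
end_bp = 1331447

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is the stop codon and encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1209
print(protein_length)  # 402
```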


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000071198
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATTATTT TGTATTTAGA AAAAATTATT AAAGCCAAAA AAAATATAGA AGATGTAGTC 
ATAAAAACAC CTTTAATATA TAGCGAGGTC TTTTCAAGGA AATCTGGAAA CCAAGTGTAT
ATGAAATGTG AAAATTTACA ATTAACAGGT GCTTACAAAA TAAGAGGTGC TTTAAATAAA
ATAAGATCTT TATCAGATGA AGAAAAATCA AAAGGTGTTG TTTGTTCTTC TGCCGGAAAT
CATGCTCAAG GCGTAGCTTT TGCAGCATCA CAAGCTAATG TTAAATCAAC TATAGTAATG
CCAAAGACTA CTCCTCTACT AAAAATCCAA TCAACAAAGG ATTTAGGAGG GAATGTTGTT
TTATCAGGTT ATGTTTATGA TGATGCTTTT AATGAGGCTA AAAGAATTGA ACAAGAACAA
GGAGCCTTAT TTATACATCC ATTTAATGAT ATTGATGTAA TTTGTGGACA AGGTACAGTA
GCCTTAGAAA TATTTGAAGA TTTAAATGAT GTAGATATTA TTCTCTGCCC TATAGGTGGC
GGTGGCTTAA TAAGTGGAGT TACCCTAGCT GCTAAGGCTT TAAATCCTAA TGTTAAAGTA
ATTGGAGTAC AGGGTGAAGG TGCAAATGCA ATGGTTAAAA GTTTTAAGGC AGGAGAAATA
ATTGCTTTAG ACGCTGTTGA TACTATTGCT GATGGAATTG CAGTAAAAAG ACCTGGTGAT
TTAACATTTA AATTTATAAA AGAATATGTT GATGATATAA TAACTGTATC AGATCATGAA
ATTGTTGAAG CATTCTTTAC ATTAAGTGAA AAACATAAAC TTTTAGCAGA AGCTTCAGGA
GCAGTCTCAT TAGCAGCTTC TGCTAAATTA AATTGTAAAG ATAAAAATAT AGTATCAGTA
ATAAGTGGTG GTAATATAGA TATGGTTACT ATAACTTCAT TAATAAACAG CGCATTAGTA
GCTAAAGGAA GACTTTTTGG ATTTAGTTTA GAAGTTCCTC ATAAACCAGG ACAAATATTG
AAGATTGCTA AAGTTCTTGC TGATACTAAT GCTAATATAG TAAAACTTGA ACATGATCAT
TTTAAAGCAA GGGATGCTCT TAAAAATATG GTTATAGAAG TAACTTTAGA GACAAATGGA
CACTCTCACA TAGAGGAAAT AAAAAAAGCT TTAACAGATC AAAATTATGT AATAAAACAA
ATTTATTAA
 
Protein sequence
MIILYLEKII KAKKNIEDVV IKTPLIYSEV FSRKSGNQVY MKCENLQLTG AYKIRGALNK 
IRSLSDEEKS KGVVCSSAGN HAQGVAFAAS QANVKSTIVM PKTTPLLKIQ STKDLGGNVV
LSGYVYDDAF NEAKRIEQEQ GALFIHPFND IDVICGQGTV ALEIFEDLND VDIILCPIGG
GGLISGVTLA AKALNPNVKV IGVQGEGANA MVKSFKAGEI IALDAVDTIA DGIAVKRPGD
LTFKFIKEYV DDIITVSDHE IVEAFFTLSE KHKLLAEASG AVSLAASAKL NCKDKNIVSV
ISGGNIDMVT ITSLINSALV AKGRLFGFSL EVPHKPGQIL KIAKVLADTN ANIVKLEHDH
FKARDALKNM VIEVTLETNG HSHIEEIKKA LTDQNYVIKQ IY
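
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons and is read as methionine at the initiation position. The sketch below illustrates this on the first ten codons of the gene sequence. The function name `translate` and the way the record is handled are hypothetical, not part of the source page; the amino-acid assignments and start-codon set follow NCBI genetic code table 11.

```python
# Compact codon table: codons enumerated in TCAG order.
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons permitted by table 11; any of these is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence (hypothetical helper), stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and newlines used in the listing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the gene sequence listed above.
print(translate("GTGATTATTT TGTATTTAGA AAAAATTATT"))  # MIILYLEKII
```

The output matches the first block of the protein sequence. Applying the same function to the full gene sequence would be the natural next check.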