Gene Cthe_1325 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1325 
Symbol 
ID4809465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1608211 
End bp1609353 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content39% 
IMG OID640106749 
Productcoproporphyrinogen III oxidase, anaerobic 
Protein accessionYP_001037750 
Protein GI125973840 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGATG TAAAGGAACT GGGATTGTAT ATTCATGTAC CGTTTTGCAA AGCAAAGTGT 
TTTTACTGTG ATTTTAATTC CTTTGCCTGC AGGGATGATT TTGTCCCTGC ATATTTTAAT
GCTCTTAAAA AGGAAATTTC AGCCTATTCG GACATCATCA AAGGATACAG GATAAAGACG
GTATTTTTTG GAGGGGGAAC ACCTTCCTAC GTTGGAGCGC ATAATATATA TGAAATTGTA
TCGCTTTTAA AGCAAAAGTT TGACATGGAC GGCTGTTTTG AACTTACCAT TGAAGCCAAT
CCCGGAACTC TGGACGAAGA AAAGCTTTTG GTGTACAGGG ATGCGGGAAT AAACCGGCTG
AGCATCGGGC TTCAGGCATG GCAGAATCAC CTGCTTGAAA GCCTGGGGAG AATTCATACT
GTGGAAGAGT TTGAAGAAAA TTATCATCTT GCCGTAAAAA CTGGATTTGA CAACATAAAT
GTGGATCTTA TTTTTGCAAT TCCGGGGCAG AGTTTTGAGG ATTGGGCGGA GACAATAAAC
AAAGTGGCAG AACTTAATCC CCGGCATATT TCCTGCTACA GCCTTATAAT TGAAGAGGAT
ACGGTTTTCG GAGCAAAGTT TTCCAAAGGG GAGCTTTCTT CTGTGGAAGA CGAACTTGAC
AGGAAAATGT ACTGGTATGC AGTGGAAAAA TTAAGGACCG TGGGATATAA GCACTATGAA
ATATCGAATT TTTCAAAAGA AGGATTTGAG TGCGCCCACA ATTTGATTTA CTGGAAGGAA
CAAGAATACA TAGGTTTCGG TGCCGGAGCC CATTCTTATT TTAACGGTCA AAGATTCAAC
AATACTTATA ATATTGAGGA ATATGTGAAA ATAATAAATT CCGGAAGACT TCCGGTGGAA
AATAAGATTG CCATAGGAAG GAAAGATGAA ATTTCCGAAT TTATGATGCT GGGATTAAGA
CTTGTTGAAG GGGTGAGTAT CAGGGAATTT TACGAAAGGT TTGGAGAGAA TGTTTTGGAG
GTTTTCAAAG AGCAGATAAA AAATTTGTCC AAAAGAGGAC TTGTTGCGGT TGAAAATGGT
TTTATAAAAC TTACCCGATT GGGCTTGGAT CTTGCAAATC AAGCCTTTAT GGAGTTTGTG
TGA
 
Protein sequence
MFDVKELGLY IHVPFCKAKC FYCDFNSFAC RDDFVPAYFN ALKKEISAYS DIIKGYRIKT 
VFFGGGTPSY VGAHNIYEIV SLLKQKFDMD GCFELTIEAN PGTLDEEKLL VYRDAGINRL
SIGLQAWQNH LLESLGRIHT VEEFEENYHL AVKTGFDNIN VDLIFAIPGQ SFEDWAETIN
KVAELNPRHI SCYSLIIEED TVFGAKFSKG ELSSVEDELD RKMYWYAVEK LRTVGYKHYE
ISNFSKEGFE CAHNLIYWKE QEYIGFGAGA HSYFNGQRFN NTYNIEEYVK IINSGRLPVE
NKIAIGRKDE ISEFMMLGLR LVEGVSIREF YERFGENVLE VFKEQIKNLS KRGLVAVENG
FIKLTRLGLD LANQAFMEFV