Gene Cthe_1341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1341 
Symbol 
ID4809481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1632612 
End bp1634129 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content40% 
IMG OID640106765 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_001037766 
Protein GI125973856 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.996503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAT ATGTAAAGTT GGAAGGACAT GAGTTTCGAT ATGAGATAGA GAATATATTA 
AAAATGTTCT TTGAAATGGG GAGTACAGAG ATATCATATC AGGACCCCGG AGAGAATTAC
CGGGGGATTT TGCTGTATTC ACGCCTTGAT ATTCCTTACG GAAGTGACGG GCTGTACCGG
ACAGAAACCG TAATTTGTGT TGACGGGGAG AATGTATTAA AGGAAAATCA CTTCTTTACG
GTCTCAGTAC CTGGGGAAGA TTCAAACTCT TTGCTTGAAG AGAGAAAAAT ACAAAAAAGA
GAGGTTAAAA GAGAGGCATA CAAGGCACTG TCCAAATTTA CAGGGAAAAG TATGCCTTGG
GGAATGCTTA CCGGGATAAG ACCTGCCAAA ATAGTCCATG AACTTATGGA CAAAGGCTGT
TCGAAGGAAG AAATAAACTC TACACTGAAA GAATATTATT TTGTCTCTGA TAAAAAGTCA
GAGATTTTAT ACAACGTTGC CAAAAAAGAA AGGTATATAC TGGATAACAG TGAACAGGAC
ATGGTGGGAG TTTACATTGG CATTCCTTTC TGCACCACCC GCTGCCTTTA CTGCTCTTTT
ACTTCCAATC CGATAAAAAA ATATGAGCAT ATGGTGGAAA GCTATATAAA GGCCCTGAAG
AAGGAAATAA TGAGTGTGGC CGGTATTTTG GAGAAGAAGA AATTAAAAAT AGAGAGCATA
TATATAGGCG GAGGCACACC TACTTCCATT GAAGCTTTGC ATCTTAAAGA ACTTCTTGGT
TTTATTGAGC AGGCATTGAA TTTAAAAGAT TTGAAGGAAT ACTCTTTGGA GGCCGGAAGG
CCTGACTCCA TTACCTGTGA GAAGCTGGAG ATAATAAAAA ACAGCAGGGT GGACAGGATA
AGTATCAATC CTCAGTCCAT GAATGATGAA ATCTTAAAGA AAATTGGGAG GCTTCATACT
TCAAAGGATA TAGTCGAGGC TTTTCAACTT GCCAGAAGCA TGGGCTTTGA CAATATAAAC
ATGGATGTTA TTGCAGGACT TCCGGGAAGC ACTCTTGAGG ACTTTGTAAA AACTATGGAG
GAAATAATTG TTTTAGGACC TGAGAGTGTT ACTGTTCATA CCATGGCAAT CAAGCGTGCG
TCACGGCTTA ATGAAGACAG GGAAAACTAC AGCCTGACCT CGGGAAGCGA AGTGTCCAAA
ATGGTTGATG CGGCTTATGA TATTTTGACC AAAATGGGAC TGGAGCCGTA TTATCTTTAC
AGGCAGAAAA ACATGCTTGG CAATCTCGAA AACATTGGAT ACAGCAAGGC TGGCTATGAG
TCGATATACA ATGTCCAGAT TATGGAAGAA AAGCAGTCAA TTATAGCATT GGGGGCGGGG
GCCGTAACCA AAGTGGTTTT TCCCGAAAGC AACAGGATTG AAAGGGCTTT TAATGTAAAG
AATGTGGAGG AGTATATAAG CCGGATTGAC GAGATGATTG AGAGGAAAAA TGTTCTTTTA
TTTTCCAATG AAGAGTAG
 
Protein sequence
MKVYVKLEGH EFRYEIENIL KMFFEMGSTE ISYQDPGENY RGILLYSRLD IPYGSDGLYR 
TETVICVDGE NVLKENHFFT VSVPGEDSNS LLEERKIQKR EVKREAYKAL SKFTGKSMPW
GMLTGIRPAK IVHELMDKGC SKEEINSTLK EYYFVSDKKS EILYNVAKKE RYILDNSEQD
MVGVYIGIPF CTTRCLYCSF TSNPIKKYEH MVESYIKALK KEIMSVAGIL EKKKLKIESI
YIGGGTPTSI EALHLKELLG FIEQALNLKD LKEYSLEAGR PDSITCEKLE IIKNSRVDRI
SINPQSMNDE ILKKIGRLHT SKDIVEAFQL ARSMGFDNIN MDVIAGLPGS TLEDFVKTME
EIIVLGPESV TVHTMAIKRA SRLNEDRENY SLTSGSEVSK MVDAAYDILT KMGLEPYYLY
RQKNMLGNLE NIGYSKAGYE SIYNVQIMEE KQSIIALGAG AVTKVVFPES NRIERAFNVK
NVEEYISRID EMIERKNVLL FSNEE