Gene Cthe_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2149 
Symbol 
ID4811197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2554448 
End bp2555737 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content39% 
IMG OID640107553 
Producthypothetical protein 
Protein accessionYP_001038545 
Protein GI125974635 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.226037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATAT GTGTTGTCGG CGCGGGAGCC ACAGGACTTG TTGCTGCAAA TGAACTTGTC 
AAAAAGGGCT GTAAAGTTTC CGTATTTGAG GCTGAGAATC AGCATGGCGG GCTGGTAAGG
ACCGTAGAAG TAGGCAATGA AAAACTGGAA GTATTTTATC ACCATATATT TACCAATGAT
GTCGAAATAA TTAAACTGAT TGAAGAATTG AATCTGTCTT CCGAGCTTAT GTGGCTTGAG
CCAAAAAATG CCATATATAT TAACCGCAAG CTTTATCCTT TTACTTCTCC GATAGATTTG
CTTCTTTTTA AGGAGCTTTC GTTTATCGAC AGGATAAGAA TGGGGCTGCT TGTCTTTAAG
GCAAAGTTTC TAAAAGACTG GATGGAGTTG GAAAACATCA GCTCCAGGGA CTGGATAATC
AAAAACGCGG GCAAGGATGT GTACGAAAAA GTATGGGGGC CGCTGCTGGT TTCGAAGTTT
GATTATGACG CTGATAAAAT TTCGGGTACC TGGTTGTGGA ACAAATTCAA ACTCAGGGGC
TCCACAAGAG GAAAAAATAT CAATAAAGAA CTGCTGGGAT ATATGAAAGG CAGTTTCGGG
ATTATATATG ACAAATTGGT GGAAAGAATA ATCGATGCCG GAGGGGAAAT ACATTACTCA
AGCCCTGTGG ACAGAATTGA ACCTCAAAAA GATAAAACCC TGAATGTCCA TAGTAACGGA
AAAGTATATA ATTTTGATCG GGTTATTGTT ACAACTTCAC CGGAAATCTT CGGCAAAATG
AATGTTCCTC TTCCGGAAGA ATATAGTGAA AAGCTTTCAA AAGTAAAGTA CAAAGCTAAT
ATTTGCATGA TTCTGGAGCT TTCGGAGAAG TTGTCGGATT ACTATTGGGT TACGATTGCG
GAAAAAGATT TTCCGTTTGT ACTTTTGATA GAACATACCA ACTTGGTTGC CGACAATGAT
TATAAGTCAC ATGTTGTCTA TCTTTCAAGG TATTTGGACA AAAAGAACGA GTTTTATTCT
CTAACCGACG AGGAAATTCA GAGGGAGTTT GTAAAATACC TGAAAATCAT GTTCCCAAAT
TGGGATGAAT CAAAGATAAA ACGGGTTCAT ATCAACAGGA CGGATTACGC ACAACCGGTT
ATTGTACAGC AATATTCAAA GATTTTACCG GAAATTGCCA CTCCTGTGGA GAACCTGTAT
TTGGCTTCTA TGGCCCAAAT ATATCCGGAG GACAGAGGGC AAAATTATTC GGTGAGACTT
GGAAAACAAG TGGCTAATAT GATCAAATAG
 
Protein sequence
MNICVVGAGA TGLVAANELV KKGCKVSVFE AENQHGGLVR TVEVGNEKLE VFYHHIFTND 
VEIIKLIEEL NLSSELMWLE PKNAIYINRK LYPFTSPIDL LLFKELSFID RIRMGLLVFK
AKFLKDWMEL ENISSRDWII KNAGKDVYEK VWGPLLVSKF DYDADKISGT WLWNKFKLRG
STRGKNINKE LLGYMKGSFG IIYDKLVERI IDAGGEIHYS SPVDRIEPQK DKTLNVHSNG
KVYNFDRVIV TTSPEIFGKM NVPLPEEYSE KLSKVKYKAN ICMILELSEK LSDYYWVTIA
EKDFPFVLLI EHTNLVADND YKSHVVYLSR YLDKKNEFYS LTDEEIQREF VKYLKIMFPN
WDESKIKRVH INRTDYAQPV IVQQYSKILP EIATPVENLY LASMAQIYPE DRGQNYSVRL
GKQVANMIK