Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2149 |
Symbol | |
ID | 4811197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2554448 |
End bp | 2555737 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107553 |
Product | hypothetical protein |
Protein accession | YP_001038545 |
Protein GI | 125974635 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1232] Protoporphyrinogen oxidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.226037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATAT GTGTTGTCGG CGCGGGAGCC ACAGGACTTG TTGCTGCAAA TGAACTTGTC AAAAAGGGCT GTAAAGTTTC CGTATTTGAG GCTGAGAATC AGCATGGCGG GCTGGTAAGG ACCGTAGAAG TAGGCAATGA AAAACTGGAA GTATTTTATC ACCATATATT TACCAATGAT GTCGAAATAA TTAAACTGAT TGAAGAATTG AATCTGTCTT CCGAGCTTAT GTGGCTTGAG CCAAAAAATG CCATATATAT TAACCGCAAG CTTTATCCTT TTACTTCTCC GATAGATTTG CTTCTTTTTA AGGAGCTTTC GTTTATCGAC AGGATAAGAA TGGGGCTGCT TGTCTTTAAG GCAAAGTTTC TAAAAGACTG GATGGAGTTG GAAAACATCA GCTCCAGGGA CTGGATAATC AAAAACGCGG GCAAGGATGT GTACGAAAAA GTATGGGGGC CGCTGCTGGT TTCGAAGTTT GATTATGACG CTGATAAAAT TTCGGGTACC TGGTTGTGGA ACAAATTCAA ACTCAGGGGC TCCACAAGAG GAAAAAATAT CAATAAAGAA CTGCTGGGAT ATATGAAAGG CAGTTTCGGG ATTATATATG ACAAATTGGT GGAAAGAATA ATCGATGCCG GAGGGGAAAT ACATTACTCA AGCCCTGTGG ACAGAATTGA ACCTCAAAAA GATAAAACCC TGAATGTCCA TAGTAACGGA AAAGTATATA ATTTTGATCG GGTTATTGTT ACAACTTCAC CGGAAATCTT CGGCAAAATG AATGTTCCTC TTCCGGAAGA ATATAGTGAA AAGCTTTCAA AAGTAAAGTA CAAAGCTAAT ATTTGCATGA TTCTGGAGCT TTCGGAGAAG TTGTCGGATT ACTATTGGGT TACGATTGCG GAAAAAGATT TTCCGTTTGT ACTTTTGATA GAACATACCA ACTTGGTTGC CGACAATGAT TATAAGTCAC ATGTTGTCTA TCTTTCAAGG TATTTGGACA AAAAGAACGA GTTTTATTCT CTAACCGACG AGGAAATTCA GAGGGAGTTT GTAAAATACC TGAAAATCAT GTTCCCAAAT TGGGATGAAT CAAAGATAAA ACGGGTTCAT ATCAACAGGA CGGATTACGC ACAACCGGTT ATTGTACAGC AATATTCAAA GATTTTACCG GAAATTGCCA CTCCTGTGGA GAACCTGTAT TTGGCTTCTA TGGCCCAAAT ATATCCGGAG GACAGAGGGC AAAATTATTC GGTGAGACTT GGAAAACAAG TGGCTAATAT GATCAAATAG
|
Protein sequence | MNICVVGAGA TGLVAANELV KKGCKVSVFE AENQHGGLVR TVEVGNEKLE VFYHHIFTND VEIIKLIEEL NLSSELMWLE PKNAIYINRK LYPFTSPIDL LLFKELSFID RIRMGLLVFK AKFLKDWMEL ENISSRDWII KNAGKDVYEK VWGPLLVSKF DYDADKISGT WLWNKFKLRG STRGKNINKE LLGYMKGSFG IIYDKLVERI IDAGGEIHYS SPVDRIEPQK DKTLNVHSNG KVYNFDRVIV TTSPEIFGKM NVPLPEEYSE KLSKVKYKAN ICMILELSEK LSDYYWVTIA EKDFPFVLLI EHTNLVADND YKSHVVYLSR YLDKKNEFYS LTDEEIQREF VKYLKIMFPN WDESKIKRVH INRTDYAQPV IVQQYSKILP EIATPVENLY LASMAQIYPE DRGQNYSVRL GKQVANMIK
|
| |