Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2529 |
Symbol | |
ID | 4809285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2997875 |
End bp | 2998855 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107945 |
Product | delta-aminolevulinic acid dehydratase |
Protein accession | YP_001038924 |
Protein GI | 125975014 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0113] Delta-aminolevulinic acid dehydratase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAATCA GGCCACGCAG ATTGCGGCTC AATCAACATA TAAGAGATAT GGCAAGAGAA GTAAGGCTAA GCAGTAAGTC GCTCATATTG CCGCTTTTCC TTATGGAGGG CAAAAATGTA AAGCTGCCTG TAAACTCTCT TGAAGGCCAT TTTTATTACA GCCCGGATAC AGTTTTTGAA GCAGTGGACG ATGCTCTAAA ATGCGGAGTA AACAAGTTTT TGCTTTTTGG GTTGCCGTCA GAAAAGGATG AGATCGGAAG CCAGAGTTTT AACGAAAACG GAGTCATACA AAAAGCAGTC CGGGCAATTA AGGAGCGCTA CAGGGATGAA GTGCTTGTCA TAACCGATGT CTGCATGTGT GAATATACTT CCCACGGACA TTGCGGAATA TTGAACCATG GATATGTTGA CAATGACAAA ACTCTGGAGT ATCTCGCCAA AATTGCCCTG TCCCATGCAG CGGCCGGGGC GGACATGGTG GCACCGTCGG ATATGATGGA CGGCAGGGTG GCGGCAATCC GGCAAGAGTT GGACAAACAC AACTTTATCA ATACGCTTAT CATGTCATAT GCCGTAAAAT ATTCGTCATC TTTTTACGGA CCGTTCCGTG ATGCTGCAAA GTCCGCCCCG TCCTTTGGGG ACCGGAAGTC ATACCAAATG GACTATCACA ACAAAAAAGA GGCTGTAAAG GAAGCCTTGA CGGATATAAA CGAAGGTGCG GATATTCTAA TGGTAAAACC GGCATTGGCT TATTTGGACG TTATCAGCGA AGTTTCCAAA CACTCCAGCC TGCCCACGGC GGCATACAGT GTCAGCGGAG AGTATGCAAT GATAAAAGCA GCTGCGAAAA TGAATCTGAT AGATGAATAC CCAACGATGT GCGAAACTGC AGTATCTATC TTTAGAGCCG GAGCGGATAT TTTAATCTCA TACTACGCCA AAGAGATAGC AGAAGCAATT AACAGGGGGG ACATAGGATA A
|
Protein sequence | MLIRPRRLRL NQHIRDMARE VRLSSKSLIL PLFLMEGKNV KLPVNSLEGH FYYSPDTVFE AVDDALKCGV NKFLLFGLPS EKDEIGSQSF NENGVIQKAV RAIKERYRDE VLVITDVCMC EYTSHGHCGI LNHGYVDNDK TLEYLAKIAL SHAAAGADMV APSDMMDGRV AAIRQELDKH NFINTLIMSY AVKYSSSFYG PFRDAAKSAP SFGDRKSYQM DYHNKKEAVK EALTDINEGA DILMVKPALA YLDVISEVSK HSSLPTAAYS VSGEYAMIKA AAKMNLIDEY PTMCETAVSI FRAGADILIS YYAKEIAEAI NRGDIG
|
| |