Gene Cthe_2529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2529 
Symbol 
ID4809285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2997875 
End bp2998855 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content46% 
IMG OID640107945 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_001038924 
Protein GI125975014 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAATCA GGCCACGCAG ATTGCGGCTC AATCAACATA TAAGAGATAT GGCAAGAGAA 
GTAAGGCTAA GCAGTAAGTC GCTCATATTG CCGCTTTTCC TTATGGAGGG CAAAAATGTA
AAGCTGCCTG TAAACTCTCT TGAAGGCCAT TTTTATTACA GCCCGGATAC AGTTTTTGAA
GCAGTGGACG ATGCTCTAAA ATGCGGAGTA AACAAGTTTT TGCTTTTTGG GTTGCCGTCA
GAAAAGGATG AGATCGGAAG CCAGAGTTTT AACGAAAACG GAGTCATACA AAAAGCAGTC
CGGGCAATTA AGGAGCGCTA CAGGGATGAA GTGCTTGTCA TAACCGATGT CTGCATGTGT
GAATATACTT CCCACGGACA TTGCGGAATA TTGAACCATG GATATGTTGA CAATGACAAA
ACTCTGGAGT ATCTCGCCAA AATTGCCCTG TCCCATGCAG CGGCCGGGGC GGACATGGTG
GCACCGTCGG ATATGATGGA CGGCAGGGTG GCGGCAATCC GGCAAGAGTT GGACAAACAC
AACTTTATCA ATACGCTTAT CATGTCATAT GCCGTAAAAT ATTCGTCATC TTTTTACGGA
CCGTTCCGTG ATGCTGCAAA GTCCGCCCCG TCCTTTGGGG ACCGGAAGTC ATACCAAATG
GACTATCACA ACAAAAAAGA GGCTGTAAAG GAAGCCTTGA CGGATATAAA CGAAGGTGCG
GATATTCTAA TGGTAAAACC GGCATTGGCT TATTTGGACG TTATCAGCGA AGTTTCCAAA
CACTCCAGCC TGCCCACGGC GGCATACAGT GTCAGCGGAG AGTATGCAAT GATAAAAGCA
GCTGCGAAAA TGAATCTGAT AGATGAATAC CCAACGATGT GCGAAACTGC AGTATCTATC
TTTAGAGCCG GAGCGGATAT TTTAATCTCA TACTACGCCA AAGAGATAGC AGAAGCAATT
AACAGGGGGG ACATAGGATA A
 
Protein sequence
MLIRPRRLRL NQHIRDMARE VRLSSKSLIL PLFLMEGKNV KLPVNSLEGH FYYSPDTVFE 
AVDDALKCGV NKFLLFGLPS EKDEIGSQSF NENGVIQKAV RAIKERYRDE VLVITDVCMC
EYTSHGHCGI LNHGYVDNDK TLEYLAKIAL SHAAAGADMV APSDMMDGRV AAIRQELDKH
NFINTLIMSY AVKYSSSFYG PFRDAAKSAP SFGDRKSYQM DYHNKKEAVK EALTDINEGA
DILMVKPALA YLDVISEVSK HSSLPTAAYS VSGEYAMIKA AAKMNLIDEY PTMCETAVSI
FRAGADILIS YYAKEIAEAI NRGDIG