Gene Cthe_0880 details


Gene Information

Locus tag: Cthe_0880
Symbol:
ID: 4810498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1056027
End bp: 1057043
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 46%
IMG OID: 640106296
Product: 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase
Protein accession: YP_001037307
Protein GI: 125973397
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
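
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1056027
END_BP = 1057043
GENE_LENGTH_BP = 1017
PROTEIN_LENGTH_AA = 338


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1057043 - 1056027 + 1 gives 1017 bp, and 1017 / 3 - 1 gives 338 amino acids, which agrees with the listed gene and protein lengths.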


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000574615
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTATCG TTATGAGTCC AAATGCTACA AAAGAGCAGA TTGAAAATGT GGAAAAAAAA 
CTTTTGGAGC TGGGTTTTAA AACTCATCCC ATAGTCGGAG ACGTAAAAAC GGTAATTGGG
GCTATCGGAG ACAAAAGACT TCTCAATACC CACTCCATAT CCACCATGCC CGGAGTTGAA
AGCATTGTTC CAATCATGAA ACCTTACAAG CTGGCCAGCA AAGAACTAAA GCAGGAACCA
ACCATTGTTG AGGTAGGCGA TGTACGAATT GGTGGCAATG AAGTAGTGGT TATGGCCGGC
CCCTGTGCAA TTGAAAACGA AGAAATTTAT GTCGAAACAG CCAAAAAGGT TAAAGAGGCA
GGAGCCAAAA TACTCCGCGG CGGTGCTTTC AAGCCCCGTA CATCTCCTTA TTCTTTCCAA
GGTTTGGAAG AAGAAGGCCT CAAAATAATG GCCATTGCCC GGGAAGTAAC GGGACTTAAG
CTTGTCACCG AAGTTGTGGA CACAAGAGAT GTGGAACTTG TCGCATCTTA TACAGACATC
ATCCAAATCG GTGCAAGAAA CATGCAAAAC TTCAGGCTGC TTAAAGAGGT CGGAATGTCC
AATAAGCCCG TACTCCTAAA AAGAGGACTG GCTGCAACCA TTGAAGAATG GTTAATGGCC
GCCGAATATA TTATTTCCGA GGGTAATCCC AATGTAATAC TTTGCGAACG AGGCATCCGA
ACCTTCGAGA CAGCCACAAG GAACACCATT GACATGAGCG CCATTCCGGT AATAAAAGAG
CTGTCCCATT TGCCGATAGT GCTTGACCCC AGCCATGCGG CAGGTACCTG GAAATATGTT
GAGCCTCTTG CAAAAGGCGC AATAGCAACC GGAGCCGACG GTTTAATCAT TGAAGTCCAC
AGCCAGCCTG ACTGTGCTCT CTGTGACGGT CAACAGTCTT TGATACCTTC AAGGTTCGAA
CAGCTTATGA AGGATCTTGA GCCTATAGCT CTTGCAGTGG GAAGAAAACT ATTGTAA
 
Protein sequence
MIIVMSPNAT KEQIENVEKK LLELGFKTHP IVGDVKTVIG AIGDKRLLNT HSISTMPGVE 
SIVPIMKPYK LASKELKQEP TIVEVGDVRI GGNEVVVMAG PCAIENEEIY VETAKKVKEA
GAKILRGGAF KPRTSPYSFQ GLEEEGLKIM AIAREVTGLK LVTEVVDTRD VELVASYTDI
IQIGARNMQN FRLLKEVGMS NKPVLLKRGL AATIEEWLMA AEYIISEGNP NVILCERGIR
TFETATRNTI DMSAIPVIKE LSHLPIVLDP SHAAGTWKYV EPLAKGAIAT GADGLIIEVH
SQPDCALCDG QQSLIPSRFE QLMKDLEPIA LAVGRKLL
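
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It assumes the standard codon assignments, which translation table 11 shares for elongation codons (table 11 additionally permits alternative start codons, which does not matter here because this gene begins with ATG). All function names are illustrative.

```python
# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = clean_sequence("ATGATTATCG TTATGAGTCC AAATGCTACA")
    print(translate(first_blocks))  # MIIVMSPNAT
```

Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_fraction` gives values that can be compared with the listed protein sequence and GC content.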