Gene Information |
Locus tag | Cthe_1796 |
Symbol | |
ID | 4810041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2120816 |
End bp | 2121913 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107210 |
Product | prephenate dehydrogenase |
Protein accession | YP_001038210 |
Protein GI | 125974300 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0287] Prephenate dehydrogenase |
TIGRFAM ID | |
|
|
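The coordinates and lengths reported in the gene information above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode a residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Cthe_1796.
gene_len = gene_length_bp(2120816, 2121913)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1098 365
```

Both results agree with the "Gene Length" and "Protein Length" rows of the record.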
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGTTG AAAAGATATC CATTATCGGG CTTGGACTTA TCGGCGGGTC GCTGGCGAAA GCTTTGAAAG AAAAGCTTGG CATTGAGTCA ATAACCGCCG TTGACATCAA TGAGAAAAGT CTGAGCCAGG CTCTTAAAGA GGGTTTTATA AAAGAAGGTT TTACCGAACT TAACGAATCG GTATATAATT CTGACATTAT TTTCATATGT ACACCGGTTA AGGATGCTGT TGAGTATATA ACCCGACTGC ACGGCAAAGT GAAAGCCGGA TGCATCCTGA CGGACACAGC AAGTACAAAG GGGGAAATTA TAGATTATGT AAATTCATTG GATAATCCCC CCTGCTTCAT AGGGGGGCAT CCAATGGCCG GTACTGAGAA GGCAGGTTTT TCATCAAGTT TTTCACATTT GTTTGAAAAT GCGTACTATA TAATGTCGCC TTCAAAAAAT TGCCCCGAAG AATCCCTTGA GTACTTGGCA GAAATAATCA GAGGAATCGG CGCAATACCG ATAAAGCTTG ACTCCAAAGA ACACGATATT ATCACCGCAA CCATAAGCCA TGTACCGCAT GTAATTGCTT CCGCCCTGGT AAACCTTGTG AAATTCTCTG ATTCCCCCGA CGGCAAAATG CAAACTTTGG CAGCAGGAGG ATTTAAGGAT ATAACAAGAA TTGCATCATC AAACCCTAAG ATGTGGGAAA ATATTATTCT CAGCAACAAG GAAATAGTTA AATCGACTTT GAATAAATTT ACCGAGACAA TAAACACTTT TATTGAATAT ATTGATAACG AAAATTCCAA CGGCATATAC AATTTTTTCG ATTCTGCAAA AAAGTTTCGT GATTCCATTC CAAACAACAG GAAAGGACTC ATTGAACCGC AGAACGAGCT TATTGTAGAT GTTGTTGACA AGCCCGGCAT CATCGGTGAA ATAGCAACCA TTCTCGGAAA CAACGGTATT AATATAAAAA ACATTAATGT TTCCAACAGC CGGGAGTTTG AGCAGGGGTG TCTCAGAATC ACACTGCCCG ATTCAGGCAG TGTGGCCGAG GCTTATGAAC TGCTCGCAAA AAAGGGTTAT AAAGTGTTTA AAATTTGA
|
Protein sequence | MQVEKISIIG LGLIGGSLAK ALKEKLGIES ITAVDINEKS LSQALKEGFI KEGFTELNES VYNSDIIFIC TPVKDAVEYI TRLHGKVKAG CILTDTASTK GEIIDYVNSL DNPPCFIGGH PMAGTEKAGF SSSFSHLFEN AYYIMSPSKN CPEESLEYLA EIIRGIGAIP IKLDSKEHDI ITATISHVPH VIASALVNLV KFSDSPDGKM QTLAAGGFKD ITRIASSNPK MWENIILSNK EIVKSTLNKF TETINTFIEY IDNENSNGIY NFFDSAKKFR DSIPNNRKGL IEPQNELIVD VVDKPGIIGE IATILGNNGI NIKNINVSNS REFEQGCLRI TLPDSGSVAE AYELLAKKGY KVFKI
|
| |
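The sequences above are printed in blocks of ten characters, so they need a small amount of cleanup before use. The following is a minimal sketch, assuming the spaces are display formatting only. The helper names are illustrative, and the codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 (the two differ in the start codons they permit, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def normalize(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    dna = normalize(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    dna = normalize(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGCAGGTTG AAAAGATATC CATTATCGGG"
print(translate(prefix))    # MQVEKISIIG
print(gc_fraction(prefix))  # 0.4
```

Passing the full "Gene sequence" value to `translate` and `gc_fraction` is one way to compare the result against the "Protein sequence" and "GC content" rows of the record.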