Gene Information |
Locus tag | Cthe_0683 |
Symbol | |
ID | 4810301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 840619 |
End bp | 841932 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106100 |
Product | diaminopimelate decarboxylase |
Protein accession | YP_001037111 |
Protein GI | 125973201 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0019] Diaminopimelate decarboxylase |
TIGRFAM ID | [TIGR01048] diaminopimelate decarboxylase |
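The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the reported gene length of 1314 bp is consistent with that reading) and that the protein length excludes the stop codon. The function name `feature_length` is illustrative and is not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
start_bp, end_bp = 840619, 841932

gene_length = feature_length(start_bp, end_bp)

# A CDS should be a whole number of codons; the last codon is the stop codon,
# which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1314
print(remainder)       # 0
print(protein_length)  # 437
```

Both derived values agree with the Gene Length (1314 bp) and Protein Length (437 aa) rows of the table.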
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000143238 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGTTT CAAAAGCTTT AAAGGTTAAC AGTAAAAATC ACCTGGAAAT CGGCGGATGT GATTGCGTTG ATCTTGTAAA CAATTTCGGT ACTCCCTTGT ATGTAATGGA TGAAAGTCTT ATAAGGGAAA ATTGCCGTAT ATATAAAAAT GCACTGGACA AGTATTATAA CGGAAACGGA CTGGTACTTT ACGCCAGCAA GGCTTTCTGT ACAATGGCAA TGTGCAAAAT TGTCCAGCAG GAAGGCCTGG GTCTTGACGT GGTATCGGGC GGAGAGCTGT ACACCGCGAT TAAAGCGGGA TTTCCCATGG AAAAGGTGTA TTTTCACGGA AACAATAAAA CCATTGACGA ACTGGAGCTG GCGATTGACA ACAATGTAAG AAGAATAGTA GTGGATAATA GGCAGGAACT TTTGCATGTA AACAGAATTG CAGCAGAAAA AGGCAAGACA GTAAACATTT CTTTCAGAAT AAAACCCGGA ATTGATGCTC ATACTCATGA CTTTATCCGG ACAGGTCAGA TTGACTCAAA ATTTGGTGTT GCCCTTGAAA ACGGTGAGGC AATGGAAATA ATAGGCGAAG CGGTGAAACT GAGCAATGTG AAGGTGGTTG GACTTCATTG CCACATAGGC TCTCAAATTT TCGAGCTTGC TCCTTTTGAG GAAGCTGCAA GGGTGATGCT TACCTTTATT GCAAAAATAA AGGAAGAGCT GGGTATAGAA ATTGAGGAGC TGAACCTTGG AGGAGGCTTT GGGATAAAAT ATACCCAGGA TGACGACCCG ATAGAGTATG ACCGTTATAT AAAATCAGTA TCGGAAGTTG TGAAAAGTGT GTGCGAAGAC AAGGGAATAA AGCTTCCGTT TATAGTTATA GAGCCGGGAA GGTCCATTGT TGCATCTGCG GGAATAACGC TCTACAGAAT TGGCACTATA AAAGATATCA AGGGTGTCAG AAAATATATC GCCGTTGACG GCGGAATGAC CGACAACCCA AGATATGCCC TCTATCAGTC AAAATATGAA GGTGTTATTG CCAATAAAGC TGATGCGGCA AAAACAGAAA AGGTTACAAT TGCGGGCAAG TGCTGTGAAT CCGGGGACCT GCTTGGCAAG GACGTATTGC TTCCCGAAGC GGAGGAAGGG GATATTTTGG CAATACTTGC TACCGGTGCA TACAACTATT CCATGTCCAG TAACTACAAC CGTATTCCAA GACCTGCGGT GGTTCTTGTA AAAGACGGTA AAGCGCGGGT TATTGTTAAA AGGGAAGACT ATAACGATAT AATAAGAAAC GATATTATCC CTGAAGATCT GTAA
Protein sequence | MFVSKALKVN SKNHLEIGGC DCVDLVNNFG TPLYVMDESL IRENCRIYKN ALDKYYNGNG LVLYASKAFC TMAMCKIVQQ EGLGLDVVSG GELYTAIKAG FPMEKVYFHG NNKTIDELEL AIDNNVRRIV VDNRQELLHV NRIAAEKGKT VNISFRIKPG IDAHTHDFIR TGQIDSKFGV ALENGEAMEI IGEAVKLSNV KVVGLHCHIG SQIFELAPFE EAARVMLTFI AKIKEELGIE IEELNLGGGF GIKYTQDDDP IEYDRYIKSV SEVVKSVCED KGIKLPFIVI EPGRSIVASA GITLYRIGTI KDIKGVRKYI AVDGGMTDNP RYALYQSKYE GVIANKADAA KTEKVTIAGK CCESGDLLGK DVLLPEAEEG DILAILATGA YNYSMSSNYN RIPRPAVVLV KDGKARVIVK REDYNDIIRN DIIPEDL
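The gene and protein sequences are displayed in space-separated blocks of ten characters. As a minimal sketch of how the two rows relate, the code below strips that spacing, computes GC content, and translates a nucleotide sequence with the codon assignments of translation table 11 (which, for elongation codons, match the standard genetic code). The helper names `clean`, `gc_content` and `translate` are hypothetical. The sketch does not handle alternative start codons such as GTG or TTG, which table 11 permits as initiators; this gene starts with ATG, so that simplification does not matter here.

```python
from itertools import product

# NCBI ordering of bases and amino acids for translation table 11
# (identical to the standard code for codon-to-amino-acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing (blocks of ten) and normalise case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
prefix = "ATGTTTGTTT CAAAAGCTTT AAAGGTTAAC"
print(translate(prefix))  # MFVSKALKVN
```

The translated prefix matches the first block of the protein sequence. Passing the full Gene sequence value to `translate` and `gc_content` would allow the whole record to be compared against the Protein sequence and GC content rows.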