Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0420 |
Symbol | |
ID | 4808423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 528909 |
End bp | 529781 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640105834 |
Product | dipicolinate synthase subunit A |
Protein accession | YP_001036851 |
Protein GI | 125972941 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0373] Glutamyl-tRNA reductase |
TIGRFAM ID | [TIGR02853] dipicolinic acid synthetase, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAATA AAAAATTTAC AATTATCGGC GGTGATCTTA GGAGTGTCAA GCTGGCTGAG TTGATTGTAG CGGAAGGGAA TAAAGTTAAC ATATATGGGT TTAAGAATGC TAACTTCGAG ATAGGTATTG AAGAAAGCGA AGATCTTGAC CACGCTATTG GGGAGGCGGA TGTAATAGTA GGTCCTACTC CATGTTCAAC AGATAATGAA ACAATTAATA CACCTTTTCA TTCTCAAAAA ATTTTTATTA AAGACATATT CAAAAAGATG AATAAAAGCC AGTTGTTTAT TGCCGGAAGA ATAACCGATA AGATTGCCCA GCTTGCAGAT GTCTACAACG TATATTCAGT TGATTTGCTT GAAAGAGAAG AAATGGCGGT TTTAAATGCC ATTCCGACTG CAGAAGGGGC AATTCAAATT GCAATGGAGG AAATGCCTAT AACTCTTCAT GGAAGCAATG CATTAATTTT AGGGTTCGGA AGAATTGGAA AGATACTTGC AAAAATGCTC CATGGAATTG GAAGCAACGT GTATGTCGAG GCCAGAAAAT ATTCCGACCT GGCTTGGATT GAAAGTTATG GATACAAACC GGTTTTCATA ACTGAGCTTG AAAGTTACAT AGACAGGGCA AATGTTATTT TTAATACTAT TCCCAGCATT GTACTGGATG AAAATTTGCT CAGGAAGGTA AACAAGGATT GCCTTTTGAT AGATCTTGCT TCCAAGCCTG GCGGAATTGA CTTTGAAAAA GCAAAAGAGA TGGGATTAAA AACAATATGG GCTCTTTCTC TTCCGGGAAA AGTAGCACCG GTGACAGCAG CAAAATTCAT AAAAGACACT ATTAGCAATA TTATTGATGA GTTGGGGGTA TAA
|
Protein sequence | MKNKKFTIIG GDLRSVKLAE LIVAEGNKVN IYGFKNANFE IGIEESEDLD HAIGEADVIV GPTPCSTDNE TINTPFHSQK IFIKDIFKKM NKSQLFIAGR ITDKIAQLAD VYNVYSVDLL EREEMAVLNA IPTAEGAIQI AMEEMPITLH GSNALILGFG RIGKILAKML HGIGSNVYVE ARKYSDLAWI ESYGYKPVFI TELESYIDRA NVIFNTIPSI VLDENLLRKV NKDCLLIDLA SKPGGIDFEK AKEMGLKTIW ALSLPGKVAP VTAAKFIKDT ISNIIDELGV
|
| |