Gene Information |
Locus tag | Cthe_0961 |
Symbol | |
ID | 4811254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1151063 |
End bp | 1152061 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106380 |
Product | aspartate semialdehyde dehydrogenase |
Protein accession | YP_001037388 |
Protein GI | 125973478 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0136] Aspartate-semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01296] aspartate-semialdehyde dehydrogenase (peptidoglycan organisms) |
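The length fields in this record are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions, and if the protein length excludes the terminating stop codon. The following is a minimal Python sketch of that arithmetic under those assumptions; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1151063, 1152061)
print(length_bp)                  # 999
print(protein_length(length_bp))  # 332
```

Both values agree with the Gene Length and Protein Length fields listed for Cthe_0961.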
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG TTAATTTAGC AGTAGTCGGA GCCACCGGAA TGGTCGGCAG AACATTTATC AAAGTGCTGG AAGAAAGAAA CTTGCCTATC GATAACATAT ATTTCTTTTC TTCAAGTCGC TCTGCTGGTT CGACTGTAAC TTTCAACAAC AAGGAATATG TGGTGGAAGA GCTCACCGAG ACATCCTTTG ACAGAGGAAT AGACATAGCA CTGTTCTCCG CTGGAGCAAG TGTCAGTGAA AAATTTGCAC CAATAGCGGC TTCAAAAGGC TGTGTGGTTG TGGACAACAG CAGCTGCTGG AGAATGAACG AAAAAGTTCC CCTTGTAGTA CCGGAAGTCA ATCCCCAGGA CATTTCCTGT CACCAGGGCA TCATAGCCAA TCCAAACTGC TCCACCATAC AGGCGGTTGT TGTGCTAAAG CCCTTACATG ACAAGTATAA AATAAAAAGA ATAGTTTATT CCACATACCA GGCGGTATCA GGAGCCGGGC ACAACGGATA CATGGACCTC GAAAACGGTC TTAAGGGAGA ACCTCCAAAG AAGTTCCCTC ATCCGATAGC CGGCAACTGC CTGCCTCATA TTGACTCTTT CCTCCCTAAC GGCTATACAA AGGAAGAAAT GAAAATGGTA AATGAGACAA GAAAGATATT GGGAGACTAC AGCATCAGAA TTACCGCCAC AACCGTAAGA GTGCCCGTAT TCAACGGTCA CAGTGAATCA ATAAACGTTG AGTTTGAAAA ACAGTTTGAC CTTGAAGAAC TCAAAGAAGT TTTAAGAAAT GCACCGGGCG TCGTTGTTCA GGATGACGTT GCCAACAATG TTTATCCGAT GCCTATTTAC GCAAGCGGAA GAGATGAAAC TTTTGTAGGA AGAATTCGTC GCGATGAAAG TGTTGAAAGC GGCGTAAACC TCTGGGTTGT TGCGGACAAT ATAAGAAAAG GCGCTGCCAC AAACGCCGTT CAAATTGCCG AAGAACTGAT TAAAATGTGG AATAAATAA
|
Protein sequence | MKKVNLAVVG ATGMVGRTFI KVLEERNLPI DNIYFFSSSR SAGSTVTFNN KEYVVEELTE TSFDRGIDIA LFSAGASVSE KFAPIAASKG CVVVDNSSCW RMNEKVPLVV PEVNPQDISC HQGIIANPNC STIQAVVVLK PLHDKYKIKR IVYSTYQAVS GAGHNGYMDL ENGLKGEPPK KFPHPIAGNC LPHIDSFLPN GYTKEEMKMV NETRKILGDY SIRITATTVR VPVFNGHSES INVEFEKQFD LEELKEVLRN APGVVVQDDV ANNVYPMPIY ASGRDETFVG RIRRDESVES GVNLWVVADN IRKGAATNAV QIAEELIKMW NK
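The sequences above are displayed in space-separated blocks of ten characters. As a rough illustration of how the Sequence fields relate to the GC content and Translation table fields, the sketch below strips the spacing, computes a GC fraction, and translates codons. It is an assumption-laden example rather than the database's own method: the helper names are hypothetical, the codon-to-amino-acid assignments are those of the standard code (which translation table 11 shares, differing only in permitted start codons), and only the first 30 nucleotides of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same assignments
# as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = clean("ATGAAAAAGG TTAATTTAGC AGTAGTCGGA")
print(translate(prefix))              # MKKVNLAVVG
print(round(gc_fraction(prefix), 3))  # 0.333 for this 30 nt prefix only
```

The translated prefix matches the first block of the protein sequence. The 44% figure in the GC content field refers to the full 999 bp gene, so it is not expected to match the value computed for this short prefix.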