Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1064 |
Symbol | |
ID | 4811362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1270803 |
End bp | 1271966 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106486 |
Product | aminotransferase, class V |
Protein accession | YP_001037489 |
Protein GI | 125973579 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGGG AAATTTATCT TGACAACAGT GCCACTACGA GGCCGTATGA TGAGGTTATA GACTTTGTAA ATCATATAAG CAGGGAAGTT TACGGCAATC CATCTTCTCT GCACACCAAA GGTATTGAAG CCGAACGAAT GGTGAGAAAT GCAAGGGAAA TTGTTGCAAA ATCTTTGGGA GTGTCAAGGG ATGAGATTTA TTTTACTTCC GGCGGAACCG AGGCAAACAA CCTTGCCATT GCAGGATACC TTTTTGCAAA TCCAAGGAAG GGAAAGCACA TTATCACGAC AAAGATTGAG CATCCTTCTG TCCTGGAGGT CTTTAACAAC TTGTCCCAAC ATGGTTACAA AGTTGATTTT ATAGATGTGG ACAAAAACGG CATAGTGATT GTTGATGACC TGCGAAAAAA AATAAATGAA GAAACTTCTC TTATAAGCGT GATTTACATT AACAATGAGA CGGGAGCGGT TCAACCCATT GATAAAATCG TGGAGGTAAA AAACAGTATA AACAAAGATA TAGTTCTTCA CGTTGATGCG GTTCAGGCGT ACGGAAAGAT AAGAATTGCT CCTGAAAAAC AAGGAATTGA CCTTTTGACC ATGAGCTCTC ACAAGATTCA CGGGCCCAAG GGAGTTGGGG CTTTATATAC CCGCCGGGAC ATTAAACTAA AACCGATTAT TTTCGGCGGG GGGCAGGAGA GCCAACTCAG GTCCGGTACT GAAAATGTTC CCGGAATCTG TGGTTTTGGA GCGGCGGTGG ACATTACCTT CAGGAAAATG GAAGAAAGTT CAAAATATTG TGAGAAGTTA AAAAGCATGC TTTTGGATAT GCTGAAAAAA GAGGTTGAGG ATGTGGTGAT AGTATCTCCC GAAGGATCAT CACCTTATAT TTTGAATGCT GCTTTCCCCA ATGTCCGTGC TGAGGTGCTT TTGCACCATT TGGAGACGAA GAACATATTT GTATCGACAG GAGCAGCCTG CTCTTCCAGA AAGCAGGTTT TAAGCCATGT GCTGAGGGCA ATGGGAATCA AGCCTGAAAT AATTGAAGGG GCGATTAGAT TCAGTTTTTC GTCGTTTAAC AACGAAGAGG ATATAATAAA AACCGTGGAA GCCATAAAAG ATATTTTGCC AAAAATAAGA ATAAAACGTG GAGGAAGAAG ATGA
|
Protein sequence | MSREIYLDNS ATTRPYDEVI DFVNHISREV YGNPSSLHTK GIEAERMVRN AREIVAKSLG VSRDEIYFTS GGTEANNLAI AGYLFANPRK GKHIITTKIE HPSVLEVFNN LSQHGYKVDF IDVDKNGIVI VDDLRKKINE ETSLISVIYI NNETGAVQPI DKIVEVKNSI NKDIVLHVDA VQAYGKIRIA PEKQGIDLLT MSSHKIHGPK GVGALYTRRD IKLKPIIFGG GQESQLRSGT ENVPGICGFG AAVDITFRKM EESSKYCEKL KSMLLDMLKK EVEDVVIVSP EGSSPYILNA AFPNVRAEVL LHHLETKNIF VSTGAACSSR KQVLSHVLRA MGIKPEIIEG AIRFSFSSFN NEEDIIKTVE AIKDILPKIR IKRGGRR
|
| |