Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2800 |
Symbol | |
ID | 4810117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3299819 |
End bp | 3301000 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108220 |
Product | aminotransferase, class I and II |
Protein accession | YP_001039192 |
Protein GI | 125975282 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1168] Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000680188 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGCA TATTTGACGA AGTTGTAAAC AGAAGAAATA CCGACAGTCT CAAATGGGAC TCTTGCAGGC AGAGATTCGG CAAAGCAGAC ATTCTTCCAA TGTGGGTTGC CGATATGGAT TTTAAATCAC CCTCTATTAT TACAGAGGCC ATAATACGGA GAGCTCAGCA TGGGATATTC GGTTATACCG AAGCGTCGGA AAGGTTGTCA GCGGCTTTGG CCGGTTGGGT GAAAAAAAGA CACAACTGGC AGATAGACGA GAGGTGGATT TCTTACAGTC CGGGCGTTGT TACTTCGGTA AACACCGCAA TACTTGCATA TACGAATCCA GGGGACAAGG TTTTAATGCA GACTCCCATA TACTACCCTT TTTATTCCAG TATTCTGGAT AATGAGAGGG AGCTGGTGAC AAATTCCCTA AGGGATAATA ACCGACATTA TGAAATTGAT TTTGAAGACC TTGAAAAGAA GCTTTCCGAC AATGTGAAAA TGATGATTTT CTGCAGTCCC CACAATCCGA TAGGCAGGGT TTGGAAGATT GATGAACTCA AAGAGGTATT GAGGCTTTGC AAAAAATACA ATGTAATTCT TGTTTCGGAC GAAATTCATT CGGATTTGGT GTTTAAGGGA CACAAACATA TTCCGGTTGG GTTGCCGGCT GCGGAAAGCG ATTTTGAAAA CTTTATTGTG CTGGTGTCGC CGACGAAAAC CTTCAATATT GCGGGACTTT CGGTGTCTGC CTCAATAATA CCTGATGCGG GGCTAAGGAG AAAATTCAGA GCAACTTTAA GCAAAAACGG AGCCAACATG CTGAACATAT TCGGGCTTGT GGCGGCCGAG GCTGCTTATT CAAGCTGTGA AAAATGGCTG GATGAACTGC TTTTGTATCT TGAAGAAAAT CTAAATACTC TGGAAGAGTA TTTTAAGAAC AATATCCCTC AAATAAAAGT GATAAGGCCG GAGGCGACGT ATCTGGCATG GCTTGACTGC AACGGGCTTC TGGTTCCGGC GGAAGAGCTG AAGAGCTTTT TTGTCAACAA AGCGGGCGTG GGATTAAATG ACGGGGTGAC CTTCGGCAAA GAGGGCCTTG GTTTTCAGAG ACTCAATTTC GCCTGCCCGA GAACGGTTTT ACTGGAAGGA CTTTCAAGAA TCAAAAAAGC TGTAGATGAG CTTTCCAATT AG
|
Protein sequence | MSSIFDEVVN RRNTDSLKWD SCRQRFGKAD ILPMWVADMD FKSPSIITEA IIRRAQHGIF GYTEASERLS AALAGWVKKR HNWQIDERWI SYSPGVVTSV NTAILAYTNP GDKVLMQTPI YYPFYSSILD NERELVTNSL RDNNRHYEID FEDLEKKLSD NVKMMIFCSP HNPIGRVWKI DELKEVLRLC KKYNVILVSD EIHSDLVFKG HKHIPVGLPA AESDFENFIV LVSPTKTFNI AGLSVSASII PDAGLRRKFR ATLSKNGANM LNIFGLVAAE AAYSSCEKWL DELLLYLEEN LNTLEEYFKN NIPQIKVIRP EATYLAWLDC NGLLVPAEEL KSFFVNKAGV GLNDGVTFGK EGLGFQRLNF ACPRTVLLEG LSRIKKAVDE LSN
|
| |