Gene Cthe_2800 details

Gene Information

Locus tag: Cthe_2800
Symbol:
ID: 4810117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3299819
End bp: 3301000
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 44%
IMG OID: 640108220
Product: aminotransferase, class I and II
Protein accession: YP_001039192
Protein GI: 125975282
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1168] Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities
TIGRFAM ID:
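
The coordinate and length fields above are mutually consistent if the start and end positions are read as 1-based, inclusive coordinates (an assumption; the record does not state its convention) and if the final codon is a stop codon that is not counted in the protein length. A minimal sketch of that arithmetic, with illustrative variable names:

```python
# Values copied from the record above.
start_bp = 3299819
end_bp = 3301000

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute
# an amino acid to the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1182
print(protein_length)  # 393
```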


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000680188
Plasmid hitchhiking: No
Plasmid clonability: unclonable

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAGCAGCA TATTTGACGA AGTTGTAAAC AGAAGAAATA CCGACAGTCT CAAATGGGAC 
TCTTGCAGGC AGAGATTCGG CAAAGCAGAC ATTCTTCCAA TGTGGGTTGC CGATATGGAT
TTTAAATCAC CCTCTATTAT TACAGAGGCC ATAATACGGA GAGCTCAGCA TGGGATATTC
GGTTATACCG AAGCGTCGGA AAGGTTGTCA GCGGCTTTGG CCGGTTGGGT GAAAAAAAGA
CACAACTGGC AGATAGACGA GAGGTGGATT TCTTACAGTC CGGGCGTTGT TACTTCGGTA
AACACCGCAA TACTTGCATA TACGAATCCA GGGGACAAGG TTTTAATGCA GACTCCCATA
TACTACCCTT TTTATTCCAG TATTCTGGAT AATGAGAGGG AGCTGGTGAC AAATTCCCTA
AGGGATAATA ACCGACATTA TGAAATTGAT TTTGAAGACC TTGAAAAGAA GCTTTCCGAC
AATGTGAAAA TGATGATTTT CTGCAGTCCC CACAATCCGA TAGGCAGGGT TTGGAAGATT
GATGAACTCA AAGAGGTATT GAGGCTTTGC AAAAAATACA ATGTAATTCT TGTTTCGGAC
GAAATTCATT CGGATTTGGT GTTTAAGGGA CACAAACATA TTCCGGTTGG GTTGCCGGCT
GCGGAAAGCG ATTTTGAAAA CTTTATTGTG CTGGTGTCGC CGACGAAAAC CTTCAATATT
GCGGGACTTT CGGTGTCTGC CTCAATAATA CCTGATGCGG GGCTAAGGAG AAAATTCAGA
GCAACTTTAA GCAAAAACGG AGCCAACATG CTGAACATAT TCGGGCTTGT GGCGGCCGAG
GCTGCTTATT CAAGCTGTGA AAAATGGCTG GATGAACTGC TTTTGTATCT TGAAGAAAAT
CTAAATACTC TGGAAGAGTA TTTTAAGAAC AATATCCCTC AAATAAAAGT GATAAGGCCG
GAGGCGACGT ATCTGGCATG GCTTGACTGC AACGGGCTTC TGGTTCCGGC GGAAGAGCTG
AAGAGCTTTT TTGTCAACAA AGCGGGCGTG GGATTAAATG ACGGGGTGAC CTTCGGCAAA
GAGGGCCTTG GTTTTCAGAG ACTCAATTTC GCCTGCCCGA GAACGGTTTT ACTGGAAGGA
CTTTCAAGAA TCAAAAAAGC TGTAGATGAG CTTTCCAATT AG
 
Protein sequence
MSSIFDEVVN RRNTDSLKWD SCRQRFGKAD ILPMWVADMD FKSPSIITEA IIRRAQHGIF 
GYTEASERLS AALAGWVKKR HNWQIDERWI SYSPGVVTSV NTAILAYTNP GDKVLMQTPI
YYPFYSSILD NERELVTNSL RDNNRHYEID FEDLEKKLSD NVKMMIFCSP HNPIGRVWKI
DELKEVLRLC KKYNVILVSD EIHSDLVFKG HKHIPVGLPA AESDFENFIV LVSPTKTFNI
AGLSVSASII PDAGLRRKFR ATLSKNGANM LNIFGLVAAE AAYSSCEKWL DELLLYLEEN
LNTLEEYFKN NIPQIKVIRP EATYLAWLDC NGLLVPAEEL KSFFVNKAGV GLNDGVTFGK
EGLGFQRLNF ACPRTVLLEG LSRIKKAVDE LSN
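
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below is one possible way to do that and to spot-check the protein sequence against the gene sequence. It is not the tool that produced this record. It builds the codon table from the amino-acid string of NCBI translation table 11 and assumes the gene sequence is listed in coding orientation. The helper names `clean_sequence` and `translate` are illustrative. Only the first and last lines of the gene sequence are used here; both start in frame because each full line holds 60 bases.

```python
# Amino acids for NCBI translation table 11, with codon bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon to its one-letter amino acid ("*" marks a stop codon).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGCAGCA TATTTGACGA AGTTGTAAAC AGAAGAAATA CCGACAGTCT CAAATGGGAC"
last_line = "CTTTCAAGAA TCAAAAAAGC TGTAGATGAG CTTTCCAATT AG"

print(translate(first_line))  # MSSIFDEVVNRRNTDSLKWD
print(translate(last_line))   # LSRIKKAVDELSN
```

Both outputs match the start and end of the protein sequence listed above.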