Gene Cthe_2793 details


Gene Information

Locus tag: Cthe_2793
Symbol:
ID: 4810110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3293731
End bp: 3294903
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 45%
IMG OID: 640108213
Product: aminotransferase
Protein accession: YP_001039185
Protein GI: 125975275
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
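
The length fields in this record are consistent with its coordinates. The short Python sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 3293731
end_bp = 3294903

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1173 0 390
```

A non-zero remainder would suggest a partial CDS or a frameshift; here the length is an exact multiple of three.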


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAAAGT TAAGTAGCAA AATTGAGGGA TTTACCGATT CGGTCATAAG AAGAATGACC 
CGTATTGCCA ACAGCTATGG AGCAATAAAC CTTTCCCAGG GATTTCCGGA TTTTGATCCT
CCTGTTGAAT TAAAAAATGC TTTAAGCAGA GTGGCATCCG GAAGCATACA TCAATATGCA
GTAACATGGG GTGCTAAAAA TTTCAGAGAG TCACTGGCAA AAAAACAGTC CAGGTTTATG
GGAATACCCA TTGATCCCGA GACGCAAATC GTGGTGACCT GCGGCAGTAC CGAAGCCATG
ATGGCGGCCA TGATGACGGT TTGCAATCCG GGGGACAAAG TGGTGGTCTT TTCCCCGTTT
TATGAAAATT ACGCCGCAGA TGCCATACTT TCGGGGGCAG AGCCTATATA TGTGCATTTA
CGACCACCTG GATTCAATTT TGATGTGGAT GAATTGGAAG AAGCTTTTAA ACAGAGGCCC
AAAGCGTTGA TTTTATGCAA TCCGTCAAAT CCCTCGGGAA AGGTATTTAC CTTGGAGGAA
CTGAAGACCA TAGCTTATTT TGCAGAAAAA TATGATACGT TTGTGATTAC CGACGAGGTT
TATGAGCATA TTGTGTATCC CCCTCATCAT CATATATATT TTGCATCCCT TCCCGGGATG
TTTGAAAGAA CCATATCCTG CAGTTCCCTG TCAAAAACTT ATTCCATCAC CGGATGGAGA
CTGGGGTATT TGATTGCACC CTCTTACATT GTTGACGGGG CCAGGAAGGT TCATGACTTT
CTGACAGTGG GCGCTGCCGC ACCGTTGCAG GAAGCGGCAG TGGTTGCCCT GAATTTCGGG
GATGATTATT ATGAGAACTT AAAAAGAATT TATACAGAGA AAAGGGATTT TTTTCTGGAT
GGTTTGGATA GGCTGGGGCT TGCGTATACC GTACCGCAAG GGGCCTATTA TGTAATGGTG
GACATTTCGG AGTTTGGAGC AAAAAGCGAC TTGGAATTTT GTGAGTGGAT GGCGAGGGAA
GTGGGCGTTG CAGCGGTTCC CGGCTCAAGC TTTTTCAGAG ATAATGTAAA TCATCTGATT
CGCTTCCACT TTGCAAAAAA GAAAGAAACT CTGGCTGAAG CTGTAAAGAG ACTTGAAAAG
CTTAAAGACA AAGCAAGGGA GCGATGGAGA TGA
 
Protein sequence
MPKLSSKIEG FTDSVIRRMT RIANSYGAIN LSQGFPDFDP PVELKNALSR VASGSIHQYA 
VTWGAKNFRE SLAKKQSRFM GIPIDPETQI VVTCGSTEAM MAAMMTVCNP GDKVVVFSPF
YENYAADAIL SGAEPIYVHL RPPGFNFDVD ELEEAFKQRP KALILCNPSN PSGKVFTLEE
LKTIAYFAEK YDTFVITDEV YEHIVYPPHH HIYFASLPGM FERTISCSSL SKTYSITGWR
LGYLIAPSYI VDGARKVHDF LTVGAAAPLQ EAAVVALNFG DDYYENLKRI YTEKRDFFLD
GLDRLGLAYT VPQGAYYVMV DISEFGAKSD LEFCEWMARE VGVAAVPGSS FFRDNVNHLI
RFHFAKKKET LAEAVKRLEK LKDKARERWR