Gene Cthe_0755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0755 
Symbol 
ID4810373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp920548 
End bp921735 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content41% 
IMG OID640106172 
Productaspartate aminotransferase 
Protein accessionYP_001037183 
Protein GI125973273 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGTTT CAAAAAAGGC GCTTTCTATT AGCCCGTCTT CCACGTTGGC TATAGATGCA 
AAAGCGAAAA AGATGAGGTC AGAAGGAATT GATATAATTG GATTCGGTGC AGGTGAACCC
GACTTTGACA CACCTGATCA CATAAAAAAA GCGGCAATAG ATGCTATAAA TGCCGGGTTT
ACCAAATACA CTCCGGCTTC AGGTACACTT GAGCTGAAAC AGGCTATTTG CCGGAAGTTT
AAGAGAGACA ACGGGCTTGA TTACAATCCT TCAAATATTG TAATAAGCAA CGGTGCAAAG
CATTCTCTGG TTAACGCTTT GCAGGCAATA TGCAATCCCG GCGATGAGGT AATTATTCCG
ACTCCTGCAT GGGTAAGCTA TCCTGAGATG GTTAAGCTTG CAGACGGAGT GCCGGTTTAC
ATTCATTGTT CCGAGGAAGA GGGCTTTAAA TTTACGATAG ATAAACTTGA AAAAGCTATT
ACAGACAAAA CCAGAGCAAT AATTATCAAC AGCCCGAGCA ATCCTACGGG TATGATTTAC
AGTGAAGAGG AATTAAGAGC TGTGGCCGAT TTGGCTGTGA GCAAAGGTAT ATATATCATA
TCCGATGAAA TATACGAAAA GCTTATTTAT GACGGATACA AGCATGTGAG CATTGCATCC
TTTAATGATA AAATTAAGGA CCTTACCATA GTTGTTAACG GTGTGTCCAA ATCTTATGCA
ATGACAGGTT GGAGAATAGG ATATACTGCC AGCAACGAGC AAATTGCAAA AATTATGGCT
AATGTACAGA GTCATGCCAC ATCAAATCCA AATTCAATAG CACAGAAAGC TGCATTGGCT
GCCCTTGAAG GACCACAGGA GATAATCGAC GAGATGTCGG CAGAGTTTGT GAAGAGAAGA
GATTACATGG TTGATAGGAT AAACTCAATG AACGGGGTAT CCTGTATAAA ACCAAACGGT
GCTTTCTATG TAATGATGAA CATATCAAAG CTTATCGGAA AAGAAATAGC CGGCATGAAA
ATAACCGGTT CCGACAGCTT TGCGGAAGCT CTGCTGGAAA AAGCAAATGT TGCTTTGGTT
CCGGGTTCAG GTTTTGGAAC GGATATTCAC GTGAGGTTGT CTTATGCAAC TTCCATGGAG
AATATTGTTG AAGGTTTGAA CAGGATAGAG AAGTTTTTGA GTATGTAA
 
Protein sequence
MVVSKKALSI SPSSTLAIDA KAKKMRSEGI DIIGFGAGEP DFDTPDHIKK AAIDAINAGF 
TKYTPASGTL ELKQAICRKF KRDNGLDYNP SNIVISNGAK HSLVNALQAI CNPGDEVIIP
TPAWVSYPEM VKLADGVPVY IHCSEEEGFK FTIDKLEKAI TDKTRAIIIN SPSNPTGMIY
SEEELRAVAD LAVSKGIYII SDEIYEKLIY DGYKHVSIAS FNDKIKDLTI VVNGVSKSYA
MTGWRIGYTA SNEQIAKIMA NVQSHATSNP NSIAQKAALA ALEGPQEIID EMSAEFVKRR
DYMVDRINSM NGVSCIKPNG AFYVMMNISK LIGKEIAGMK ITGSDSFAEA LLEKANVALV
PGSGFGTDIH VRLSYATSME NIVEGLNRIE KFLSM