Gene Cthe_0580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0580 
Symbol 
ID4808255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp709432 
End bp710625 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content40% 
IMG OID640105994 
Productaspartate aminotransferase 
Protein accessionYP_001037009 
Protein GI125973099 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATCAG AAAGTGTTGT AAACAGTTTA AAGAAAGCCT CCTGGATAAG GGCAATGTTT 
GAAGAGGGAG AAAAACTTCG TAAAATTCAC GGGGCCGACA ACGTTTATGA TTTTACTTTG
GGAAATCCCG ACCACGAACC GCCCTCTTCG GTAAAAGAAA CACTTAAAAA AATTGTTACC
GAAGACAAAC CGGGCATACA CCGTTATATG AATAACGCAG GATATGAAGA TGTGAGACAA
AAAGTGGCAG ACTATCTGAA CAGAACCTCC GGGCTGTCTT CCATATCATC TCAGCATATA
ATCATGACCT GCGGCGCTGC CGGTGCTCTC AACGTTGTAC TGAAAACTCT TCTCAACCCC
GGAGAAGAAG TCATCATACT GGCACCTTAC TTTGCAGAGT ATATATTCTA TGTGGGAAAT
CACGGCGGAA AAGTGGTTAT AGTACCACCG GAAAAGGACA GTTTTAAACC TGACTTAAAA
ATACTTGAAA ACAGCATCAC CGAAAAAACT AAAGCCATAA TCATAAATTC TCCCAATAAT
CCATCGGGTT ACATATACAG CGAAGAAACC CTGAAGGAGA TTTTTGAAGT TCTTGAAAAG
AAAGAAAAGG AATATAATTC CAGTATATAT GCAATTTCCG ATGAACCTTA CTACAAGCTG
GTTTACGACA ATGTAAAACT TCCTTTTCTT TTCAGACTGT ATAAAAAATC CTTTATCGTA
AACTCTTTCA GCAAATCCCT GGCTCTTGCG GGGGAAAGAA TCGGTTATAT TGCGGTAAAT
CCGGAGATTC CCGAACTGGA ACTTATATTG GAAAGCTTGA TATTCTGCAA CCGTACCTTA
GGTTACGTCA ATGCTCCTGC ATTGTTCCAA AAGGCAATTG CCGACTCTCT GGATGCGGAT
ATTGATGTTG AAAGCTATAA ACAAAGGCGG GATTTAATAT ATGACACTTT AACCCGTCTG
GGCTTTTCAT GCATAAAGCC CCAGGGAACT TTCTACATTT TCCCCAAATC CCCTATTGAA
GATGATATAC AATTTATCAA ACATGCGGTT AAATACAACA TTCTTTTGGT TCCGGGCACC
GGCTTTGGTT TACCGGGGCA CTTCAGACTC TCCTACTGCG TAAGCATGGA TATCATAAAA
AAATCACTGC CGGCTTTCGA AGCATTGGCC AAAGACTTTA ATCTTATAAA ATAA
 
Protein sequence
MISESVVNSL KKASWIRAMF EEGEKLRKIH GADNVYDFTL GNPDHEPPSS VKETLKKIVT 
EDKPGIHRYM NNAGYEDVRQ KVADYLNRTS GLSSISSQHI IMTCGAAGAL NVVLKTLLNP
GEEVIILAPY FAEYIFYVGN HGGKVVIVPP EKDSFKPDLK ILENSITEKT KAIIINSPNN
PSGYIYSEET LKEIFEVLEK KEKEYNSSIY AISDEPYYKL VYDNVKLPFL FRLYKKSFIV
NSFSKSLALA GERIGYIAVN PEIPELELIL ESLIFCNRTL GYVNAPALFQ KAIADSLDAD
IDVESYKQRR DLIYDTLTRL GFSCIKPQGT FYIFPKSPIE DDIQFIKHAV KYNILLVPGT
GFGLPGHFRL SYCVSMDIIK KSLPAFEALA KDFNLIK