Gene Cthe_1569 details


Gene Information

Locus tag: Cthe_1569
Symbol:
ID: 4810076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1897121
End bp: 1898437
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 46%
IMG OID: 640106987
Product: O-acetylhomoserine aminocarboxypropyltransferase
Protein accession: YP_001037988
Protein GI: 125974078
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
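
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1897121, 1898437)
print(length)                     # 1317
print(protein_length_aa(length))  # 438
```

With these assumptions, 1898437 - 1897121 + 1 gives 1317 bp, and 1317 / 3 - 1 gives 438 aa, matching the Gene Length and Protein Length fields.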


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAACG AATTAGAAAA AACTGCATAT CGGTTTGATA CTCAAAGACT GCGGGCAGGC 
TATGATCCAA AGGAGCATAA CTATGCCGTA TCGCCGCCCA TTTATCAGAC CACTTCCTTT
GACTTTCGCG ATGTAGAGCA TGCAAAAGCT TTATTTGGAC TTCGCGAATT GGGAAACCTT
TATACCCGCA TTGGAAATCC TACGGTTGCT GTACTGGAGC AACGGGTAGC GGCCCTTGAC
GGAGCCAGCG GTGCAGTGGC ACTGGCTTCC GGCATGGCAG CTATCAGTTA TACTCTGCTC
AATGTTGCAG AGGGCGGAGG TCGTATTTTG ACTTCTCCCT ATCTCTATGG CGGCAGTGCG
GACAGTTTTA AAAAAGTTTA TCCTAAATTT GGTATAACCT TCGATTTTGC TAAAAATATA
GAAAATCCTG AAAAGCTTTC GGAAGAAATT AAGCCCGATA CCAAAGCAAT CTATGTGGAA
AGCATCAGCA ATCCCAATGC CGCTCTTTTG GACATTGATG CGATTGCGAA AGTCGCCCAT
GAACATGGCA TACCTCTGAT TGTAGACAAT ACAGTTGCAA CGCCGTACTT GTATAACCCT
ATTTCCCATG GGGCCGACAT TGTGGTCTAC TCTGCGACCA AAGGATTGAC GGGACACGGA
AACGTTATTG CCGGACTGGT GCTGGAAAGC GGTAAATTCA ACTGGCAGAG CGATAAATTC
CCTCAGTTTT CGGAAAAGTA TTATACCCTG CGGGACATTA ACGATAACTA CCGCAGTTTT
CTTGAAGTAT TTCCGGAGGC TCCGTTTACC GGCAGGATAC GCTTTAACTA TCTCAACTAC
TTTGGAGCGG CATTATCGCC TTTTGATGCC TACCTTGTTT TAATCGGATT GGAAACCCTG
TCTGAGCGCG TTGAAAAACA AGTGCGTAAT GCCAGTATCC TGGCGGAATA CCTAAAATCC
CACAAGTCGG TGGAATGGGT TCGTTACCCC GGTTTGAAGG ATAGCCCCTA TTATGAGCTG
GCACAAAGGG ATTTCCCAAA AGGAGCGGGA GGAATCCTGT CATTCGGTTT CAAAGGGACA
ACGGCTCAAA GGGAAACTTT CTTGAACTCA GTGAAACTCT TTCATTATCA TGTCAATATT
GGGGATGCCC GCAGCCTGAT TGTAAATTCT CCGCAAACCA CCCATAGTGA ACTGGAGCCT
GATGAGAAAA AATTTGCCGA TATTCCGGAA AACTTAATTC GCATATCGGC AGGACTGGAA
GATCCGGCAG ATTTAATAGC TGATTTGGAA CAGGCGTTTG AAAAGGCATA TATATAA
 
Protein sequence
MANELEKTAY RFDTQRLRAG YDPKEHNYAV SPPIYQTTSF DFRDVEHAKA LFGLRELGNL 
YTRIGNPTVA VLEQRVAALD GASGAVALAS GMAAISYTLL NVAEGGGRIL TSPYLYGGSA
DSFKKVYPKF GITFDFAKNI ENPEKLSEEI KPDTKAIYVE SISNPNAALL DIDAIAKVAH
EHGIPLIVDN TVATPYLYNP ISHGADIVVY SATKGLTGHG NVIAGLVLES GKFNWQSDKF
PQFSEKYYTL RDINDNYRSF LEVFPEAPFT GRIRFNYLNY FGAALSPFDA YLVLIGLETL
SERVEKQVRN ASILAEYLKS HKSVEWVRYP GLKDSPYYEL AQRDFPKGAG GILSFGFKGT
TAQRETFLNS VKLFHYHVNI GDARSLIVNS PQTTHSELEP DEKKFADIPE NLIRISAGLE
DPADLIADLE QAFEKAYI
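
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such text could be cleaned, translated, and checked for GC content follows. It uses only the Python standard library, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, not part of any database tool. The codon table built here is the standard one; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons, which does not matter for this gene since it begins with ATG. Only the first and last lines of the gene sequence are embedded, to keep the example short.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering
# (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGCAAACG AATTAGAAAA AACTGCATAT CGGTTTGATA CTCAAAGACT GCGGGCAGGC"
last_line = "GATCCGGCAG ATTTAATAGC TGATTTGGAA CAGGCGTTTG AAAAGGCATA TATATAA"

print(translate(clean(first_line)))  # MANELEKTAYRFDTQRLRAG
print(translate(clean(last_line)))   # DPADLIADLEQAFEKAYI
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give the value to compare against the GC content field.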