Gene Ccel_1962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1962 
Symbol 
ID7310677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2322311 
End bp2323579 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content43% 
IMG OID643608896 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_002506290 
Protein GI220929381 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.493434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATGGA AAATTGAAAC AAAGTGCCTC CAGGAAGGCT ATAAACCTGA AAACGGTCAG 
CCCCGTGTAC TTCCTATTTA CCAAAGCACA ACCTACAAAT ATGACTCAAC CGAGCATGTT
GCCAAACTCT TTGACTTATC GGTTCCGGGG CATATGTATT CCAGAATCAG CAATCCCACA
GTAGAATGTG TTGAAAATAA GATTGCAGCA TTGGAAGGCG GTGTCGGTGC TCTCTGCACC
TCGTCCGGAC AAGCTGCCTC TCTGATATCC ATTCTAAATA TATGCGAAGC AGGAGATCAT
TTTGTATGTT CCGGCACAAT CTACGGAGGC ACAATCAATC TCTTTGGTGC CAGTTTGAAA
AAGCATGGAA TAAGCGTAAC CTTTATTGAC CAGGACAGCT CAGAGGAAGA GATTCAAAAG
GCTTTTCAGC CCAACACAAA AGCTCTTTTC GGTGAATCTA TTGCCAATCC TAAAATCAGT
GTACTCGACA TAGAGAAGTT TGCAAAAATA GCACATAAAA ACAATGTTCC CCTTATTATT
GACAATACCT TTGCAACACC TATTTTGCTA AGGCCTATAG AACATGGTGC GGATATTGTA
ATACACTCCA CTACAAAATA CATGGATGGA CATGCTGTAT GCATCGGCGG AGTTATAGTT
GACTCCGGCA ACTTCAACTG GGAAAGCGGC AAATTCCCCG GTTTAACTGA ACCTGATGAA
ACCTACCATG GTATGGTTTA TACAAGAGAC TGCGGTAAAG CTGCATATAT AACTAAAGCC
AGAGTACAGC TTATAAGGGA TTACGGGTGT TGTATGTCGG CAAATAATGC ATTTTTACTT
AATCTTGGAC TTGAAACTCT GCACTTGAGA ATGGAACGTC ACTGTCAAAA CGCCAAAAAA
GTTGCGGAAT ACCTTGCAAC AAGCGACAAA GTAATATCTG TTAGCTACCC TACCCTGCCC
GGTGATCCTT ATCACTCTCT TGCTGAGAAG TATCTTCCAA AGGGCTGCAG CGGAGTTGTG
TCTTTCAGAA TTAAAGGCGG CAGAGAAGGT GCAGTCAAGT TTATGGATAA GTTAAAGCTT
GCCGCGATAG TTGTTCACGT CGCAGATTGC AGAACAGCTG TACTTCACCC TGCAAGTTCC
ACTCACAGAC AATTAAGTGA TGAGCAGCTT GTACAGGCTG GGATCGACCC GGGTCTTATC
CGTTTTTCAG TAGGTCTTGA AAATGTTGAT GATATTATAG TTGACATAAA ACAAGCTTTG
GAAGATTAA
 
Protein sequence
MSWKIETKCL QEGYKPENGQ PRVLPIYQST TYKYDSTEHV AKLFDLSVPG HMYSRISNPT 
VECVENKIAA LEGGVGALCT SSGQAASLIS ILNICEAGDH FVCSGTIYGG TINLFGASLK
KHGISVTFID QDSSEEEIQK AFQPNTKALF GESIANPKIS VLDIEKFAKI AHKNNVPLII
DNTFATPILL RPIEHGADIV IHSTTKYMDG HAVCIGGVIV DSGNFNWESG KFPGLTEPDE
TYHGMVYTRD CGKAAYITKA RVQLIRDYGC CMSANNAFLL NLGLETLHLR MERHCQNAKK
VAEYLATSDK VISVSYPTLP GDPYHSLAEK YLPKGCSGVV SFRIKGGREG AVKFMDKLKL
AAIVVHVADC RTAVLHPASS THRQLSDEQL VQAGIDPGLI RFSVGLENVD DIIVDIKQAL
ED