Gene Ccel_3332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3332 
Symbol 
ID7311903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3875484 
End bp3876764 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content39% 
IMG OID643610235 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_002507601 
Protein GI220930692 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000160258 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGAA AACTTAGCTT TGATACATTA CAGGTTCATG CAGGACAGGA AGTAGATCCG 
ACCACAAGGT CAAGGGCAGT GCCAATATAC CAGACATCAT CCTATGTTTT TAAGGATTGC
GACAATGCAG CAGATTTATT TGGATTAAAG GATTCTGGTA ATATTTACAC AAGAATAATG
AACCCTACAA CAGATGTATT TGAAAAAAGA ATCGCGGCAC TTGAGGGAGG TACAGCAGCA
CTTGCAGTTG CATCAGGATC ATCAGCAATA ACTTATTCAG TACTTAACAT TGCTGGAGCA
GGAGACGAAA TAGTAGCTGC CAACACCTTA TACGGAGGAA CTTATAACCT TTTTGCGGTA
ACCTTGCCAA GATACGGAGT AAATACAATA TTTGTCGATC CCGATAACAT CCAAAACTTT
GAAGATGCCA TTACAGAAAA AACAAAAGCT CTGTACATTG AATCAATAGG TAACCCAAAT
GCAAATCTAA TTGATATTCA AGCTGTTGCA GACATTGCAC ATAAGCATGG TATCCCCCTT
ATAGTTGATA ATACTTTTGG CTCACCATAT CTGGTAAGGC CTATCGACTT CGGTGCGGAC
GTAGTTGTAC ATTCAGCAAC CAAATTTATA GGCGGACATG GAAGTTCTAT CGGTGGTGTA
ATCATTGATG GAGGTAAATT TGACTATTCA GCGGGGGACA AATTCCCCGG ATTTACAACA
CCTGATGAAA GCTATCATGG AGTTGTATAC AGTCAGTTAG AAGGTGTTGC CTTTATAACA
AAAGCCAGAG TTCAACTGCT TAGAGATACA GGTGCGGCTA TCAGTCCGTT TAATTCCTTC
CTTTTTATAC AAGGACTTGA GACACTATCT TTGAGAGTCG AAAGACATGT AAGCAATTCT
AAAAAGATTG CAGAATACCT GGAAAAACAC TCATTGGTGG AGAAAGTAAA TTATCCAAGC
CTGAAAGGAA ATAAATACTT TGATCTCGCT CAGAAATACT TTCCAAAGGG TTCAGGGTCA
ATATTTACCT TTGAAATAAA AGGAGGTCAC GAATCTGCGA AGAAATTTAT AAATAGTCTG
GAAATATTCT CATTATTAGC AAATGTCGCA GATGCAAAAT CTCTGGTAAT ACATCCTGCA
AGCACTACTC ATTCCCAGCT TTCAGAAGAT GAGCTTTTGA AATCAGGAAT AACACCCGGA
ACAGTAAGAC TTTCCATAGG CATTGAAGAT CCTGACGATC TTATATACGA CATAGATCAG
GCTCTTGAAA AGAGCAGGTA A
 
Protein sequence
MNRKLSFDTL QVHAGQEVDP TTRSRAVPIY QTSSYVFKDC DNAADLFGLK DSGNIYTRIM 
NPTTDVFEKR IAALEGGTAA LAVASGSSAI TYSVLNIAGA GDEIVAANTL YGGTYNLFAV
TLPRYGVNTI FVDPDNIQNF EDAITEKTKA LYIESIGNPN ANLIDIQAVA DIAHKHGIPL
IVDNTFGSPY LVRPIDFGAD VVVHSATKFI GGHGSSIGGV IIDGGKFDYS AGDKFPGFTT
PDESYHGVVY SQLEGVAFIT KARVQLLRDT GAAISPFNSF LFIQGLETLS LRVERHVSNS
KKIAEYLEKH SLVEKVNYPS LKGNKYFDLA QKYFPKGSGS IFTFEIKGGH ESAKKFINSL
EIFSLLANVA DAKSLVIHPA STTHSQLSED ELLKSGITPG TVRLSIGIED PDDLIYDIDQ
ALEKSR