Gene Cagg_0970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0970 
Symbol 
ID7268044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1198447 
End bp1199742 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content56% 
IMG OID643565819 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_002462324 
Protein GI219847891 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0111967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.585496 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATA CGCCACGCTT CACCGGTTTC GAGACGCTTG CCCTACACGC CGGTCAGATC 
CCGGATCCAA CAACCGGAGC ACGGGCAGTA CCGATCTACG CAACGACATC TTATCAGTTT
AAGGACACCG ATCACGCAGC GCGCTTGTTC AACCTGCAAG AGTTCGGCAA TATCTACACC
CGCATTATGA ATCCGACCAC CGATGTATTC GAGCAGCGGA TGGCCGCGCT GGAAGGTGGC
GTGGGTGCAT TGGCGTTGTC GTCAGGGCAA GCTGCTGAGA CGTTGGCGAT TTTGAATCTG
GCCGGGAGTG GTGATAACAT CGTCGCCTCG TCGGATTTGT ACGGTGGTAC CTATAATCTC
TTCCGTCATA CCTTACCGCG TTTGGGTATT ACGACCCGTT TTATCGATGC CCGTGATTAT
GATGGGTTTG CAGCGGCGAT TGATGATCGA ACAAAGGCGT TCTTTCTCGA ACTAGTTGGT
AACCCGCGGC TCGATGTGCT CGATCTGGAG CGGATTGCGG CGATTGCACA CGAGCGAGGT
GTACCGGTCA TTGTCGATGC AACGACGGTG ACCCCGTATC TGTGGCAGCC GATCAAGCAT
GGTGCTGACA TTGTCATTCA CTCGGCGACG AAGTACATTG GTGGGCACGG TACCGCGATC
GGTGGGATTA TTATCGATAG TGGTAAGTTT GATTGGGCGG CAAGTGGGCG TTTTCCCGAT
TTCACCAACC CCGATCCGAG CTATCACGGC TTGGTGTATA CGCAGACCTT CGGCAATCTT
GCCTATATCA TCAAGGCGCG CGTGCAAGGC CTACGTGATA TTGGTGCAGC CCTAAGCCCA
TTCAACAGTT TCCTCTTCTT GCAAGGGCTA GAGACGTTGC CGTTGCGGAT GGAGCGGCAC
AGCAAGAATG CGTTAGCCGT CGCGCGCTAT CTCAGCGAGC ATCCGAAGGT CGCATGGGTC
AACTATCCCG GCTTACCGAG CCACCCGAGC TATCCGTTGG CCCAAAAGTA TCTACCGCGC
GGACAGAGCG GGATCGTCGG GTTCGGCTTG AAGGGTGGGC GTAACGCCGG ACGAATCTTT
ATCGAACGGT TACGCCTCTT CTCACACTTG GCGAATATCG GTGATGCCAA GAGTTTGGCG
ATCCATCCGG CGACGACGAC GCATAGCCAG TTGACCCCTG AAGAACAGCG TCTCACCGGG
GTGACCGACG ACTACGTGCG GCTCTCCATC GGCCTTGAAA CGATAGACGA TATTTTGGCC
GACCTCGATC AGGCGTTGGC CGGAACACCA TCGTAG
 
Protein sequence
MSDTPRFTGF ETLALHAGQI PDPTTGARAV PIYATTSYQF KDTDHAARLF NLQEFGNIYT 
RIMNPTTDVF EQRMAALEGG VGALALSSGQ AAETLAILNL AGSGDNIVAS SDLYGGTYNL
FRHTLPRLGI TTRFIDARDY DGFAAAIDDR TKAFFLELVG NPRLDVLDLE RIAAIAHERG
VPVIVDATTV TPYLWQPIKH GADIVIHSAT KYIGGHGTAI GGIIIDSGKF DWAASGRFPD
FTNPDPSYHG LVYTQTFGNL AYIIKARVQG LRDIGAALSP FNSFLFLQGL ETLPLRMERH
SKNALAVARY LSEHPKVAWV NYPGLPSHPS YPLAQKYLPR GQSGIVGFGL KGGRNAGRIF
IERLRLFSHL ANIGDAKSLA IHPATTTHSQ LTPEEQRLTG VTDDYVRLSI GLETIDDILA
DLDQALAGTP S