Gene Cpha266_1767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1767 
Symbol 
ID4570111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2008496 
End bp2009788 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content50% 
IMG OID639766350 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_912208 
Protein GI119357564 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGT TATCATCTTT ATGCAGGTTT GAGACACTGC AGGTTCATGC AGGCCAGGAG 
CCTGATCCAA CCACGAATGC GCGGGCGGTG CCTATATATC AGACAACTTC CTATACGTTT
GACAGTGTGG CGCACGGTTC CGATCTTTTT GCTCTCAAAG CGTTTGGCAA TATCTATACC
AGGTTGATGA ATCCGACTAC CGATGTTCTC GAAAAGCGTG TTGCTGCGCT TGAAGGGGGA
GCGGCAGCGC TTGCGCTTGC AAGCGGACAT TCGGCACAGT TTATTGCAAT CACCAACCTC
TGCCAGGCGG GAGACAATAT CGTTTCGTCA AGTTATCTCT ATGGAGGAAC CTATAATCAG
TTCAAGGTTA CCTTTCCACG TCTCGGCATC AAGGTGAAGA TTGTTGACGG TCAGAGTCCT
GAAGCATTCC GGGCCGCGAT TGATGAACAG ACCAAAGCGC TCTACGTCGA ATCTATCGGT
AACCCGGCAT TTCATGTTCC CGATTTTGAT GCTTTGGCTG AGTTGTCCCG TGAATATGGT
ATTCCGCTAA TCGTAGACAA TACGTTCGGG TGTGCAGGAT ATCTTGCTCG TCCGCTCGAT
CACGGCGCTT CGATTGTTGT TGAGTCAGCT ACAAAATGGA TAGGCGGTCA TGGAACTTCA
ATGGGCGGCG TTATCGTTGA TTCGGGAACG TTCAACTGGG GGAACGGAAA ATTTCCGTTG
CTCAGTGAGC CCTCGGAAGG CTATCATGGA CTTAAGTTTT ATGAGACGTT CGGGTCTCTT
GCCTTTATTA TAAAGGCGAG GGTTGAGGGT CTGCGGGATA TCGGGCCAGC GATCAGTCCC
TTTAACTCGT TTTTACTTCT GCAGGGGCTT GAGACGCTTT CATTGCGTGT CCAGCGCCAT
GCCGACAATA CGCTTGCGCT TGCCCGCTGG CTTGAGAAGC ACCCTTCTGT AGCCTGGGTG
AACTATGCCG GACTTGAAGG GCATCAAACC TGGGAACTGG CAAAAAAATA TCTTCAAAAC
GGATTTGGCT GTGTGCTAAC CTTCGGTATC AGGGGCGGAT ATGAGAAAGC CGTTGGTTTT
ATCGAGAGCG TCAGGCTTGC AAGCCATCTT GCAAATGTAG GCGATGCCAA GACTCTTGTT
ATCCATCCGG CATCAACAAC GCATCAGCAG CTCAGTTCAG GCGAGCAGGA GTCTGCAGGT
GTCAGTAGCG ATATGATCCG CGTATCGGTC GGGATAGAGC ATATCGAGGA CATCAAAGAC
GATTTTAAGC AAGCATTTAA TAAAATTGGT TGA
 
Protein sequence
MSKLSSLCRF ETLQVHAGQE PDPTTNARAV PIYQTTSYTF DSVAHGSDLF ALKAFGNIYT 
RLMNPTTDVL EKRVAALEGG AAALALASGH SAQFIAITNL CQAGDNIVSS SYLYGGTYNQ
FKVTFPRLGI KVKIVDGQSP EAFRAAIDEQ TKALYVESIG NPAFHVPDFD ALAELSREYG
IPLIVDNTFG CAGYLARPLD HGASIVVESA TKWIGGHGTS MGGVIVDSGT FNWGNGKFPL
LSEPSEGYHG LKFYETFGSL AFIIKARVEG LRDIGPAISP FNSFLLLQGL ETLSLRVQRH
ADNTLALARW LEKHPSVAWV NYAGLEGHQT WELAKKYLQN GFGCVLTFGI RGGYEKAVGF
IESVRLASHL ANVGDAKTLV IHPASTTHQQ LSSGEQESAG VSSDMIRVSV GIEHIEDIKD
DFKQAFNKIG