Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1767 |
Symbol | |
ID | 4570111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2008496 |
End bp | 2009788 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639766350 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_912208 |
Protein GI | 119357564 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGT TATCATCTTT ATGCAGGTTT GAGACACTGC AGGTTCATGC AGGCCAGGAG CCTGATCCAA CCACGAATGC GCGGGCGGTG CCTATATATC AGACAACTTC CTATACGTTT GACAGTGTGG CGCACGGTTC CGATCTTTTT GCTCTCAAAG CGTTTGGCAA TATCTATACC AGGTTGATGA ATCCGACTAC CGATGTTCTC GAAAAGCGTG TTGCTGCGCT TGAAGGGGGA GCGGCAGCGC TTGCGCTTGC AAGCGGACAT TCGGCACAGT TTATTGCAAT CACCAACCTC TGCCAGGCGG GAGACAATAT CGTTTCGTCA AGTTATCTCT ATGGAGGAAC CTATAATCAG TTCAAGGTTA CCTTTCCACG TCTCGGCATC AAGGTGAAGA TTGTTGACGG TCAGAGTCCT GAAGCATTCC GGGCCGCGAT TGATGAACAG ACCAAAGCGC TCTACGTCGA ATCTATCGGT AACCCGGCAT TTCATGTTCC CGATTTTGAT GCTTTGGCTG AGTTGTCCCG TGAATATGGT ATTCCGCTAA TCGTAGACAA TACGTTCGGG TGTGCAGGAT ATCTTGCTCG TCCGCTCGAT CACGGCGCTT CGATTGTTGT TGAGTCAGCT ACAAAATGGA TAGGCGGTCA TGGAACTTCA ATGGGCGGCG TTATCGTTGA TTCGGGAACG TTCAACTGGG GGAACGGAAA ATTTCCGTTG CTCAGTGAGC CCTCGGAAGG CTATCATGGA CTTAAGTTTT ATGAGACGTT CGGGTCTCTT GCCTTTATTA TAAAGGCGAG GGTTGAGGGT CTGCGGGATA TCGGGCCAGC GATCAGTCCC TTTAACTCGT TTTTACTTCT GCAGGGGCTT GAGACGCTTT CATTGCGTGT CCAGCGCCAT GCCGACAATA CGCTTGCGCT TGCCCGCTGG CTTGAGAAGC ACCCTTCTGT AGCCTGGGTG AACTATGCCG GACTTGAAGG GCATCAAACC TGGGAACTGG CAAAAAAATA TCTTCAAAAC GGATTTGGCT GTGTGCTAAC CTTCGGTATC AGGGGCGGAT ATGAGAAAGC CGTTGGTTTT ATCGAGAGCG TCAGGCTTGC AAGCCATCTT GCAAATGTAG GCGATGCCAA GACTCTTGTT ATCCATCCGG CATCAACAAC GCATCAGCAG CTCAGTTCAG GCGAGCAGGA GTCTGCAGGT GTCAGTAGCG ATATGATCCG CGTATCGGTC GGGATAGAGC ATATCGAGGA CATCAAAGAC GATTTTAAGC AAGCATTTAA TAAAATTGGT TGA
|
Protein sequence | MSKLSSLCRF ETLQVHAGQE PDPTTNARAV PIYQTTSYTF DSVAHGSDLF ALKAFGNIYT RLMNPTTDVL EKRVAALEGG AAALALASGH SAQFIAITNL CQAGDNIVSS SYLYGGTYNQ FKVTFPRLGI KVKIVDGQSP EAFRAAIDEQ TKALYVESIG NPAFHVPDFD ALAELSREYG IPLIVDNTFG CAGYLARPLD HGASIVVESA TKWIGGHGTS MGGVIVDSGT FNWGNGKFPL LSEPSEGYHG LKFYETFGSL AFIIKARVEG LRDIGPAISP FNSFLLLQGL ETLSLRVQRH ADNTLALARW LEKHPSVAWV NYAGLEGHQT WELAKKYLQN GFGCVLTFGI RGGYEKAVGF IESVRLASHL ANVGDAKTLV IHPASTTHQQ LSSGEQESAG VSSDMIRVSV GIEHIEDIKD DFKQAFNKIG
|
| |