Gene Information |
Locus tag | TM1040_0585 |
Symbol | |
ID | 4076150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 624873 |
End bp | 626165 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005882 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_612580 |
Protein GI | 99080426 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.528029 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000697093 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCGACA GCCCGACCTA TGGCTTTGAT ACGCTACAAA TTCACGCTGG CGCCCGCCCT GATCCGGCAA CCGGCGCGCG GCAGACGCCG ATCTATCAAT CCACTGCCTA TGTGTTTCGT GACGCCGACC ACGCGGCGGC GCTCTTTAAC CTGCAGGAAG TAGGCTACAT CTATTCACGC CTCACCAACC CGACCGTTGC GGTCCTGCAA GAGCGTCTCG CCACGCTCGA GGGTGGTGTC GGCGCGGTCT GCTGCTCGTC GGGGCACGCC GCACAGATCA TGGCCCTGTT CCCCCTGATG GGACCCGGCA AGAACGTGGT TGCCTCCACC CGCCTTTACG GGGGCACGGT GACGCAGTTC AGCCAGACCA TCAAACGCTT CGGCTGGTCC GCAAAATTTG TTGATTTTGA CGATCTCGAT GCGCTGCGCG AGGCGATCGA CGATGACACG CGGGCCGTCT TCTGCGAGTC CGTTGCCAAC CCCGGTGGCT ATGTCACCGA CATCCGCGCG GTCGCCGATG TGGCAGATGC CGCAGGCCTG CCCCTGATCG TCGACAACAC CTCTGCAACG CCCTACCTGT GCCAGCCGAT TTCCCAGGGC GCGACCTTGG TAGTGCATTC CACAACCAAA TACCTCACCG GCAATGGTAC CGTAACCGGC GGCTGCATCG TGGATTCGGG CAAGTTCGAC TGGTCCGCCT CCGACAAGTT CCCCTCGCTC TCAGCTCCGG AACCCGCCTA TCACGGGCTC AAGTTCCACG AGACCTTCGG CAATCTTGCC TTCACCTTCC ACGGCATTGC CATCGGGCTG CGTGATCTGG GCATGACCAT GAACCCGCAA GCCGCACATT ACACCCTGAT GGGCATTGAG ACCCTGTCGC TTCGGATGCA GCGCCATGTG GAAAATGCTG TGAAAGTCGC GACTTGGCTC GAAAACGACC CGCGCGTGGA TTACGTGACC TATGCGGGTC TCCCCTCCTC GCCATATGCC GAGCGCGCCC AGCGTTGCTA TCCCAAGGGC ACCGGCGGGC TGTTCACAGT CGCCATCAAA GGCGGCTATG ACGCTTGTGT CAAACTGGTC GACAGCCTCG AGATCTTCTC GCATGTGGCC AATCTTGGCG ACACCCGGTC ACTCATCATC CATTCCGCGT CCACGACTCA CCGCCAGCTT ACGCCCGAAC AGCAAGAAGC AGCGGGCGCT GGCCCCAATG TGGTGCGTGT GTCCATCGGC ATCGAGGACG CAGACGATTT GATCCGCGAT CTCGATCAAG CTCTGGCCAA GGCCTGCGGC TGA
|
Protein sequence | MSDSPTYGFD TLQIHAGARP DPATGARQTP IYQSTAYVFR DADHAAALFN LQEVGYIYSR LTNPTVAVLQ ERLATLEGGV GAVCCSSGHA AQIMALFPLM GPGKNVVAST RLYGGTVTQF SQTIKRFGWS AKFVDFDDLD ALREAIDDDT RAVFCESVAN PGGYVTDIRA VADVADAAGL PLIVDNTSAT PYLCQPISQG ATLVVHSTTK YLTGNGTVTG GCIVDSGKFD WSASDKFPSL SAPEPAYHGL KFHETFGNLA FTFHGIAIGL RDLGMTMNPQ AAHYTLMGIE TLSLRMQRHV ENAVKVATWL ENDPRVDYVT YAGLPSSPYA ERAQRCYPKG TGGLFTVAIK GGYDACVKLV DSLEIFSHVA NLGDTRSLII HSASTTHRQL TPEQQEAAGA GPNVVRVSIG IEDADDLIRD LDQALAKACG
|
| |
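The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene sequence includes its stop codon (the sequence shown ends in TGA); both assumptions are consistent with the reported 1293 bp and 430 aa. The function names are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Values copied from the record above.
    start_bp, end_bp = 624873, 626165
    length = gene_length(start_bp, end_bp)
    print("Gene length (bp):", length)
    print("Protein length (aa):", protein_length(length))
```

`gc_percent` can be applied to the Gene sequence field as-is, since the spaces between the 10-base blocks are stripped before counting.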