Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A2075 |
Symbol | |
ID | 3835501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 2401827 |
End bp | 2403665 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637826176 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_427162 |
Protein GI | 83593410 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1832] Predicted CoA-binding protein [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.313252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACG CCCTTTCCTA TCCCGATGCC CTGCTCCGCC GCATTCTTGG CGAAACCCGG ACCATCGCCA TGGTCGGGGC CTCGGCCAAT TGGAACCGTC CAAGCTTCTT CGTGATGAAG TATCTGCAAT CAAGGGGATT TCGGGTGATC CCGGTCAACC CCGGCCTCGC CGGCCAGCAG ATCCTGGGCG AGACGGTCTA TGCCCGGTTG GCCGATGTTC CCGCCCCCTT CGAGATGGTC GACATCTTCC GCAATTCGGA CGCCGCCGGG GCGATCCTTG ACGAGGCCGT CGCCCTGCAA GCCGACAAGG GCATCCGCAC CGTCTGGATG CAGTTGTCCG TGCGCAACGA CGCCGCGGCC GAACGCGGCG CCGCCGCTGG GCTTGAGGTG ATCATGGACC GCTGCCCCAA GATCGAATTC GGCCGCCTGG GCGGCGAGTT ATCCTGGCAA GGGGTGAATT CGGGGATCAT TTCCAGCAAG GTGCGCCGGG CGCCCGGCCG CGAAACCGCC CAAGACCGCC GCCAATCGCC GCCGCCAGGG CAAGATCCCG CCGACAAGCC GGCGCCCTTC GGCTTTGAAA CCCGCGCCGT TCACGCCGGC GCCGCCCCCG ATCCAACCAC CGGCGCCCGC GCCACGCCGA TCTATCAAAC CACCTCTTAC GTGTTCGAAG ACACCGATCA GGCGGCCGCG TTGTTCAACC TGCATACCTT CGGCTATCTC TATTCGCGAC TGACCAATCC GACGGTCAGC GTGTTGGAGG AACGCATCGC CAGCCTGGAA GGCGGACGGG CGGCGGTCTG TTGCGCCTCG GGCCATGCGG CGCAGTTCCT GACTTTTTTC ACCCTGCTTG AACCGGGCGA TCATTTCGTC GCCTCGCGCG CCCTCTATGG CGGATCGCTG ACCCAGTTCG GCCAGTCGTT CCGCAAGCTG GGCTGGGAGT GCAGCTTCGT TGATCCCACC GATATCGACG CCTTCCGCGC CGCCATTGGT CCGCGCACCA AGGCGATCTT CCTGGAACTG CTGGCCAATC CGGGGGGGGT GATCGTCGAT GTCGAACAGG TCGCCCGCGT CGCCCAGCAG GCGGGCATTC CGCTGATCGT CGACAACACG CTGGCCACGC CCTATCTGTG CCGCCCCTTC GACTGGGGCG CCGATCTGGT GGTTCATTCG ACCACCAAGT TCCTGTCGGG ACACGGCAAT GCGGTTGGCG GCGCGGTCGT CGAAAGCGGC CGTTTCAACT GGTCGGCCAG CGATAAATTC CCCGGGCTGT CGCAGCCCGA ACCGGCCTAT CACGGAATGA CCTTCCACGA GACCTTTGGC GATTTCGCCT TCACCACCAA GGCCCGCGCC GTCGCCCTGC GCGATTTCGG TCCGGCGATG GCGCCGCAAA ACGCCTTTCT GACCCTCACC GGCATCGAGA CCCTGCCGCT GCGCATGGAT CGCCACGTGG AAAACGCCAA AAAAGTCGCG GCCTTCCTCG CCGCCCATCC CAAGGTGTCG TGGGTCAGCT ATGCCGGACT GCCCGACAGC CCCTTCCACG GGCTTGCCGG CAAATATCTG CCCAAGGGCC CCGGAGCGGT GTTCACCTTC GGGGTCGTCG GCGGCTTCGA AGCCGGCAAG AAGGTGGTCG AGGGAGTGCG GCTGTTCAGC CATCTCGCCA ATGTCGGCGA TACCCGCTCG CTGATCCTCC ACCCCGCCAG CACCACCCAT CGCCAGCTTT CCGACGACCA ACGCCAAGCC GCCGGGGCCG GCGACGAGGT GATCCGCCTG TCGGTGGGCA TCGAGACCGC CGCCGATCTG ATCGCCGACC TTGACCAGGC CCTCGCCCTG ATCGCGTGA
|
Protein sequence | MTDALSYPDA LLRRILGETR TIAMVGASAN WNRPSFFVMK YLQSRGFRVI PVNPGLAGQQ ILGETVYARL ADVPAPFEMV DIFRNSDAAG AILDEAVALQ ADKGIRTVWM QLSVRNDAAA ERGAAAGLEV IMDRCPKIEF GRLGGELSWQ GVNSGIISSK VRRAPGRETA QDRRQSPPPG QDPADKPAPF GFETRAVHAG AAPDPTTGAR ATPIYQTTSY VFEDTDQAAA LFNLHTFGYL YSRLTNPTVS VLEERIASLE GGRAAVCCAS GHAAQFLTFF TLLEPGDHFV ASRALYGGSL TQFGQSFRKL GWECSFVDPT DIDAFRAAIG PRTKAIFLEL LANPGGVIVD VEQVARVAQQ AGIPLIVDNT LATPYLCRPF DWGADLVVHS TTKFLSGHGN AVGGAVVESG RFNWSASDKF PGLSQPEPAY HGMTFHETFG DFAFTTKARA VALRDFGPAM APQNAFLTLT GIETLPLRMD RHVENAKKVA AFLAAHPKVS WVSYAGLPDS PFHGLAGKYL PKGPGAVFTF GVVGGFEAGK KVVEGVRLFS HLANVGDTRS LILHPASTTH RQLSDDQRQA AGAGDEVIRL SVGIETAADL IADLDQALAL IA
|
| |