Gene Information |
Locus tag | RPB_1958 |
Symbol | |
ID | 3908037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2224451 |
End bp | 2225467 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637883852 |
Product | glycine oxidase ThiO |
Protein accession | YP_485577 |
Protein GI | 86749081 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR02352] glycine oxidase ThiO |
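The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that Protein Length excludes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (RPB_1958).
length = gene_length(2224451, 2225467)
print(length)                  # 1017
print(protein_length(length))  # 338
```

Under these assumptions both results agree with the Gene Length (1017 bp) and Protein Length (338 aa) fields of the record.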
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0655991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGCGGCA CGCCGCCGGA CCTGCGCAAG GACGCGCCGG TCTCGGTGAT CGGCGCCGGC ATCGCCGGGG CCTGGCAGGC GCTGCTGCTG GCGCGCGCCG GCCGCAACGT CACGCTGTAC GAGCGCGGCG ACCGCGAAAT GACCCAGGCC ACCAGCCATT GGGCCGGCGG CATGCTGGCG CCGTATTGCG AGGCCGAGAC CGCCGAACCG ATGGTCGGGC TGATGGGCCT GCGCTCGCTG GAGATGTGGC GCAAGGAATT CCCCGAGACC GCGTTCAACG GCTCGCTGGT GGTGGCGCAT GCGCGCGACC GCGCCGATTT CGAGCGCTTC GCCAAGATGA CCGCGGGCCA CAAGCGGCTC GACGCCGACG GCGTCGCCGA GCTGGAACCG GCGCTGGCCG GCCGCTTCCG CGAGGGCCTG TACTTCCCCG ACGAGGGCCA TGTCGAGCCG CGGCTGGTGC TGGCGCGGCT GCACGAGCGG CTGGTCGAGG CCGGCGGCGC GATCCATTTC GAATCGGAGA TGACGCCGGA GGAACTCGAC GGCCTGGTGA TCGATTGCCG CGGCCTCGCC GCGCGCGACA AGGCGCCGGA ACTGCGCGGC GTCAAAGGCG AGATGGTGGT GATCAAGACC ACCGAGGTGA CGCTGTCGCG GCCGGTCCGC TTGATGCATC CGCGCTGGCC GCTCTACGTC ATCCCGCGCG AAGACAATCA CTTCATGCTG GGCGCCACCT CGATCGAGAG CGAGGACGAA CTCGTCACCG TGCGCTCGGC GCTGGAACTG CTCAGCGCCG CCTATGCGGT GCATCCGGCG TTCGGCGAGG CGCATATCGT CGAGATCGGC GCCGGCCTCC GCCCGGCCTT CCCCGACAAT CTGCCGCGCA TTTCCATCGG CAACCGCCGC ATCGCCACCA ACGGCCTGTA CCGCCACGGC TTCCTGCTGG CGCCGGCGCT GGCCGAGAAG ATGCTGGCCT ATGTCGAGCG CGGCGTCGTC GACAATCAGG TGATGCGATG CTTGTGA
Protein sequence | MRGTPPDLRK DAPVSVIGAG IAGAWQALLL ARAGRNVTLY ERGDREMTQA TSHWAGGMLA PYCEAETAEP MVGLMGLRSL EMWRKEFPET AFNGSLVVAH ARDRADFERF AKMTAGHKRL DADGVAELEP ALAGRFREGL YFPDEGHVEP RLVLARLHER LVEAGGAIHF ESEMTPEELD GLVIDCRGLA ARDKAPELRG VKGEMVVIKT TEVTLSRPVR LMHPRWPLYV IPREDNHFML GATSIESEDE LVTVRSALEL LSAAYAVHPA FGEAHIVEIG AGLRPAFPDN LPRISIGNRR IATNGLYRHG FLLAPALAEK MLAYVERGVV DNQVMRCL
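The gene sequence is listed in coding orientation: it begins with TTG, which translation table 11 accepts as a start codon, and ends with the stop codon TGA. The sketch below shows how the protein sequence and the GC content could be recomputed from it. It is an illustration under stated assumptions, not the pipeline that produced this record: the codon table is built from the standard amino acid assignment (which table 11 shares), the start codon set is the one NCBI lists for table 11, and the names `translate_cds` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignment in NCBI order (bases T, C, A, G); table 11
# uses the same assignment and differs only in its set of start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons of translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding-strand CDS; spaces between 10-base blocks are ignored."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = "".join(seq.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("TTGCGCGGCA CGCCGCCGGA CCTGCGCAAG"))  # MRGTPPDLRK
```

Passing the full gene sequence to `translate_cds` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.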