Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6396 |
Symbol | |
ID | 6983468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | + |
Start bp | 44050 |
End bp | 45000 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643399394 |
Product | glycine oxidase ThiO |
Protein accession | YP_002284150 |
Protein GI | 209552235 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR02352] glycine oxidase ThiO |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGTCC TCGTCAAAGG GGCCGGCGTC GCCGGCCTCA CCGTCGCCCG CGAATTGCAT GCCCGTGGTG CCGAGGTGAC AATCTTCGAC CCGCATCAGA ACTTCGCTCA TGCGGCTTCC TGGCTCGCCG GCGGTATGCT GGCGCCCTGG TGCGAGCGGG AAAGCGCCGA TGAGGCCGTG CTGACGCGCG GCCTCGATGC CGCCGACCGC TGGGAAGCGA TCCTGCCGGG CAGCGTCGTC CGCAACGGCA CGCTCGTTGT CGCCTCGACG CGCGACCATA GCGAGCTGAA GCGGTTTGCC AGCCGCACGA CCGGTTACGA ATGGGTGGAA GAGGATGATA TCGCCGCCCT CGAGCCCATG CTCGCCGGCC GTTTCAGGCA CGGCCTGTTC TTCCCACGCG AAGGGCATCT GGATCCGCGT CAGGCGTTGC GCGGATTGAA AGAACAACTG TCGGCAAACG GCGTCACCTT TACCGACACG GATCCGAGCG AAGACGATTT CTCCGATATC GTCGACTGCA CCGGCGCTGC CCGCATCGGG CAAGAGCGCG ACCTGCGCGG CGTGCGGGGC GAAATGCTCT ATCTCCATAC CGATGAGGTC ACCCTCACCC GGCCCGTCCG CCTGCTGCAT CCGCGTTTTC CCGTCTATAT CGTCCCCCGC GGCAACGGGT TTTTCATGAT CGGCGCAACG ATGATCGAGA CCGATGCCGA CGGCCCGATC ACCGCACGTT CGCTGATGGA ACTGCTGAAC ACGGCCTATG CGCTGCACCC GGCCTTTGCC GACGCCGCCG TCGTCGAAAC CGGCGCCGGC ATCCGCCCTT CCCTGCCCGA CAATCTTCCG CGCGTCGTGC GGGAGGGAAA GGCCGTCGTG TTCAACGGCC TCTACCGCCA CGGCTTTCTG CTGGCGCCGA CCATGGCGGC CGAGGCCGCC GATCTCGTTT TTTCCAAATA G
|
Protein sequence | MHVLVKGAGV AGLTVARELH ARGAEVTIFD PHQNFAHAAS WLAGGMLAPW CERESADEAV LTRGLDAADR WEAILPGSVV RNGTLVVAST RDHSELKRFA SRTTGYEWVE EDDIAALEPM LAGRFRHGLF FPREGHLDPR QALRGLKEQL SANGVTFTDT DPSEDDFSDI VDCTGAARIG QERDLRGVRG EMLYLHTDEV TLTRPVRLLH PRFPVYIVPR GNGFFMIGAT MIETDADGPI TARSLMELLN TAYALHPAFA DAAVVETGAG IRPSLPDNLP RVVREGKAVV FNGLYRHGFL LAPTMAAEAA DLVFSK
|
| |