Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3650 |
Symbol | |
ID | 4075079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 705749 |
End bp | 706723 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005170 |
Product | glycine oxidase ThiO |
Protein accession | YP_611879 |
Protein GI | 99078621 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR02352] glycine oxidase ThiO |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCTCT ATTCTGTCAT CGGAGCGGGT GTCGCTGGGC TCGCAGTTGC CACGGAACTG GTAGCGCGCG GCGCAAGGGT GCAGGTCTTT GACCCCGCCG GTCCGCCCGG CGCCCATGGG TGCTCGTGGT GGGCAGGGGG CATGCTTGCC CCTTGGTGCG AATACGAAAA CGCCGAAGAG CCGGTCTTGC GGCTGGGGCA AGAGGCAATC CAGTGGTGGC AGGACAGAAC CCAAGTCACC CATCGCGGCA CTTTGGTGGT GGCCGGGCGG CGCGACATTC CGGATCTGCG CCGCTTTGCT CGCCGCACCG AAGGGTTTCG TCAGATCGAC CACGACATCA CGGAACTGGA ACCCGATCTC GTTGGCTTTT CGCAGGCCCT CTTTTTTGAA GAAGAGGCTC ATCTGGACCC GCGCCGGGCG CTTGCGGATC TCTATCAGAG GCTAATGCAA GAGGGCGTGG TGTTTCATGC TGAGTGCGCG CCGGACAATC TTGAAAATGT GATTGATTGC AGAGGTTTGC AGGCGCGCGA TTGCCTGAAG GATCTACGCG GTGTTAAGGG TGAAATGCTG GTCATCCGCT GCCCGGATGT GACGCTGACG CGACCTGTGC GACTGCTGCA CCCGCGGATG CCTCTCTACG TTGTGCCGCG TGGAGACGGC CTCTATATGC TCGGCGCAAC CATGATCGAG AGCGAAGACC GCGCCCGGAT TACCGCGCGC TCGATGCTTG AGTTGCTCAG CGCTGCCTAT GCGCTGAACC CGGGCTTTGG CGAGGCGGAG ATCCTCGAGA TCGGCGTCGA TCTGCGCCCT GCATTTCCCG ACAACCTGCC ACGGATCCGC CGGTTGAAGG GGCGGATCTA TGCCAATGGG CTTTATCGGC ACGGGTATCT TCTTGCGCCC GCGTTGGCGC GTGGCGTGGC GGATCTGGTG CTCAACAACA TACATTCGGA GATGGTTGAT GAAGATCACT GTTAA
|
Protein sequence | MMLYSVIGAG VAGLAVATEL VARGARVQVF DPAGPPGAHG CSWWAGGMLA PWCEYENAEE PVLRLGQEAI QWWQDRTQVT HRGTLVVAGR RDIPDLRRFA RRTEGFRQID HDITELEPDL VGFSQALFFE EEAHLDPRRA LADLYQRLMQ EGVVFHAECA PDNLENVIDC RGLQARDCLK DLRGVKGEML VIRCPDVTLT RPVRLLHPRM PLYVVPRGDG LYMLGATMIE SEDRARITAR SMLELLSAAY ALNPGFGEAE ILEIGVDLRP AFPDNLPRIR RLKGRIYANG LYRHGYLLAP ALARGVADLV LNNIHSEMVD EDHC
|
| |