Gene Information |
Locus tag | Csal_0494 |
Symbol | |
ID | 4026864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 541123 |
End bp | 542250 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637965653 |
Product | glycine oxidase ThiO |
Protein accession | YP_572555 |
Protein GI | 92112627 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR02352] glycine oxidase ThiO |
|
|
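The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 541123
END_BP = 542250


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1128 375
```

Both values agree with the Gene Length (1128 bp) and Protein Length (375 aa) fields of the record.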
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGATT GCGGGTTTCG CCAGCCAAGA ACGAGTGTCC CTGTGAGTGA ATTCTTGATC GTCGGGGGCG GCGTCATCGG CATGATGACG GCCCTGCAAC TCGCCGATGC CGGCCGCCGC GTCACCCTGC TCGAGCGCGG CGAGTGCGGC CGTGAGGCCT CCTGGGCCGG CGGTGGCATC GTCTCCCCGT TGTATCCCTG GCGCTACAGC GCGCCGATTT CCACGTTGTC GCGTTATTCC GAAGGGGCCT ACCCCGAGTT GTCGCTGCGC CTGCTGGAGG AAACCGGCAT CGACCCCGAA TACCGCCAGC GGGGGCTCTT GTACCTGCGT GTCGACGATG AAGAGCGGGC GCTCGACTGG GCGCGCCAGG AAGGCAAGCC CCTGCAGCGG GTCGGGCCGG AGACGATCTA CGCCAAGGAG CCCAATGCCG CGCCGGGCGT CGAGTCGGCG CTGTGGATGC CGACCCTGGG CAGCATCCGC AATCCGCGCC TGTGTCGCGC GTTGCGTGCC CGCTTGCAGG CGATGCCCAA CGTGGCGCTG CGCGAGCACG TCGATGTCGA GGAACTGGTG GCATCGGCGG GACGCATCCA GGGGGTGCGC ACCTCGGCGG GACACGAAAC GGCCGAATGC GTGGTGGTAT GTGGCGGTGC CTGGGCCAGC CAGCTGCTGG CGAGCGTCGA TGTGGCGCTG CCCGTGCGCC CGGTGCGCGG GCAGATGATC CTGTTCAAGG CGCCGCCGGG GCTGGTCGAG CGCGTGGTGT TGAAAGATGG TCGCTATGTG ATTCCCCGGG GCGACGGGCG TGTCGTCGCC GGTTCCACGC TCGAAGAGGT AGGATTCGAC AAGCGTACCA CCGAGGCCGC CAAGGGCTCT CTCTATGACA GCGCGCTGTC CATCGTGCCG GGGCTGGCCG ACTGCCCGGT GGAACATCAC TGGGCGGGGT TGCGCCCCGG CTCGCCGGAT GGGGTGCCCC GCATCGGTGC GGTACCGGGC GTCGAGGGCC TCTGGGTCAA TGCCGGACAC TATCGCAATG GCCTGGTGCT GGCACCGGCG TCGACGCGTC TGCTGGCCGA TCAGTTGCTG CAACGCACAC CGGTCGTCGA TCCCGCCCCT TATCGACTGG ACACCTGA
|
Protein sequence | MIDCGFRQPR TSVPVSEFLI VGGGVIGMMT ALQLADAGRR VTLLERGECG REASWAGGGI VSPLYPWRYS APISTLSRYS EGAYPELSLR LLEETGIDPE YRQRGLLYLR VDDEERALDW ARQEGKPLQR VGPETIYAKE PNAAPGVESA LWMPTLGSIR NPRLCRALRA RLQAMPNVAL REHVDVEELV ASAGRIQGVR TSAGHETAEC VVVCGGAWAS QLLASVDVAL PVRPVRGQMI LFKAPPGLVE RVVLKDGRYV IPRGDGRVVA GSTLEEVGFD KRTTEAAKGS LYDSALSIVP GLADCPVEHH WAGLRPGSPD GVPRIGAVPG VEGLWVNAGH YRNGLVLAPA STRLLADQLL QRTPVVDPAP YRLDT
|
| |
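The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before it can be compared with the protein sequence. The following is a minimal sketch, using only the first 30 and last 18 bases of the gene as samples. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (table 11 differs mainly in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names are hypothetical.

```python
# Codon table built from the compact layout with bases ordered T, C, A, G.
# Assumption: amino-acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Samples copied from the record: first 30 and last 18 bases of the gene.
head = "ATGATAGATT GCGGGTTTCG CCAGCCAAGA"
tail = "TATCGACTGG ACACCTGA"

print(translate(head))  # MIDCGFRQPR
print(translate(tail))  # YRLDT*
```

The two translated fragments match the first ten and last five residues of the protein sequence, followed by the stop codon TGA. Passing the full gene sequence to `gc_fraction` would allow the GC content field to be checked in the same way.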