Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0989 |
Symbol | |
ID | 4485932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 1088557 |
End bp | 1089771 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639729764 |
Product | glycine oxidase ThiO |
Protein accession | YP_872748 |
Protein GI | 117928197 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR02352] glycine oxidase ThiO |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.136129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCGGTG CCGGAGTCAT CGGATTAGCC ACCGCATGGC GATGTGCGCA GCGTGGATTC AACGTCACCG TCGTCGACCC GGATCCGGGC CGTGGCGCGT CGCACTACGC GGCGGGCATG CTTGCCCCGG TGACTGAGGC GCATTTCGGC GAAGAATCTC TGCTGCACCT CACGATCGAG GCCGCCCGCC GTTATCCCGC TTTCGTTGCC GACCTGCAGG CCGCGGCCGG GATATCCGTT GGCTACCGGA CCACCGGCAT GCTCGCCGTC GCCTTCGACA ACGACGACCG GGCGGTACTG GCGGAGTTGC ACGCCTACCA CAACTCTCTC GGCCTGACGA GCACCCTGCT GTCGTCGCGG GAATGCCGGG ACCGCGAACC TGCGCTGGCG CCGGCCATCC GTGCTGGACT CTGGGTGGAA GGCGACCATC AGGTGGACAA CCGCCGGCTC GTCCAGGCAC TCCGCGCCGC CTGTGACCGG GTCGGAGTCC GGTTCCTCCC GACCGAGGCG CACCTTGACG TCCACGGCAA CCGGGTCAGG GGCGCGAACG GCATTCCGGC CGCCGCGACG GTGCTCGCCG CGGGAGCATG GAGTCCGCAC GTGGCCGGGC TTCCCGAGGC GGTGCGTCCA CCGGTCCGGC CCGTCAAGGG ACAGATCCTG CGGTTGCGGG TCGACCCCAA CCGTCCGCTG CTCACGCGGG CGGTCCGAGC CTTTGTCCGC GGCCGGCCGC TGTACGTCGT CCCGCGGGAA ACCGGCGAGA TTGTGGTCGG CGGCACGGTT GAGGAAATGG GTTTTGACCA GCGGGTCACG GTCGAGGCGG TCGCTGACCT GCTCGATGAC GCGCGGCGTC TTGTCCCCGG CCTCGTGGAC GCCGACTTCG TGGAGGCCTC GGCTGGACTC CGCCCAGGCT CGCCCGATAA CGGTCCCATG GTCGGGCCCA GTGGGGTGGA CGGGCTGGTG ATCGCGACCG GCCACTATCG CAACGGCATC CTGCTCGCGC CGATCACCGC AGACGCCGTC GCCGAGCTGC TCGCCACCGG CGCAATCCCG GAGGAGTTCG TCCCCTTCGA CCCACGCCGG TTCTTTGACC CAGACTGGTC TGCCCGGCAC AGCCATCGCC GCACTGCTCC GGCGGCGAGT CCCCGCAGTG GGAAAGACGG CACGTCCGCT GAGCCGGATG CCGCTCGCGA CGAGAAGGAA GCCGCAACAA AATGA
|
Protein sequence | MVGAGVIGLA TAWRCAQRGF NVTVVDPDPG RGASHYAAGM LAPVTEAHFG EESLLHLTIE AARRYPAFVA DLQAAAGISV GYRTTGMLAV AFDNDDRAVL AELHAYHNSL GLTSTLLSSR ECRDREPALA PAIRAGLWVE GDHQVDNRRL VQALRAACDR VGVRFLPTEA HLDVHGNRVR GANGIPAAAT VLAAGAWSPH VAGLPEAVRP PVRPVKGQIL RLRVDPNRPL LTRAVRAFVR GRPLYVVPRE TGEIVVGGTV EEMGFDQRVT VEAVADLLDD ARRLVPGLVD ADFVEASAGL RPGSPDNGPM VGPSGVDGLV IATGHYRNGI LLAPITADAV AELLATGAIP EEFVPFDPRR FFDPDWSARH SHRRTAPAAS PRSGKDGTSA EPDAARDEKE AATK
|
| |