Gene Acel_0989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0989 
Symbol 
ID4485932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1088557 
End bp1089771 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content70% 
IMG OID639729764 
Productglycine oxidase ThiO 
Protein accessionYP_872748 
Protein GI117928197 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.136129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGGTG CCGGAGTCAT CGGATTAGCC ACCGCATGGC GATGTGCGCA GCGTGGATTC 
AACGTCACCG TCGTCGACCC GGATCCGGGC CGTGGCGCGT CGCACTACGC GGCGGGCATG
CTTGCCCCGG TGACTGAGGC GCATTTCGGC GAAGAATCTC TGCTGCACCT CACGATCGAG
GCCGCCCGCC GTTATCCCGC TTTCGTTGCC GACCTGCAGG CCGCGGCCGG GATATCCGTT
GGCTACCGGA CCACCGGCAT GCTCGCCGTC GCCTTCGACA ACGACGACCG GGCGGTACTG
GCGGAGTTGC ACGCCTACCA CAACTCTCTC GGCCTGACGA GCACCCTGCT GTCGTCGCGG
GAATGCCGGG ACCGCGAACC TGCGCTGGCG CCGGCCATCC GTGCTGGACT CTGGGTGGAA
GGCGACCATC AGGTGGACAA CCGCCGGCTC GTCCAGGCAC TCCGCGCCGC CTGTGACCGG
GTCGGAGTCC GGTTCCTCCC GACCGAGGCG CACCTTGACG TCCACGGCAA CCGGGTCAGG
GGCGCGAACG GCATTCCGGC CGCCGCGACG GTGCTCGCCG CGGGAGCATG GAGTCCGCAC
GTGGCCGGGC TTCCCGAGGC GGTGCGTCCA CCGGTCCGGC CCGTCAAGGG ACAGATCCTG
CGGTTGCGGG TCGACCCCAA CCGTCCGCTG CTCACGCGGG CGGTCCGAGC CTTTGTCCGC
GGCCGGCCGC TGTACGTCGT CCCGCGGGAA ACCGGCGAGA TTGTGGTCGG CGGCACGGTT
GAGGAAATGG GTTTTGACCA GCGGGTCACG GTCGAGGCGG TCGCTGACCT GCTCGATGAC
GCGCGGCGTC TTGTCCCCGG CCTCGTGGAC GCCGACTTCG TGGAGGCCTC GGCTGGACTC
CGCCCAGGCT CGCCCGATAA CGGTCCCATG GTCGGGCCCA GTGGGGTGGA CGGGCTGGTG
ATCGCGACCG GCCACTATCG CAACGGCATC CTGCTCGCGC CGATCACCGC AGACGCCGTC
GCCGAGCTGC TCGCCACCGG CGCAATCCCG GAGGAGTTCG TCCCCTTCGA CCCACGCCGG
TTCTTTGACC CAGACTGGTC TGCCCGGCAC AGCCATCGCC GCACTGCTCC GGCGGCGAGT
CCCCGCAGTG GGAAAGACGG CACGTCCGCT GAGCCGGATG CCGCTCGCGA CGAGAAGGAA
GCCGCAACAA AATGA
 
Protein sequence
MVGAGVIGLA TAWRCAQRGF NVTVVDPDPG RGASHYAAGM LAPVTEAHFG EESLLHLTIE 
AARRYPAFVA DLQAAAGISV GYRTTGMLAV AFDNDDRAVL AELHAYHNSL GLTSTLLSSR
ECRDREPALA PAIRAGLWVE GDHQVDNRRL VQALRAACDR VGVRFLPTEA HLDVHGNRVR
GANGIPAAAT VLAAGAWSPH VAGLPEAVRP PVRPVKGQIL RLRVDPNRPL LTRAVRAFVR
GRPLYVVPRE TGEIVVGGTV EEMGFDQRVT VEAVADLLDD ARRLVPGLVD ADFVEASAGL
RPGSPDNGPM VGPSGVDGLV IATGHYRNGI LLAPITADAV AELLATGAIP EEFVPFDPRR
FFDPDWSARH SHRRTAPAAS PRSGKDGTSA EPDAARDEKE AATK