Gene Rcas_3393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3393 
SymbolsolA 
ID5540892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4423906 
End bp4425051 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content64% 
IMG OID640895511 
ProductN-methyltryptophan oxidase 
Protein accessionYP_001433461 
Protein GI156743332 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01377] sarcosine oxidase, monomeric form 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGATG TCATTGTCAT CGGATTAGGC GGGATGGGCA GCGCCGCAGC ATATCATCTG 
GCGCGGCGTG GCTGGCAGGT GATCGGGCTG GAACGTTTCA CACCCGCGCA TAACCGTGGA
TCGAGCCATG GCAGATCGCG GATCATTCGC CAGGCGTATT TCGAGGACCC TGCGTATGTG
CCTTTACTGC TCCGCGCGTA TGAATTGTGG GAAGACCTTC AGCGCACAAG CAGCGAACCG
CTGTTGACGA TCACCGGCGG TCTAATGATT GGTCGAGCGG AGAGCAGCGT CGTGCGCGGC
GCACTGCACA GCGCCCAAAT GCACCACCTG CCTCACGAAC TACTCGATGC CGCCGACATT
CGTCGCCGTT TCCCGCCGTT CAATGTTGGC GACGATGAGG TCGCGCTGTA CGAAGCGCGC
GCCGGTTTTC TCGATCCCGA AGCGACTGTT CGGGCGCACC TCGACCAGGC GGCGCGCCAT
GGCGCCGATC TGCACTTCGA TGAGCCGGTC ACTGCGTGGG AGTCGACCCC TGGCGGCGGC
GTGCGTGTCA CCACGCCGGC GGGAGTCTAC GAAGCCGAAC GCGCCGTGAT TGCGCCGGGC
GCATGGGCGC CGCGCCTGCT CGCCGATCTG TCGCTGCCGC TGACCGTCGA GCGTCAGGTC
CTCTACTGGT TCGAGCCGGT TGGAGGGCGT GAGCCATTCA GCATCGGGCG ATTCCCCATT
TATATCTGGG AAGACGCGCG TGGTGACGCA CTCTACGGCT TTCCGGCACA GGGCGGACCG
CCGGGCGGCG TCAAGGTCGC CTTCTTCTAC CGCGGGCATC CGACCGACCC GGATCGGGTG
GATCGCTCAG TGCACCCCGA GGAGATCGCC GAAATGCGCA CCGCTCTGGC GCAGCGCATT
CCTGCTCTGA ACGGTCCGCT CGTGGCAACG GCCACCTGTC TCTACACCCT TACGCCTGAT
CACCACTTCA TCATTGCGCC ACACCCGCGT GCGCCGCAGG TCATCATCGC ATCGCCCTGT
TCGGGTCATG GGTACAAATT CGCCAGTGTG ATCGGCGAAA TCCTGGCAGA CCTTGCAATT
GACGGCAGCA CCCGCCACTC GATTGCGCTC TTCGATCCGG CGCGGTTCAG AGCAACGGAC
GCATAG
 
Protein sequence
MGDVIVIGLG GMGSAAAYHL ARRGWQVIGL ERFTPAHNRG SSHGRSRIIR QAYFEDPAYV 
PLLLRAYELW EDLQRTSSEP LLTITGGLMI GRAESSVVRG ALHSAQMHHL PHELLDAADI
RRRFPPFNVG DDEVALYEAR AGFLDPEATV RAHLDQAARH GADLHFDEPV TAWESTPGGG
VRVTTPAGVY EAERAVIAPG AWAPRLLADL SLPLTVERQV LYWFEPVGGR EPFSIGRFPI
YIWEDARGDA LYGFPAQGGP PGGVKVAFFY RGHPTDPDRV DRSVHPEEIA EMRTALAQRI
PALNGPLVAT ATCLYTLTPD HHFIIAPHPR APQVIIASPC SGHGYKFASV IGEILADLAI
DGSTRHSIAL FDPARFRATD A