Gene Csal_0494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0494 
Symbol 
ID4026864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp541123 
End bp542250 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content68% 
IMG OID637965653 
Productglycine oxidase ThiO 
Protein accessionYP_572555 
Protein GI92112627 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGATT GCGGGTTTCG CCAGCCAAGA ACGAGTGTCC CTGTGAGTGA ATTCTTGATC 
GTCGGGGGCG GCGTCATCGG CATGATGACG GCCCTGCAAC TCGCCGATGC CGGCCGCCGC
GTCACCCTGC TCGAGCGCGG CGAGTGCGGC CGTGAGGCCT CCTGGGCCGG CGGTGGCATC
GTCTCCCCGT TGTATCCCTG GCGCTACAGC GCGCCGATTT CCACGTTGTC GCGTTATTCC
GAAGGGGCCT ACCCCGAGTT GTCGCTGCGC CTGCTGGAGG AAACCGGCAT CGACCCCGAA
TACCGCCAGC GGGGGCTCTT GTACCTGCGT GTCGACGATG AAGAGCGGGC GCTCGACTGG
GCGCGCCAGG AAGGCAAGCC CCTGCAGCGG GTCGGGCCGG AGACGATCTA CGCCAAGGAG
CCCAATGCCG CGCCGGGCGT CGAGTCGGCG CTGTGGATGC CGACCCTGGG CAGCATCCGC
AATCCGCGCC TGTGTCGCGC GTTGCGTGCC CGCTTGCAGG CGATGCCCAA CGTGGCGCTG
CGCGAGCACG TCGATGTCGA GGAACTGGTG GCATCGGCGG GACGCATCCA GGGGGTGCGC
ACCTCGGCGG GACACGAAAC GGCCGAATGC GTGGTGGTAT GTGGCGGTGC CTGGGCCAGC
CAGCTGCTGG CGAGCGTCGA TGTGGCGCTG CCCGTGCGCC CGGTGCGCGG GCAGATGATC
CTGTTCAAGG CGCCGCCGGG GCTGGTCGAG CGCGTGGTGT TGAAAGATGG TCGCTATGTG
ATTCCCCGGG GCGACGGGCG TGTCGTCGCC GGTTCCACGC TCGAAGAGGT AGGATTCGAC
AAGCGTACCA CCGAGGCCGC CAAGGGCTCT CTCTATGACA GCGCGCTGTC CATCGTGCCG
GGGCTGGCCG ACTGCCCGGT GGAACATCAC TGGGCGGGGT TGCGCCCCGG CTCGCCGGAT
GGGGTGCCCC GCATCGGTGC GGTACCGGGC GTCGAGGGCC TCTGGGTCAA TGCCGGACAC
TATCGCAATG GCCTGGTGCT GGCACCGGCG TCGACGCGTC TGCTGGCCGA TCAGTTGCTG
CAACGCACAC CGGTCGTCGA TCCCGCCCCT TATCGACTGG ACACCTGA
 
Protein sequence
MIDCGFRQPR TSVPVSEFLI VGGGVIGMMT ALQLADAGRR VTLLERGECG REASWAGGGI 
VSPLYPWRYS APISTLSRYS EGAYPELSLR LLEETGIDPE YRQRGLLYLR VDDEERALDW
ARQEGKPLQR VGPETIYAKE PNAAPGVESA LWMPTLGSIR NPRLCRALRA RLQAMPNVAL
REHVDVEELV ASAGRIQGVR TSAGHETAEC VVVCGGAWAS QLLASVDVAL PVRPVRGQMI
LFKAPPGLVE RVVLKDGRYV IPRGDGRVVA GSTLEEVGFD KRTTEAAKGS LYDSALSIVP
GLADCPVEHH WAGLRPGSPD GVPRIGAVPG VEGLWVNAGH YRNGLVLAPA STRLLADQLL
QRTPVVDPAP YRLDT