Gene Csal_0998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0998 
Symbol 
ID4026221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1124377 
End bp1125627 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content64% 
IMG OID637966175 
Productsarcosine oxidase beta subunit family protein 
Protein accessionYP_573054 
Protein GI92113126 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGTT ATTCAGGCTT CGGCCTCGTC AAGCACGCGT TACGCCACCA CGAGGATTGG 
CAGCGCCAAT GGCGCAATCC CACGCCCAAG CCGGGGTACG ACGTGATCAT CGTCGGCGGC
GGCGGGCACG GTCTGGCCAC GGCGTACTAT CTGGCCAAGG AGCACGGCAT CACCAATGTG
GCAGTGCTCG AGAAGGGCTG GCTGGGCGGC GGCAATACTG CGCGCAACAC CACCATCGTG
CGTTCCAATT ACCTGTGGGA CGAGTCGGCG GCGCTCTACG AGCATGCGAT GAAGCATTGG
GAAGGGCTCT CTCAGGAACT CAACTACAAC GTCATGTTCT CCCAGCGTGG CGTGTTGAAC
CTGGGCCATA CCCTGCAGGA CATGCGCGAT ATTCAGCGCC GGGTCAACGC CAACCGACTC
AATGGTATCG ACGGTGAGGT GCTGGATGCC CAGGGCGTGC AGTCGCTGGT GCCGATCATG
GACTGCTCGA AGCATGCACG ATATCCGGTC ATGGGCGCGT CCTGGCAGCC GCGCGCCGGG
GTGGCGCGTC ACGATGCCGT GGCCTGGGGC TATGCCCGCG CCGCCGATGC GCTGGGCGTC
GACCTGTTGC AGAACACCGA GGTCACCGGC TTCAAGATTC GCGATGGGCG GATCCTGGGC
GTGCATACCA ACCGCGGCGA CATCGAGGCC AAGACCGTGG GCTGCGTCAC GGCAGGCAAC
TCCAGCGTGC TGGCCCGCAT GGCCGACCTC AACCTGCCGC TGGAGTCGCA TCCCTTGCAG
GCGCTGGTCT CCGAGCCGCT CAAGCCGGTA CTCGATACCG TGGTGATGTC CAATCACGTG
CACGGTTACA TCAGCCAGTC CGACAAGGGC GACCTGGTCA TCGGTGCCGG CATCGACGGC
TACAACGGCT ACGGCCAGCG GGGCAGCTAT CCCACCGTCG AGCACACCTT GCAGGCCATC
GTCGAGATGT TCCCGATCTT CTCCCGGGTG CGCATGAACC GCCAGTGGGG TGGCATCGTC
GATACCTGTC CGGATGCCTG TCCGATCCTC TCCAAGACCA AGGTCAAGGG GCTCTACTTC
AATTGCGGCT GGGGTACGGG CGGCTTCAAG GCAACGCCGG GCTCGGGGCA TGTCTTCGCG
GCCAGTCTGG CCAAGGGCGA GATGCATCCC ATCGCCGCGC CGTTTTCCAT CGACCGCTTC
CACAGCGGGG CGTTGATCGA CGAGCACGGC GCCGCTGGCG TCGCGCACTA G
 
Protein sequence
MQRYSGFGLV KHALRHHEDW QRQWRNPTPK PGYDVIIVGG GGHGLATAYY LAKEHGITNV 
AVLEKGWLGG GNTARNTTIV RSNYLWDESA ALYEHAMKHW EGLSQELNYN VMFSQRGVLN
LGHTLQDMRD IQRRVNANRL NGIDGEVLDA QGVQSLVPIM DCSKHARYPV MGASWQPRAG
VARHDAVAWG YARAADALGV DLLQNTEVTG FKIRDGRILG VHTNRGDIEA KTVGCVTAGN
SSVLARMADL NLPLESHPLQ ALVSEPLKPV LDTVVMSNHV HGYISQSDKG DLVIGAGIDG
YNGYGQRGSY PTVEHTLQAI VEMFPIFSRV RMNRQWGGIV DTCPDACPIL SKTKVKGLYF
NCGWGTGGFK ATPGSGHVFA ASLAKGEMHP IAAPFSIDRF HSGALIDEHG AAGVAH