Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0998 |
Symbol | |
ID | 4026221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1124377 |
End bp | 1125627 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637966175 |
Product | sarcosine oxidase beta subunit family protein |
Protein accession | YP_573054 |
Protein GI | 92113126 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGTT ATTCAGGCTT CGGCCTCGTC AAGCACGCGT TACGCCACCA CGAGGATTGG CAGCGCCAAT GGCGCAATCC CACGCCCAAG CCGGGGTACG ACGTGATCAT CGTCGGCGGC GGCGGGCACG GTCTGGCCAC GGCGTACTAT CTGGCCAAGG AGCACGGCAT CACCAATGTG GCAGTGCTCG AGAAGGGCTG GCTGGGCGGC GGCAATACTG CGCGCAACAC CACCATCGTG CGTTCCAATT ACCTGTGGGA CGAGTCGGCG GCGCTCTACG AGCATGCGAT GAAGCATTGG GAAGGGCTCT CTCAGGAACT CAACTACAAC GTCATGTTCT CCCAGCGTGG CGTGTTGAAC CTGGGCCATA CCCTGCAGGA CATGCGCGAT ATTCAGCGCC GGGTCAACGC CAACCGACTC AATGGTATCG ACGGTGAGGT GCTGGATGCC CAGGGCGTGC AGTCGCTGGT GCCGATCATG GACTGCTCGA AGCATGCACG ATATCCGGTC ATGGGCGCGT CCTGGCAGCC GCGCGCCGGG GTGGCGCGTC ACGATGCCGT GGCCTGGGGC TATGCCCGCG CCGCCGATGC GCTGGGCGTC GACCTGTTGC AGAACACCGA GGTCACCGGC TTCAAGATTC GCGATGGGCG GATCCTGGGC GTGCATACCA ACCGCGGCGA CATCGAGGCC AAGACCGTGG GCTGCGTCAC GGCAGGCAAC TCCAGCGTGC TGGCCCGCAT GGCCGACCTC AACCTGCCGC TGGAGTCGCA TCCCTTGCAG GCGCTGGTCT CCGAGCCGCT CAAGCCGGTA CTCGATACCG TGGTGATGTC CAATCACGTG CACGGTTACA TCAGCCAGTC CGACAAGGGC GACCTGGTCA TCGGTGCCGG CATCGACGGC TACAACGGCT ACGGCCAGCG GGGCAGCTAT CCCACCGTCG AGCACACCTT GCAGGCCATC GTCGAGATGT TCCCGATCTT CTCCCGGGTG CGCATGAACC GCCAGTGGGG TGGCATCGTC GATACCTGTC CGGATGCCTG TCCGATCCTC TCCAAGACCA AGGTCAAGGG GCTCTACTTC AATTGCGGCT GGGGTACGGG CGGCTTCAAG GCAACGCCGG GCTCGGGGCA TGTCTTCGCG GCCAGTCTGG CCAAGGGCGA GATGCATCCC ATCGCCGCGC CGTTTTCCAT CGACCGCTTC CACAGCGGGG CGTTGATCGA CGAGCACGGC GCCGCTGGCG TCGCGCACTA G
|
Protein sequence | MQRYSGFGLV KHALRHHEDW QRQWRNPTPK PGYDVIIVGG GGHGLATAYY LAKEHGITNV AVLEKGWLGG GNTARNTTIV RSNYLWDESA ALYEHAMKHW EGLSQELNYN VMFSQRGVLN LGHTLQDMRD IQRRVNANRL NGIDGEVLDA QGVQSLVPIM DCSKHARYPV MGASWQPRAG VARHDAVAWG YARAADALGV DLLQNTEVTG FKIRDGRILG VHTNRGDIEA KTVGCVTAGN SSVLARMADL NLPLESHPLQ ALVSEPLKPV LDTVVMSNHV HGYISQSDKG DLVIGAGIDG YNGYGQRGSY PTVEHTLQAI VEMFPIFSRV RMNRQWGGIV DTCPDACPIL SKTKVKGLYF NCGWGTGGFK ATPGSGHVFA ASLAKGEMHP IAAPFSIDRF HSGALIDEHG AAGVAH
|
| |