Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1000 |
Symbol | |
ID | 4026223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1126172 |
End bp | 1129231 |
Gene Length | 3060 bp |
Protein Length | 1019 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637966177 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_573056 |
Protein GI | 92113128 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.99192 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA CAGCATCACA GTCCAACCGC CTGGCGCAAG GCGGCCAGGT CGACCGCACG CAGACGTTGA CCTTTACCTT CAATGGCCGC ACATACACCG GCCATCCCGG CGATACTCTG GCCTCGGCCT TGCTGGCCAA TGGCGTGGAC CTGGTCAATC GCAGCTTCAA GTACTCGCGC CCGCGCGGCA CCGTCGCCGC CGGTGCGGAA GAACCCAACG CGGTGGTTCA ATTGGGCGCC ACGGAGGCGA CCCAGGTGCC CAACGTGCGC GCTACCCAGC AGGCGCTTTA CGACGGCCTG GTCGCGAGTA GCACCAACGG TTGGCCCGAC GCCCAGCGCG ACATCATGCG CTGGGTCGGC AAGCTGGGCA GTCCGTTCAT GCCGCCGGGG TTCTATTACA AGACCTTCAT GGCACCGGCC TCGATGTGGA TGACCTACGA AAAATACATC CGCAAGACGG CCGGTCTGGG GCGTAGTCCG ACACAGCCCG ATCCGGATAT CTACGACCAC ATGCATCGGC ATTGTGACCT GCTCGTCGTC GGCGCCGGCC CTGCAGGCTT GAGCGCGGCG CTGGCGGCGG CGCGTGCCGG GGCCCGGGTA ATCCTCGCTG ACGAACAAGA GACGATGGGT GGCTCGCTGC TCGATACGCG CGAGACGCTG GACGATCAAC CCGCCGCTCA ATGGCTGAAG ACGACATTGG CGGCGCTGGC GGAACACGAC AACGTCACCT TGCTGCCGCG TACCACCGCG CATGGGTATC ACGATCATCA TTTCGTGACG CTTCACGAAC GCCGCACCGA GCACCTGGGC GATAGTGCGC CGTCTCAGGA TGGTCATCGC CAGGCGCGCT CGCGGCTGCA CCGGGTACGT GCCGGCCAGG TAATCCTGGC GACCGGCGCA CATGAGCGCC CGCTGGTGTA TGCCAACAAC GATGTCCCCG GCAACCTGCT GGCTGGAGCG GTGTCGACCT ACATTCGCCG CTACGGCGTG GTGCCCGGCC AGAAGCTGGT GCTCTCCACC AGCAACGATC ACGCGTACCG CGCCGCATTG GATTGGCAGG AGGCCGGGCG CGAGGTGGCG GCTATCGTCG ATGCCCGCAC CGCGCCGGAG GGCGATCTCG TAACGCAGGC CAAGGCGGCG GGCATCCGCG TCATCACCGG CAGTGCGGTG ATCGAGGCCA AAGGGGCGCA GCGCGTCTCA GCGGCACGCG TGGCGGCCAT CGACGTGGAC ACCTTCACGG TCACCGGAAA AGTCGAGACG CTCGAGTGCG ACACCATCGC CAGCTCCGGC GGCTACAGTC CGGTGATCCA CTTGGCCTCG CATACCGGCG CGCGTCCGGT ATGGAGCGAT GACATCATCG GTTTCGTACC GACTACCGTG GCGGGCGTCA CGGCGACGGG CGGTGCCCAT GGCGTCTATC CGCTGGGCGA GGTGCTGGCG GACGGCGTCG AGGCCGGCGT TAACGCAGCG ACGGCACTGG GCTTCGCTGC CACCGTGAAT GACGCGGACC TGCCACATGT CGAGCAGCGC CATGAAGGCC CGGCCTGCGC GCTATTCCAG GTGCCGCATG AGAAAACCAC GCTGCGTGCG CCCAAGCAGT TCGTCGATCT GCAGAATGAC GTCACCGCCG GCGCCATCGA GGTGGCAACG CTCGAAGGGT TCGAATCCAT CGAGCACGTC AAGCGCTACA CCGCCATGGG CTTCGGTACC GACCAGGGCA AGCTGGGTAA TATCAATGGT ATGGCGGTCG CCGCACGCTG CCTGGGGCAG AGCATTCCCG AAACCGGCAC TACGGTATTC AGGCCCAACT ACACACCAGT GACCTTCGGT GCCGTGGTCG GACGCCACTG CCGCGAGCTC TTCGATCCCG AGCGCTACAC GCCGATGCAG GCCTGGCACG TTGCTCAGGG CGCCGAATTC GAAGACGTCG GGCAGTGGAA GCGTCCTTGG TACTATCCTC GCAAGCGCGC CGATGGCGGA CTCGAAACCA TGGCGGAAGC GGTGGCCCGT GAGTGCCGCG CGGTTCGCGA GGGTGTGGGC ATTCTCGACG CCTCGACGCT GGGCAAGATC GATATCCAGG GGCCGGATGC TCGCGAGTTC CTGGGTCGTA TCTACACCAA TAAGTGGCAG AAGCTGGCGC CTGGTCGCGT GCGCTACGGA CTGATGTGCG GCGACGACGG CATGGTGATG GACGACGGCA CCACCAGCTG CCTGGCCGAG AATCACTTCC TCATGACCAC CACCACCGGC AACGCCGCGC CGGTGCTGGA ATGGCTCGAA CTCTGGCACC AGACCGAATG GCCAGAACTC GAGGTGTATT TCAACTCGGT CACCGACCAC TGGGCGACGA TGACCGTGAC CGGTCCCGAG GCGCGCAAGC TGCTCACCGA CTTGACCGAT ATCGACCTCG ACCGCGAGGC GTTCAAGTTC ATGGATTGGC GCGAGGGGCA TGTGGCGGGT GTACCCGCGC GGGTGTTCCG TATTTCCTTC ACCGGGGAGC TGGCCTTCGA GATCAACGTC CAGGCGCACT ATGCCATGCA CGTCTGGGAA GCGTTATTCG CGCACGGCGA CAAGTACAAC CTGACGCCTT ACGGCACTGA GACGATGCAT GTGTTGCGTG CCGAGAAGGG CTTCATCATC GTCGGCCAGG ATACCGATGG GTCGGTGACG CCCGAGGATC TCGGCATGCA CTGGGCCATC GGCTATGACA AGCCGTTCCC GTGGGTCGGC AAGCGGGCGC TGACGCGCTC CGATACGCGT CGCGAAGGGC GCAAGCAGCT CGTCGGCCTC AAACCCAAGG ACGCGAGCGT GGTGCTGGAG GAAGGCGCGC CGGTCGTCTT CGATCCGAAA CACGCCATCC CCATGCCCAT GGCTGGCCAC GTGACCTCGA GCTACTACAG CCCGACGCTG GAAAGCGGCT TTGCCCTGGC GGTGGTCAAG GGTGGCCATC AGCGCATGGG GGAGAGCGTC TATCTGCCGA TGGCCGATGG GCGTGTGCAT GAAGCCGAAA TCGTCGGTAC CCAATTCGTC GATCCCAAGG GAGAGCGCCA GCATGTCTAA
|
Protein sequence | MSQTASQSNR LAQGGQVDRT QTLTFTFNGR TYTGHPGDTL ASALLANGVD LVNRSFKYSR PRGTVAAGAE EPNAVVQLGA TEATQVPNVR ATQQALYDGL VASSTNGWPD AQRDIMRWVG KLGSPFMPPG FYYKTFMAPA SMWMTYEKYI RKTAGLGRSP TQPDPDIYDH MHRHCDLLVV GAGPAGLSAA LAAARAGARV ILADEQETMG GSLLDTRETL DDQPAAQWLK TTLAALAEHD NVTLLPRTTA HGYHDHHFVT LHERRTEHLG DSAPSQDGHR QARSRLHRVR AGQVILATGA HERPLVYANN DVPGNLLAGA VSTYIRRYGV VPGQKLVLST SNDHAYRAAL DWQEAGREVA AIVDARTAPE GDLVTQAKAA GIRVITGSAV IEAKGAQRVS AARVAAIDVD TFTVTGKVET LECDTIASSG GYSPVIHLAS HTGARPVWSD DIIGFVPTTV AGVTATGGAH GVYPLGEVLA DGVEAGVNAA TALGFAATVN DADLPHVEQR HEGPACALFQ VPHEKTTLRA PKQFVDLQND VTAGAIEVAT LEGFESIEHV KRYTAMGFGT DQGKLGNING MAVAARCLGQ SIPETGTTVF RPNYTPVTFG AVVGRHCREL FDPERYTPMQ AWHVAQGAEF EDVGQWKRPW YYPRKRADGG LETMAEAVAR ECRAVREGVG ILDASTLGKI DIQGPDAREF LGRIYTNKWQ KLAPGRVRYG LMCGDDGMVM DDGTTSCLAE NHFLMTTTTG NAAPVLEWLE LWHQTEWPEL EVYFNSVTDH WATMTVTGPE ARKLLTDLTD IDLDREAFKF MDWREGHVAG VPARVFRISF TGELAFEINV QAHYAMHVWE ALFAHGDKYN LTPYGTETMH VLRAEKGFII VGQDTDGSVT PEDLGMHWAI GYDKPFPWVG KRALTRSDTR REGRKQLVGL KPKDASVVLE EGAPVVFDPK HAIPMPMAGH VTSSYYSPTL ESGFALAVVK GGHQRMGESV YLPMADGRVH EAEIVGTQFV DPKGERQHV
|
| |