Gene Csal_1000 details


Gene Information

Locus tag: Csal_1000
Symbol:
ID: 4026223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1126172
End bp: 1129231
Gene Length: 3060 bp
Protein Length: 1019 aa
Translation table: 11
GC content: 65%
IMG OID: 637966177
Product: sarcosine oxidase alpha subunit family protein
Protein accession: YP_573056
Protein GI: 92113128
COG category:
    [E] Amino acid transport and metabolism
    [O] Posttranslational modification, protein turnover, chaperones
COG ID:
    [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
    [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
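
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1126172
end_bp = 1129231
gene_length_bp = 3060
protein_length_aa = 1019

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, so the protein has
# one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks hold: 1129231 - 1126172 + 1 = 3060 bp, and 3060 / 3 - 1 = 1019 aa.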


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.99192
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGA CAGCATCACA GTCCAACCGC CTGGCGCAAG GCGGCCAGGT CGACCGCACG 
CAGACGTTGA CCTTTACCTT CAATGGCCGC ACATACACCG GCCATCCCGG CGATACTCTG
GCCTCGGCCT TGCTGGCCAA TGGCGTGGAC CTGGTCAATC GCAGCTTCAA GTACTCGCGC
CCGCGCGGCA CCGTCGCCGC CGGTGCGGAA GAACCCAACG CGGTGGTTCA ATTGGGCGCC
ACGGAGGCGA CCCAGGTGCC CAACGTGCGC GCTACCCAGC AGGCGCTTTA CGACGGCCTG
GTCGCGAGTA GCACCAACGG TTGGCCCGAC GCCCAGCGCG ACATCATGCG CTGGGTCGGC
AAGCTGGGCA GTCCGTTCAT GCCGCCGGGG TTCTATTACA AGACCTTCAT GGCACCGGCC
TCGATGTGGA TGACCTACGA AAAATACATC CGCAAGACGG CCGGTCTGGG GCGTAGTCCG
ACACAGCCCG ATCCGGATAT CTACGACCAC ATGCATCGGC ATTGTGACCT GCTCGTCGTC
GGCGCCGGCC CTGCAGGCTT GAGCGCGGCG CTGGCGGCGG CGCGTGCCGG GGCCCGGGTA
ATCCTCGCTG ACGAACAAGA GACGATGGGT GGCTCGCTGC TCGATACGCG CGAGACGCTG
GACGATCAAC CCGCCGCTCA ATGGCTGAAG ACGACATTGG CGGCGCTGGC GGAACACGAC
AACGTCACCT TGCTGCCGCG TACCACCGCG CATGGGTATC ACGATCATCA TTTCGTGACG
CTTCACGAAC GCCGCACCGA GCACCTGGGC GATAGTGCGC CGTCTCAGGA TGGTCATCGC
CAGGCGCGCT CGCGGCTGCA CCGGGTACGT GCCGGCCAGG TAATCCTGGC GACCGGCGCA
CATGAGCGCC CGCTGGTGTA TGCCAACAAC GATGTCCCCG GCAACCTGCT GGCTGGAGCG
GTGTCGACCT ACATTCGCCG CTACGGCGTG GTGCCCGGCC AGAAGCTGGT GCTCTCCACC
AGCAACGATC ACGCGTACCG CGCCGCATTG GATTGGCAGG AGGCCGGGCG CGAGGTGGCG
GCTATCGTCG ATGCCCGCAC CGCGCCGGAG GGCGATCTCG TAACGCAGGC CAAGGCGGCG
GGCATCCGCG TCATCACCGG CAGTGCGGTG ATCGAGGCCA AAGGGGCGCA GCGCGTCTCA
GCGGCACGCG TGGCGGCCAT CGACGTGGAC ACCTTCACGG TCACCGGAAA AGTCGAGACG
CTCGAGTGCG ACACCATCGC CAGCTCCGGC GGCTACAGTC CGGTGATCCA CTTGGCCTCG
CATACCGGCG CGCGTCCGGT ATGGAGCGAT GACATCATCG GTTTCGTACC GACTACCGTG
GCGGGCGTCA CGGCGACGGG CGGTGCCCAT GGCGTCTATC CGCTGGGCGA GGTGCTGGCG
GACGGCGTCG AGGCCGGCGT TAACGCAGCG ACGGCACTGG GCTTCGCTGC CACCGTGAAT
GACGCGGACC TGCCACATGT CGAGCAGCGC CATGAAGGCC CGGCCTGCGC GCTATTCCAG
GTGCCGCATG AGAAAACCAC GCTGCGTGCG CCCAAGCAGT TCGTCGATCT GCAGAATGAC
GTCACCGCCG GCGCCATCGA GGTGGCAACG CTCGAAGGGT TCGAATCCAT CGAGCACGTC
AAGCGCTACA CCGCCATGGG CTTCGGTACC GACCAGGGCA AGCTGGGTAA TATCAATGGT
ATGGCGGTCG CCGCACGCTG CCTGGGGCAG AGCATTCCCG AAACCGGCAC TACGGTATTC
AGGCCCAACT ACACACCAGT GACCTTCGGT GCCGTGGTCG GACGCCACTG CCGCGAGCTC
TTCGATCCCG AGCGCTACAC GCCGATGCAG GCCTGGCACG TTGCTCAGGG CGCCGAATTC
GAAGACGTCG GGCAGTGGAA GCGTCCTTGG TACTATCCTC GCAAGCGCGC CGATGGCGGA
CTCGAAACCA TGGCGGAAGC GGTGGCCCGT GAGTGCCGCG CGGTTCGCGA GGGTGTGGGC
ATTCTCGACG CCTCGACGCT GGGCAAGATC GATATCCAGG GGCCGGATGC TCGCGAGTTC
CTGGGTCGTA TCTACACCAA TAAGTGGCAG AAGCTGGCGC CTGGTCGCGT GCGCTACGGA
CTGATGTGCG GCGACGACGG CATGGTGATG GACGACGGCA CCACCAGCTG CCTGGCCGAG
AATCACTTCC TCATGACCAC CACCACCGGC AACGCCGCGC CGGTGCTGGA ATGGCTCGAA
CTCTGGCACC AGACCGAATG GCCAGAACTC GAGGTGTATT TCAACTCGGT CACCGACCAC
TGGGCGACGA TGACCGTGAC CGGTCCCGAG GCGCGCAAGC TGCTCACCGA CTTGACCGAT
ATCGACCTCG ACCGCGAGGC GTTCAAGTTC ATGGATTGGC GCGAGGGGCA TGTGGCGGGT
GTACCCGCGC GGGTGTTCCG TATTTCCTTC ACCGGGGAGC TGGCCTTCGA GATCAACGTC
CAGGCGCACT ATGCCATGCA CGTCTGGGAA GCGTTATTCG CGCACGGCGA CAAGTACAAC
CTGACGCCTT ACGGCACTGA GACGATGCAT GTGTTGCGTG CCGAGAAGGG CTTCATCATC
GTCGGCCAGG ATACCGATGG GTCGGTGACG CCCGAGGATC TCGGCATGCA CTGGGCCATC
GGCTATGACA AGCCGTTCCC GTGGGTCGGC AAGCGGGCGC TGACGCGCTC CGATACGCGT
CGCGAAGGGC GCAAGCAGCT CGTCGGCCTC AAACCCAAGG ACGCGAGCGT GGTGCTGGAG
GAAGGCGCGC CGGTCGTCTT CGATCCGAAA CACGCCATCC CCATGCCCAT GGCTGGCCAC
GTGACCTCGA GCTACTACAG CCCGACGCTG GAAAGCGGCT TTGCCCTGGC GGTGGTCAAG
GGTGGCCATC AGCGCATGGG GGAGAGCGTC TATCTGCCGA TGGCCGATGG GCGTGTGCAT
GAAGCCGAAA TCGTCGGTAC CCAATTCGTC GATCCCAAGG GAGAGCGCCA GCATGTCTAA
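
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. As a rough sketch (the function names `clean_sequence` and `gc_content` are hypothetical, not from the source database), the whitespace can be stripped and the GC fraction computed like this, for comparison with the GC content field reported in the record:

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example on the first printed row only; paste the full block to check the whole gene.
first_row = "ATGAGCCAGA CAGCATCACA GTCCAACCGC CTGGCGCAAG GCGGCCAGGT CGACCGCACG"
seq = clean_sequence(first_row)
print(len(seq), f"{gc_content(seq):.1%}")
```

The example covers only one row, so its percentage is not expected to equal the gene-wide figure.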
 
Protein sequence
MSQTASQSNR LAQGGQVDRT QTLTFTFNGR TYTGHPGDTL ASALLANGVD LVNRSFKYSR 
PRGTVAAGAE EPNAVVQLGA TEATQVPNVR ATQQALYDGL VASSTNGWPD AQRDIMRWVG
KLGSPFMPPG FYYKTFMAPA SMWMTYEKYI RKTAGLGRSP TQPDPDIYDH MHRHCDLLVV
GAGPAGLSAA LAAARAGARV ILADEQETMG GSLLDTRETL DDQPAAQWLK TTLAALAEHD
NVTLLPRTTA HGYHDHHFVT LHERRTEHLG DSAPSQDGHR QARSRLHRVR AGQVILATGA
HERPLVYANN DVPGNLLAGA VSTYIRRYGV VPGQKLVLST SNDHAYRAAL DWQEAGREVA
AIVDARTAPE GDLVTQAKAA GIRVITGSAV IEAKGAQRVS AARVAAIDVD TFTVTGKVET
LECDTIASSG GYSPVIHLAS HTGARPVWSD DIIGFVPTTV AGVTATGGAH GVYPLGEVLA
DGVEAGVNAA TALGFAATVN DADLPHVEQR HEGPACALFQ VPHEKTTLRA PKQFVDLQND
VTAGAIEVAT LEGFESIEHV KRYTAMGFGT DQGKLGNING MAVAARCLGQ SIPETGTTVF
RPNYTPVTFG AVVGRHCREL FDPERYTPMQ AWHVAQGAEF EDVGQWKRPW YYPRKRADGG
LETMAEAVAR ECRAVREGVG ILDASTLGKI DIQGPDAREF LGRIYTNKWQ KLAPGRVRYG
LMCGDDGMVM DDGTTSCLAE NHFLMTTTTG NAAPVLEWLE LWHQTEWPEL EVYFNSVTDH
WATMTVTGPE ARKLLTDLTD IDLDREAFKF MDWREGHVAG VPARVFRISF TGELAFEINV
QAHYAMHVWE ALFAHGDKYN LTPYGTETMH VLRAEKGFII VGQDTDGSVT PEDLGMHWAI
GYDKPFPWVG KRALTRSDTR REGRKQLVGL KPKDASVVLE EGAPVVFDPK HAIPMPMAGH
VTSSYYSPTL ESGFALAVVK GGHQRMGESV YLPMADGRVH EAEIVGTQFV DPKGERQHV
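
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative and not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), and it assumes the sequence is given 5' to 3' on the coding strand. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAGCCAGA CAGCATCACA GTCCAACCGC"))  # MSQTASQSNR
```

The output matches the first ten residues of the protein sequence. Running `translate` on the full gene sequence should yield all 1019 residues if the record is internally consistent.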