Gene Information |
Locus tag | Csal_0078 |
Symbol | |
ID | 4027257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 98181 |
End bp | 99122 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637965229 |
Product | polysaccharide deacetylase |
Protein accession | YP_572141 |
Protein GI | 92112213 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | [TIGR03212] putative urate catabolism protein |
|
|
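The coordinate, length, and protein-length fields in the table above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 98181
END_BP = 99122
GENE_LENGTH_BP = 942
PROTEIN_LENGTH_AA = 313


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Both checks agree with the record: 99122 - 98181 + 1 = 942 bp, and 942 / 3 - 1 = 313 aa.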
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATGG CATCCTCCAC CGATCCGTCC GAGGTTTACC CACGCGATCT CGTCGGCTAT GGCCGCACGC CACCGCAAGC GAACTGGCCG GGCCAGGCCC GCGTCGCCGT GCAGTTCGTC CTCAACTACG AAGAGGGTGG CGAGAACAGC GTGCTGCACG GCGATAGCCA CTCCGAACAG TTTCTCTCCG AGATCGCCGG GGCCGAGGCC TATCCCGACC GCCACCTGAG CATGGAGTCG ATCTACGAAT ACGGTTCGCG GGCGGGGGTG TGGCGCGTGC TGCGCGAGTT CGAGCGCCGC GGGTTGCCGT TGACGGTATT CGGCGTTGCC ATGGCGCTGG AGCGTCATCC CGAGGTGGCC CAGGCCTTCC AGGAACTGGG CCACGAGATC GCCTGCCATG GCTGGCGCTG GATTCACTAC CAGAACGTGC CGGAAGCACT CGAACGCGAC CACATGCAGC GTGCCATCGA GGTGTTCCGC CGCTTGTACG GCGAGGCGCC CCTGGGGTGG TACACCGGTC GCGACAGCCC CAATACACGG CGGCTGTTGC TCGATCAGGG CGGCTTCCTC TACGACAGCG ACTACTACGG TGACGATCTG CCGTTCTGGA GCGACGTCCA GGACAGTCAG GGCCAGACGC ACCGCCACCT GATCGTGCCC TACACGCTGG ATACCAACGA CATGCGCTTC GCCTCGCCCA CCGGCTTCGA CCACGGCGAG CCCTTCTTCC AGTACCTGCG CGATGCCTTC GACGTGCTCT ACGCAGAAGG CGCGGAAACA CCGAAGATGC TCTCCATTGG TCTGCATTGC CGACTGATCG GCCGCCCCGG ACGCTTCCGC GCGCTGCAGC GCTTCCTCGA TCACCTCGAG ACCCATGACC GGGTGTGGAT AACCCGGCGT GTGGATATCG CGCGTCACTG GGCGGCAACG CACCCGGCTT GA
|
Protein sequence | MGMASSTDPS EVYPRDLVGY GRTPPQANWP GQARVAVQFV LNYEEGGENS VLHGDSHSEQ FLSEIAGAEA YPDRHLSMES IYEYGSRAGV WRVLREFERR GLPLTVFGVA MALERHPEVA QAFQELGHEI ACHGWRWIHY QNVPEALERD HMQRAIEVFR RLYGEAPLGW YTGRDSPNTR RLLLDQGGFL YDSDYYGDDL PFWSDVQDSQ GQTHRHLIVP YTLDTNDMRF ASPTGFDHGE PFFQYLRDAF DVLYAEGAET PKMLSIGLHC RLIGRPGRFR ALQRFLDHLE THDRVWITRR VDIARHWAAT HPA
|
| |
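The sequences above are displayed in space-separated blocks of 10 characters. As a rough sketch of how the gene sequence could be checked against the GC content and protein sequence fields, the code below strips the spacing, computes the GC percentage, and translates codons until the first stop. The helper names are hypothetical. The codon table is the standard one, on the assumption that translation table 11 differs from it only in the set of permitted start codons; this gene starts with ATG, so that difference does not matter here. Ambiguous bases such as N are not handled and would raise a KeyError.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = "ATGGGCATGG CATCCTCCAC CGATCCGTCC"
print(translate(prefix))  # MGMASSTDPS
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spacing with `clean`) would be the complete version of this check.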