Gene Information |
Locus tag | Smed_4618 |
Symbol | |
ID | 5318929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1118904 |
End bp | 1120319 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776417 |
Product | chitin deacetylase |
Protein accession | YP_001313349 |
Protein GI | 150376753 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase; [COG3195] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03164] OHCU decarboxylase; [TIGR03212] putative urate catabolism protein |
| |
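The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1118904
end_bp = 1120319

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; assumption: the final codon is a stop
# codon and is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1416, matching "Gene Length"
print(protein_length)   # 471, matching "Protein Length"
```

Under these assumptions the 1416 bp gene yields 472 codons, of which 471 encode amino acids.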
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.253154 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATATC CCCGCGATCT CCTGGGCCAT GGTCCGAACC CGCATATTGC CTGGCCCGAT GGCGCCCGGA TCGCCGTGCA ATTCGTGATC AATTACGAGG AGGGCGGAGA GAATTGCGTG CTTCACGGCG ACGCCGCCTC GGAGGCCTTT CTCTCGGAAA TCGTCGGCGC TCAGGCCTGG CCTGCCCAGC GCCACTGGAA TATGGAATCG ATCTACGAAT ATGGTGCGCG CGCCGGCTTC TGGCGGCTGC ACCGCCTTTT CACCGAGAGA CAAATAGCGG CCACCGTCTA TGGCGTCGCC ACTGCCCTCA AGCGCTCGCC CGCCCAGGTG GCGGCCATGC AGGAAGCCGG CTGGGAGATC GCCTCCCACG GCCTCAAATG GATCGAGCAC AAGGATTTCG ACGCCGAGCG CGAGCGCGCC GAGATCGCCG AGGCGATCCG CCTCCACACG ATCGTAACCG GAAAGGGGCC GACCGGCTGG TACACGGGGC GCTGCTCCGT GAACACGCTC GACCTCGTGA CCGAAGCCGG CGGCTTCGAC TACGTCTCTG ATTCCTACGC CGACGACCTG CCCTACTGGC ATGAGCATGC CGGCCGGCAT CAGCTCGTCA TTCCATACAC CCTCGATGCC AACGACATGC GCTTTGCGAC CCCGCAAGGT TTCAACAGCG GCGATCAGTT CTTCAGCTAT CTGAAGGACA GCTTCGACGT TCTCTATGCC GAGGGTACCG CCGGCGCACC GAAGATGATG AGCATCGGCC TCCATTGCCG TCTAGCCGGC CGACCCGGCC GCGCGGCCGC ACTGGCGCGC TTCCTCGATT ACGTGAAGGG CCACGAGAAA GTCTGGGTCG CACGCCGCAT CGACATTGCC CGCTACTGGG CAGAGGCCTA TCCGTTCCGG CCGAACGAAA ACCGACCGTC GCGGCTCTCG AAGGATGACT TCATTTCTCG ATTCGGTGGA GTTTTCGAGC ACTCGGACTG GATCGCCAGA CGCGCCTTTG CCGGTGAACT CGCACCGGCT AACGATACGG CATCGGGACT GCATGCGGCC CTTTGTGCGG TCTTCCGTGA AGCGAGCGAA GAGGAACGGC TGGCGGTCCT GAACGCCCAC CCGGATCTTG CCGGCAAGCT CGCCCAGGCG AAGCGGCTGA CAGAGAGCTC GACTTCGGAG CAGGCCTCCG CGGGGCTCGA CGCGCTGACC GACGAGGAGC GCGAGCGTTT CACTGCGCTC AACGATGCCT ATGTCGAGAA ATTCGGCTTT CCTTTCATCA TGGCCGTCAA GGGGCGCAGC AAGGACGAGA TCCTCGCGGC CTTCGAGACC CGCATCGGCA ATGATGGAAA AGCCGAATTC AATACGGCAT GCCTCCAGGT AGAACGGATC GCGTTGTTGC GGCTCCGCGA AATGCTGCCG GAGTGA
|
Protein sequence | MRYPRDLLGH GPNPHIAWPD GARIAVQFVI NYEEGGENCV LHGDAASEAF LSEIVGAQAW PAQRHWNMES IYEYGARAGF WRLHRLFTER QIAATVYGVA TALKRSPAQV AAMQEAGWEI ASHGLKWIEH KDFDAERERA EIAEAIRLHT IVTGKGPTGW YTGRCSVNTL DLVTEAGGFD YVSDSYADDL PYWHEHAGRH QLVIPYTLDA NDMRFATPQG FNSGDQFFSY LKDSFDVLYA EGTAGAPKMM SIGLHCRLAG RPGRAAALAR FLDYVKGHEK VWVARRIDIA RYWAEAYPFR PNENRPSRLS KDDFISRFGG VFEHSDWIAR RAFAGELAPA NDTASGLHAA LCAVFREASE EERLAVLNAH PDLAGKLAQA KRLTESSTSE QASAGLDALT DEERERFTAL NDAYVEKFGF PFIMAVKGRS KDEILAAFET RIGNDGKAEF NTACLQVERI ALLRLREMLP E
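The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes that translation table 11 uses the same amino-acid assignments as the standard genetic code (differing only in permitted start codons), and that the gene sequence is listed in coding orientation even though the gene lies on the minus strand. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; last base varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and last block of the gene sequence listed above.
first_blocks = "ATGAGATATC CCCGCGATCT CCTGGGCCAT"
last_block = "GAGTGA"

print(translate(first_blocks))  # MRYPRDLLGH, the first protein block
print(translate(last_block))    # E*, the final residue followed by the stop
print(gc_fraction(first_blocks))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the listed protein sequence and the 63% GC content to be checked against the record.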