Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0218 |
Symbol | mdoD |
ID | 4027301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 243982 |
End bp | 245634 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637965369 |
Product | glucan biosynthesis protein D |
Protein accession | YP_572281 |
Protein GI | 92112353 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.451813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCGCA GAACCCTGCT CAAGCGCTCA CTGGCCCTGG CGGCCTTTTA TGGACTGCCC GCCGCGCCCT TGCTGGCGGC GACGCGTCAG ATGCCCGCCA TCGCCGACGG CGAGGCGCAA CGTTTCAGTT TCGACTGGTT GCAGCGTCTG GCCCGGGAAA TGGCCAATGC GCCGTATCGC AACAACATGC GCGAGCTGCC CGAGACCCTG GCGTCTCTCA CGCCCCAGCA GTACAACGCC ATCGGCTACG ATGCCGGGCA CTCGCTGTGG CATGACCTGG ATGGCAAGCT CGACGTGCAG TTCTTCCACG TGGGCATGGG CTTCAACCAA CCCGTGCGCA TGTACGCCCT CGACCCGGAC TCGCACCAGG CGCGCGAGAT TCACTTCGAC CCCGACATCT TTCGCTACGA CGGTGCGAAC GTCGATGTCG CCCAGCTCGA GGGCGAGGAC ACCTTGGGCT TCGCCGGTTT CCGGGTCTTC AAGGTACCCT CGCTGACCGA GCGCGATATC GTCTCCTTTC TCGGCGCGAG TTACTTCCGT GCGGTGGACG ATACCTATCA GTACGGCATC TCGGCACGCG GTCTGGCGGT CAATACCTTC GCCGAGTCGG AAGACGAGGA GTTCCCCACC TATACGCGCT TCTGGCTGGA GACCCCCGCC CCGGACAGCA CCACCTTCAC CGCCTATGCC CTGCTCGATT CGCCCAAGGT GGCCGGCGCC TACCGCTTCG TGATCGACTG CCAGCCGACG CGGGTGGTGA TGGATATCGA AAAGCACCTG TATCCTCGCG AGGCCATCAA GCAGCTGGGC ATTGCTCCGA TGACCAGCAT GTTCAGCTGT GGCACCCACC AGCGCCGCAT GTGCGACACC ATTCATCCGC AGATTCACGA TTCCGACCGT CTCACGATGT GGCGGGGCAA CGGGGAGTGG GTGTGTCGTC CGCTCAACAA TCCGCCGGTC TTGCAGTACA ACGCCTTCGC CGATGAGTCG CCCAAGGGCT TCGGGTTGCT GCAGACCGAG CGCGATTTCG AGGCCTACGA AGACGTGATC GGCAATTATC ACGAACGCCC CAGCCTGTGG ATCGAGCCGC GCAGCGACTG GGGCAAGGGG GAGATCCAAC TGATGGAGAT CCCCACCACC GGGGAAACCA TGGACAACGT GGTGGCCTTC TGGAAACCCG CCGCGCCCGT CGAGCCCGGC GATTCGTTGA CCTTTGCGTA TCGCCAGTAC TGGAGCGCAC TGCCGCCGGT GCGGCCCGAC CTGGCACGCA TTCGCGAGAC GCGCAGCGGC ATGGGCGCCT TTCCCGAAGG CTGGGCCCCG GGGGAGCACT ATCCCGAGGA GTGGGCGCGG CGCTTCGCCG TGGACTACGT GGGCGGCGAT ATCCTGAACA TTGCCAAGAA CGGTCCTCCG ATCATGGCGC AGTTGGAGAT ATCGGGGGGC AGGACCAGCG ATATCCAGAT CTTTCAGGTC GAGGAGTTCG AAGGCATTCG CGTCATTTTC GACTGGTACC CCACGGATGC CTCGACCGAT CCCATCGACA TGCGCATGGT GCTGGAGGGC GACGGCGACC CCCTGAGCGA AACCTGGCTC TATCAGTACT TTCCGCCACC GCCGGAGCAG CGCGAGCATC CGCCGCATCC GCTCGATGAC TGA
|
Protein sequence | MDRRTLLKRS LALAAFYGLP AAPLLAATRQ MPAIADGEAQ RFSFDWLQRL AREMANAPYR NNMRELPETL ASLTPQQYNA IGYDAGHSLW HDLDGKLDVQ FFHVGMGFNQ PVRMYALDPD SHQAREIHFD PDIFRYDGAN VDVAQLEGED TLGFAGFRVF KVPSLTERDI VSFLGASYFR AVDDTYQYGI SARGLAVNTF AESEDEEFPT YTRFWLETPA PDSTTFTAYA LLDSPKVAGA YRFVIDCQPT RVVMDIEKHL YPREAIKQLG IAPMTSMFSC GTHQRRMCDT IHPQIHDSDR LTMWRGNGEW VCRPLNNPPV LQYNAFADES PKGFGLLQTE RDFEAYEDVI GNYHERPSLW IEPRSDWGKG EIQLMEIPTT GETMDNVVAF WKPAAPVEPG DSLTFAYRQY WSALPPVRPD LARIRETRSG MGAFPEGWAP GEHYPEEWAR RFAVDYVGGD ILNIAKNGPP IMAQLEISGG RTSDIQIFQV EEFEGIRVIF DWYPTDASTD PIDMRMVLEG DGDPLSETWL YQYFPPPPEQ REHPPHPLDD
|
| |