Gene Csal_0218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0218 
SymbolmdoD 
ID4027301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp243982 
End bp245634 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content64% 
IMG OID637965369 
Productglucan biosynthesis protein D 
Protein accessionYP_572281 
Protein GI92112353 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.451813 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCGCA GAACCCTGCT CAAGCGCTCA CTGGCCCTGG CGGCCTTTTA TGGACTGCCC 
GCCGCGCCCT TGCTGGCGGC GACGCGTCAG ATGCCCGCCA TCGCCGACGG CGAGGCGCAA
CGTTTCAGTT TCGACTGGTT GCAGCGTCTG GCCCGGGAAA TGGCCAATGC GCCGTATCGC
AACAACATGC GCGAGCTGCC CGAGACCCTG GCGTCTCTCA CGCCCCAGCA GTACAACGCC
ATCGGCTACG ATGCCGGGCA CTCGCTGTGG CATGACCTGG ATGGCAAGCT CGACGTGCAG
TTCTTCCACG TGGGCATGGG CTTCAACCAA CCCGTGCGCA TGTACGCCCT CGACCCGGAC
TCGCACCAGG CGCGCGAGAT TCACTTCGAC CCCGACATCT TTCGCTACGA CGGTGCGAAC
GTCGATGTCG CCCAGCTCGA GGGCGAGGAC ACCTTGGGCT TCGCCGGTTT CCGGGTCTTC
AAGGTACCCT CGCTGACCGA GCGCGATATC GTCTCCTTTC TCGGCGCGAG TTACTTCCGT
GCGGTGGACG ATACCTATCA GTACGGCATC TCGGCACGCG GTCTGGCGGT CAATACCTTC
GCCGAGTCGG AAGACGAGGA GTTCCCCACC TATACGCGCT TCTGGCTGGA GACCCCCGCC
CCGGACAGCA CCACCTTCAC CGCCTATGCC CTGCTCGATT CGCCCAAGGT GGCCGGCGCC
TACCGCTTCG TGATCGACTG CCAGCCGACG CGGGTGGTGA TGGATATCGA AAAGCACCTG
TATCCTCGCG AGGCCATCAA GCAGCTGGGC ATTGCTCCGA TGACCAGCAT GTTCAGCTGT
GGCACCCACC AGCGCCGCAT GTGCGACACC ATTCATCCGC AGATTCACGA TTCCGACCGT
CTCACGATGT GGCGGGGCAA CGGGGAGTGG GTGTGTCGTC CGCTCAACAA TCCGCCGGTC
TTGCAGTACA ACGCCTTCGC CGATGAGTCG CCCAAGGGCT TCGGGTTGCT GCAGACCGAG
CGCGATTTCG AGGCCTACGA AGACGTGATC GGCAATTATC ACGAACGCCC CAGCCTGTGG
ATCGAGCCGC GCAGCGACTG GGGCAAGGGG GAGATCCAAC TGATGGAGAT CCCCACCACC
GGGGAAACCA TGGACAACGT GGTGGCCTTC TGGAAACCCG CCGCGCCCGT CGAGCCCGGC
GATTCGTTGA CCTTTGCGTA TCGCCAGTAC TGGAGCGCAC TGCCGCCGGT GCGGCCCGAC
CTGGCACGCA TTCGCGAGAC GCGCAGCGGC ATGGGCGCCT TTCCCGAAGG CTGGGCCCCG
GGGGAGCACT ATCCCGAGGA GTGGGCGCGG CGCTTCGCCG TGGACTACGT GGGCGGCGAT
ATCCTGAACA TTGCCAAGAA CGGTCCTCCG ATCATGGCGC AGTTGGAGAT ATCGGGGGGC
AGGACCAGCG ATATCCAGAT CTTTCAGGTC GAGGAGTTCG AAGGCATTCG CGTCATTTTC
GACTGGTACC CCACGGATGC CTCGACCGAT CCCATCGACA TGCGCATGGT GCTGGAGGGC
GACGGCGACC CCCTGAGCGA AACCTGGCTC TATCAGTACT TTCCGCCACC GCCGGAGCAG
CGCGAGCATC CGCCGCATCC GCTCGATGAC TGA
 
Protein sequence
MDRRTLLKRS LALAAFYGLP AAPLLAATRQ MPAIADGEAQ RFSFDWLQRL AREMANAPYR 
NNMRELPETL ASLTPQQYNA IGYDAGHSLW HDLDGKLDVQ FFHVGMGFNQ PVRMYALDPD
SHQAREIHFD PDIFRYDGAN VDVAQLEGED TLGFAGFRVF KVPSLTERDI VSFLGASYFR
AVDDTYQYGI SARGLAVNTF AESEDEEFPT YTRFWLETPA PDSTTFTAYA LLDSPKVAGA
YRFVIDCQPT RVVMDIEKHL YPREAIKQLG IAPMTSMFSC GTHQRRMCDT IHPQIHDSDR
LTMWRGNGEW VCRPLNNPPV LQYNAFADES PKGFGLLQTE RDFEAYEDVI GNYHERPSLW
IEPRSDWGKG EIQLMEIPTT GETMDNVVAF WKPAAPVEPG DSLTFAYRQY WSALPPVRPD
LARIRETRSG MGAFPEGWAP GEHYPEEWAR RFAVDYVGGD ILNIAKNGPP IMAQLEISGG
RTSDIQIFQV EEFEGIRVIF DWYPTDASTD PIDMRMVLEG DGDPLSETWL YQYFPPPPEQ
REHPPHPLDD