Gene Csal_1145 details


Gene Information

Locus tag: Csal_1145
Symbol:
ID: 4027710
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1309547
End bp: 1311349
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 66%
IMG OID: 637966322
Product: dihydroxy-acid dehydratase
Protein accession: YP_573200
Protein GI: 92113272
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
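
The coordinate and length fields above agree with each other. A minimal Python sketch of that arithmetic follows. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1309547
end_bp = 1311349

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1803
print(protein_length)  # 600
```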


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGATC ACAAGATCAC CCCCGACCAG CTGCGCTCGC GCTGGTGGTT CGACAACCCC 
GAGCATCCGG GAACCACCGC GCTGTGCATC GAACGCTACA TGAACTACGG CCTGACCTTG
GAAGAGCTCA AGAGCGGCAA GCCGATCATC GGCATTGCCC AGTCGGGCTC CGACTTGACG
CCTTGCAACC GTCATCACAT CGAGCTGGTG AAGCGCGTCA AGGACGGTAT CCGTGCGGCG
GGGGGCGTGC CGTTCGAGTT TCCGCTGCAC CCGATTCACG AGAACGTGCG TCGACCCACC
GCGGCGCTGG ATCGCAACCT GGCGTACCTG GGGCTGGTGG AAGTGCTGCA TGGCTACCCG
CTCGATGGCG TGGTGTTGAC CACCGGCTGC GACAAGACCA CGCCGGCGTG CTTGATGGCC
GCCGCGACCG TCAACATCCC GGCGATCGTG CTCTCCGGTG GACCGATGCT CAACGGCTGG
CGCGGCAGTC AGGAACGCGT GGGCTCGGGC ACCATCATCT GGGAGCTGCG CAAGCGTCTC
GCGGCGGGCG ATATCGACTA CGAGGAATTC CTGTCCCGGG CCACCGACTC GGCGCCTTCC
ATCGGTCACT GCAATACCAT GGGCACCGCG TCGACGATGA ACTCCATGGC CGAGGCGCTG
GGCATGAGCC TGCCCGGCTC GGCGATGATC CCGGCGCCTT ACAAGGAGCG CTCGATGGTC
GCCTATCAGA CCGGGACGCG CATCGTCGAC ATGGTGTGGG AAGATCTGCG CCCGCTGGAC
ATCCTCACGC GTGAGGCGTG CGAGAACGCC ATCGTGGTGT GTTCGGCGCT GGGCGGTTCC
TCCAACGCGC CGGTGCACAT CAACGCCATC GCCCGTCATG CCGGCGTCGA GTTGACCAAC
GACGACTGGC AACGGCTGGG CCATGACGTG CCGCTGCTGG CCAATGTCAT GCCCGCGGGT
GCGTATCTGT CGGAGGAATT CCACCGTGCC GGTGGCGTTC CCGCGGTACT CAGCGAGCTG
CTCAAGGCGG GGCGCGTGCA TGGCGACGTG CCGACGGTCA ACGGCAGGAC GCTGGCGGAG
AACGTCGCCG GGCGTGAAAC CCTGGACGAC GACATCATCC GCCGCTACGA CAATCCTCTG
GTCGAGGCTG CCGGCTTCAT CAACCTCAAG GGCAACCTGT TCGACTCGGC GTTGATGAAG
ACCAGCGTGA TCGCCAGCGA CTTCCGGCAG CGTTTCCTGA GCCGTGCCGA GGATCCCGAT
GCCTTCGAAG GGCGGGTCGT GGTGTTCGAC GGCTCCGAGG ACTACCACGC ACGGATCGAC
GATCCGGCCC TCGAGGTCGA CGAGAACACC ATTCTGGTGA TGCGCGGGGC CGGCCCCGTG
GGGCACCCCG GCAGCGCCGA GGTCGTCAAC ATGCAGCCGC CGGAAGCCTT GATCAAGCAG
GGGATCGAAT CGCTGCCGTG TCTCGGCGAC GGGCGCCAGT CGGGTACCTC GGGCTCGCCG
TCGATTCTCA ACGCCGCGCC GGAAGCTGCC GTGGGCGGTG GCCTGGCGTT GCTGGAAACG
GGCGATCGGG TGCGTATCGA TCTCAAGCGC GGGGAGGCCA ACCTGCTCGT CGACGACGCC
GAATTGTCCC GTCGTCGCCA GGCGCTGGCA TCGCGCGGCG GGTACATTTA TCCGGTGGAT
CAAACGCCCT GGCAGGAAAT CCAGCGCGGC ATGGTCGAGC AGCTCGATCG CGGCATGACC
CTGGGCCCGG CGACCAAGTA TCGGGATGTC GCACGTCAGA GTCCGCCCCG GGACAATCAC
TGA
 
Protein sequence
MNDHKITPDQ LRSRWWFDNP EHPGTTALCI ERYMNYGLTL EELKSGKPII GIAQSGSDLT 
PCNRHHIELV KRVKDGIRAA GGVPFEFPLH PIHENVRRPT AALDRNLAYL GLVEVLHGYP
LDGVVLTTGC DKTTPACLMA AATVNIPAIV LSGGPMLNGW RGSQERVGSG TIIWELRKRL
AAGDIDYEEF LSRATDSAPS IGHCNTMGTA STMNSMAEAL GMSLPGSAMI PAPYKERSMV
AYQTGTRIVD MVWEDLRPLD ILTREACENA IVVCSALGGS SNAPVHINAI ARHAGVELTN
DDWQRLGHDV PLLANVMPAG AYLSEEFHRA GGVPAVLSEL LKAGRVHGDV PTVNGRTLAE
NVAGRETLDD DIIRRYDNPL VEAAGFINLK GNLFDSALMK TSVIASDFRQ RFLSRAEDPD
AFEGRVVVFD GSEDYHARID DPALEVDENT ILVMRGAGPV GHPGSAEVVN MQPPEALIKQ
GIESLPCLGD GRQSGTSGSP SILNAAPEAA VGGGLALLET GDRVRIDLKR GEANLLVDDA
ELSRRRQALA SRGGYIYPVD QTPWQEIQRG MVEQLDRGMT LGPATKYRDV ARQSPPRDNH
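
The gene and protein sequences above are printed in blocks of ten characters, so they need cleaning before any analysis. The sketch below shows one possible way to strip the spacing, compute a GC fraction, and translate codons. It is an illustration rather than the method used to produce this record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which to my knowledge translation table 11 shares for elongation codons (it differs in the alternative start codons it permits). Only the first 30 bases of the gene sequence are used here, to keep the example short.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
prefix = clean("ATGAACGATC ACAAGATCAC CCCCGACCAG")
peptide = translate(prefix)
print(peptide)  # MNDHKITPDQ
```

The translated prefix matches the first block of the protein sequence. Running `gc_fraction` on the full cleaned gene sequence would allow a comparison with the GC content reported in the record.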