Gene Information |
Locus tag | Csal_1145 |
Symbol | |
ID | 4027710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1309547 |
End bp | 1311349 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637966322 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_573200 |
Protein GI | 92113272 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
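The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (which is consistent with the reported 1803 bp) and that the CDS ends in a stop codon; the function names are illustrative and not part of the record:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Coordinates taken from the Start bp / End bp fields of this record.
length = gene_length_bp(1309547, 1311349)
print(length, protein_length_aa(length))  # prints "1803 600"
```

The 601 codons include the terminal TGA stop, which is why the protein is listed as 600 aa.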
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATC ACAAGATCAC CCCCGACCAG CTGCGCTCGC GCTGGTGGTT CGACAACCCC GAGCATCCGG GAACCACCGC GCTGTGCATC GAACGCTACA TGAACTACGG CCTGACCTTG GAAGAGCTCA AGAGCGGCAA GCCGATCATC GGCATTGCCC AGTCGGGCTC CGACTTGACG CCTTGCAACC GTCATCACAT CGAGCTGGTG AAGCGCGTCA AGGACGGTAT CCGTGCGGCG GGGGGCGTGC CGTTCGAGTT TCCGCTGCAC CCGATTCACG AGAACGTGCG TCGACCCACC GCGGCGCTGG ATCGCAACCT GGCGTACCTG GGGCTGGTGG AAGTGCTGCA TGGCTACCCG CTCGATGGCG TGGTGTTGAC CACCGGCTGC GACAAGACCA CGCCGGCGTG CTTGATGGCC GCCGCGACCG TCAACATCCC GGCGATCGTG CTCTCCGGTG GACCGATGCT CAACGGCTGG CGCGGCAGTC AGGAACGCGT GGGCTCGGGC ACCATCATCT GGGAGCTGCG CAAGCGTCTC GCGGCGGGCG ATATCGACTA CGAGGAATTC CTGTCCCGGG CCACCGACTC GGCGCCTTCC ATCGGTCACT GCAATACCAT GGGCACCGCG TCGACGATGA ACTCCATGGC CGAGGCGCTG GGCATGAGCC TGCCCGGCTC GGCGATGATC CCGGCGCCTT ACAAGGAGCG CTCGATGGTC GCCTATCAGA CCGGGACGCG CATCGTCGAC ATGGTGTGGG AAGATCTGCG CCCGCTGGAC ATCCTCACGC GTGAGGCGTG CGAGAACGCC ATCGTGGTGT GTTCGGCGCT GGGCGGTTCC TCCAACGCGC CGGTGCACAT CAACGCCATC GCCCGTCATG CCGGCGTCGA GTTGACCAAC GACGACTGGC AACGGCTGGG CCATGACGTG CCGCTGCTGG CCAATGTCAT GCCCGCGGGT GCGTATCTGT CGGAGGAATT CCACCGTGCC GGTGGCGTTC CCGCGGTACT CAGCGAGCTG CTCAAGGCGG GGCGCGTGCA TGGCGACGTG CCGACGGTCA ACGGCAGGAC GCTGGCGGAG AACGTCGCCG GGCGTGAAAC CCTGGACGAC GACATCATCC GCCGCTACGA CAATCCTCTG GTCGAGGCTG CCGGCTTCAT CAACCTCAAG GGCAACCTGT TCGACTCGGC GTTGATGAAG ACCAGCGTGA TCGCCAGCGA CTTCCGGCAG CGTTTCCTGA GCCGTGCCGA GGATCCCGAT GCCTTCGAAG GGCGGGTCGT GGTGTTCGAC GGCTCCGAGG ACTACCACGC ACGGATCGAC GATCCGGCCC TCGAGGTCGA CGAGAACACC ATTCTGGTGA TGCGCGGGGC CGGCCCCGTG GGGCACCCCG GCAGCGCCGA GGTCGTCAAC ATGCAGCCGC CGGAAGCCTT GATCAAGCAG GGGATCGAAT CGCTGCCGTG TCTCGGCGAC GGGCGCCAGT CGGGTACCTC GGGCTCGCCG TCGATTCTCA ACGCCGCGCC GGAAGCTGCC GTGGGCGGTG GCCTGGCGTT GCTGGAAACG GGCGATCGGG TGCGTATCGA TCTCAAGCGC GGGGAGGCCA ACCTGCTCGT CGACGACGCC GAATTGTCCC GTCGTCGCCA GGCGCTGGCA TCGCGCGGCG GGTACATTTA TCCGGTGGAT CAAACGCCCT GGCAGGAAAT CCAGCGCGGC ATGGTCGAGC AGCTCGATCG CGGCATGACC CTGGGCCCGG CGACCAAGTA TCGGGATGTC GCACGTCAGA GTCCGCCCCG GGACAATCAC TGA
|
Protein sequence | MNDHKITPDQ LRSRWWFDNP EHPGTTALCI ERYMNYGLTL EELKSGKPII GIAQSGSDLT PCNRHHIELV KRVKDGIRAA GGVPFEFPLH PIHENVRRPT AALDRNLAYL GLVEVLHGYP LDGVVLTTGC DKTTPACLMA AATVNIPAIV LSGGPMLNGW RGSQERVGSG TIIWELRKRL AAGDIDYEEF LSRATDSAPS IGHCNTMGTA STMNSMAEAL GMSLPGSAMI PAPYKERSMV AYQTGTRIVD MVWEDLRPLD ILTREACENA IVVCSALGGS SNAPVHINAI ARHAGVELTN DDWQRLGHDV PLLANVMPAG AYLSEEFHRA GGVPAVLSEL LKAGRVHGDV PTVNGRTLAE NVAGRETLDD DIIRRYDNPL VEAAGFINLK GNLFDSALMK TSVIASDFRQ RFLSRAEDPD AFEGRVVVFD GSEDYHARID DPALEVDENT ILVMRGAGPV GHPGSAEVVN MQPPEALIKQ GIESLPCLGD GRQSGTSGSP SILNAAPEAA VGGGLALLET GDRVRIDLKR GEANLLVDDA ELSRRRQALA SRGGYIYPVD QTPWQEIQRG MVEQLDRGMT LGPATKYRDV ARQSPPRDNH
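The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any computation. The sketch below shows one way to do that and then to recompute the GC fraction and the translation. It assumes the amino-acid assignments of translation table 11, which match the standard code; it does not handle the alternative start codons that table 11 allows, which is harmless here because the gene starts with ATG. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard / table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGAACGATC ACAAGATCAC CCCCGACCAG"))  # prints "MNDHKITPDQ"
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields of the record.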
|
| |