Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1625 |
Symbol | |
ID | 4028229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1846124 |
End bp | 1847203 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637966814 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_573677 |
Protein GI | 92113749 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.445973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAAC AGCAGGTCAA CAATATCAAC GTCCTCGCGC AGGACGTGCT GGTCACTCCC GAAGCTCTCA AGCGGGAAAT CCCTCTTTCC GAGGAAGCCG AGCGCACGGT ACTCGAGGGT CGTCGGACCC TGCAGCGCAT TCTCGATGGC GACGACCCGC GCCTGGTCGT GGTCATCGGC CCGTGCTCGA TCCACGACGT GGAAGCCGCG CGCGACTACG CGCGGCGTCT GCGCAAGCTG GCCGACGAGG TCAGCGACAG CCTGTACGTC GTGATGCGCG TGTATTTCGA GAAGCCGCGT ACCACCGTCG GCTGGAAGGG ACTGATCAAC GACCCTCACC TCGACGATAC CTTCGACATC CAGGAGGGGC TGCGCACCGC CCGGCGCCTG CTGGTGGAAC TCGCCGAAAT GGGGCTGCCG CTGGCCACCG AGGCGCTCGA CCCCATCTCG CCCCAGTATC TGCAGGATTG CATCAGCTGG TCGGCCATCG GCGCCCGCAC CACCGAGTCC CAGACGCACC GTGAAATGGC CTCCGGTCTC TCGTCGCCGG TGGGCTTCAA GAACGGCACC GACGGCAGTC TCGATGTCGC GGTCAACGCC CTCAAGTCGG TGGCCCACCC GCACAATTTC CTGGGCATCG ATCAGGGTGG CCAGGTCGCG GTGATCCGCA CGCGCGGCAA CGCCTACGGG CACATCGTCC TGCGCGGCGG CAACGGCAAG CCCAACTACG ACAGCGTCAG CGTCACGCTG GCCGAGAACG AGCTGCGTGC GGCCGGCGTC ACCCCCAACA TCATGGTCGA CTGCTCGCAC GCCAACTCCA ACAAGGACCC GTCCCTGCAG CCGCTGGTGA TGGACAACAT CGCCAATCAG ATCCTCGAGG GCAACACCTC GATCATGGGC CTGATGGTCG AATCGCATAT CGGCTGGGGC AACCAGAAGA TTCCCGAGGA CCGCTCGCAG CTGCAATACG GGGTCTCGAT CACCGATGCC TGCATCGACT GGCCGACCAC CGAAACGGCC ATGCGCAGCA TGGACGAGAA GCTCAAGCCG GTACTCGGTC AGCGCCAGCG CGGCGCCTGA
|
Protein sequence | MSEQQVNNIN VLAQDVLVTP EALKREIPLS EEAERTVLEG RRTLQRILDG DDPRLVVVIG PCSIHDVEAA RDYARRLRKL ADEVSDSLYV VMRVYFEKPR TTVGWKGLIN DPHLDDTFDI QEGLRTARRL LVELAEMGLP LATEALDPIS PQYLQDCISW SAIGARTTES QTHREMASGL SSPVGFKNGT DGSLDVAVNA LKSVAHPHNF LGIDQGGQVA VIRTRGNAYG HIVLRGGNGK PNYDSVSVTL AENELRAAGV TPNIMVDCSH ANSNKDPSLQ PLVMDNIANQ ILEGNTSIMG LMVESHIGWG NQKIPEDRSQ LQYGVSITDA CIDWPTTETA MRSMDEKLKP VLGQRQRGA
|
| |