Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0588 |
Symbol | |
ID | 4026247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 653719 |
End bp | 654789 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637965756 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_572649 |
Protein GI | 92112721 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0609679 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAC CCTTGACGAT GCCCCAACAC CCCGACGCCA CCGCGTCGCC CTCGCACCAC CCATTGCCCA CGCCGGGTGA GCTCAAGCAG CGCATACCGC TGGCCCCGGC CCTCGAAGCC CGCATCGCGC AACAGCGCGA CGCCGTCCAG GCGGTCCTCG ACGGACGGGA TGATCGCCTT CTGGTGGTCG TCGGCCCTTG CTCGATCCAT GACCCGCAGG CGGCATTGGC GTATGCCGAG AAGCTCAGCG AGCTTGCCGC GCGTCTCGAC GATCGCCTGC TGATGGTGAT GCGCGTCTAC ATCGAGAAGC CGCGTACCAC CGTGGGCTGG AAAGGCCTGG CCTACGATCC CCATCTGGAT GGCAGCGACG ACATGGCACA TGGCCTGGAA GTGTCGCGGC GCCTGATGCG CGACATCGCC GCCCTGGGCA TGCCCGTGGC CACCGAGTTG CTGCATCCGA TGACGGCACC TTACCTCGAG GATCTGCTCA GTTGGGTCGC CATCGGCGCA CGTACCACCG AATCCCAGGT GCATCGCGAA CTGGCCAGTG GCCTGCAGGC CGCGGTGGGA TTCAAGAACG GCACCGATGG CAGCGTCGAC GTGGCCATCG CCGCGATGCA GTCGGCCGCG CATCCGCATC GTCATTTCGC CATCGACGAT GCCGGGCGTC CGGCGATGCG CGAGACCCGG GGCAATCCGC ATACCCATGT CGTGCTGCGT GGCGGTCACG GCAAGCCGAA CTACGATGCG GCTTCGATGC GCGCCTGTCG GCTGGATCTG CAGGCGGCCG GTATCACGCC GCGTTTGATG GTCGATTGCA GCCATGCCAA TTCGCGCAAG GATCACCGCC GCCAGACCGA GGTCATGCTG GATGTGCTCG AACAGCGCTT GGCCGGCAAT CACGACGTGA TCGGCGTCAT GCTCGAAAGC TATTTGCACG AGGGCAAGCA GCCGCTCAAG CTGGGCGCGT TGCGCTATGG CGTCTCCGTC ACCGATGCCT GCCTGGGCTG GGAAGCGACC GAGCACTTGC TCACGCTGGC CGCCGAGCGG CTGGGCGGTG TCGCGCGTTG A
|
Protein sequence | MNAPLTMPQH PDATASPSHH PLPTPGELKQ RIPLAPALEA RIAQQRDAVQ AVLDGRDDRL LVVVGPCSIH DPQAALAYAE KLSELAARLD DRLLMVMRVY IEKPRTTVGW KGLAYDPHLD GSDDMAHGLE VSRRLMRDIA ALGMPVATEL LHPMTAPYLE DLLSWVAIGA RTTESQVHRE LASGLQAAVG FKNGTDGSVD VAIAAMQSAA HPHRHFAIDD AGRPAMRETR GNPHTHVVLR GGHGKPNYDA ASMRACRLDL QAAGITPRLM VDCSHANSRK DHRRQTEVML DVLEQRLAGN HDVIGVMLES YLHEGKQPLK LGALRYGVSV TDACLGWEAT EHLLTLAAER LGGVAR
|
| |