Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3635 |
Symbol | |
ID | 8334988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4067112 |
End bp | 4068218 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644956776 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_003114379 |
Protein GI | 256392815 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0253276 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCGC CCGTTCCCTC CTCGCCCGCA GAGCCCGCCG AGACCACCGA AACCACCAGT GACCTGCGTG TGACCAGCTT CCAGCCCCTC ATCCCACCGG CCGACCTGCG GGCCGAGTTG CCGCTGGGCG AGAAACGCGC CGCGTTGGTG CGTGAGAGCC GGCGTACCGT GCGCGACATC CTCGCCGGCG CCGACGACCG GCTGCTGGTC GTCGTCGGGC CGTGCTCGGT CCATGACCCC GCCGCCGCCC TGGAATACGC GCACCGGCTC GCCGCGGCCG CCGCCGAGCA CCGCGACGAC GTGTTCGTGG TCATGCGCGT CTACTTCGAG AAGCCGCGCA CCACGGTGGG CTGGAAGGGC CTGATCAACG ACCCGGGCAT GGACGGGACC CACGACGTCC CCCGAGGACT GCGCCTGGCG CGTCAGGTCC TGCTCGACGT ACTGGACGCC GGCCTGCCGA CCGGCTGCGA ATTCCTGGAG CCCACCAGCC CTCAGTACAT CGCCGACACC GTGTCCTGGG GCGCGATCGG CGCGCGAACG CCCGAAAGCC AAGTCCACAG GCAGCTCGCC TCCGGCATGT CGATGCCGGT CGGCTTCAAG AACGCCACCG ACGGCGCCAT CCAGCCCGCC ATCGACGGCT GCCGAGCCGC CGCCAGCGCG CAGTCCTTCT TCGGCATGGA CGAGCAAGGC CGCGGCGCGG TCGTCTCCAC CACCGGCAAC CCCGACTGCC ACATCATCCT GCGCGGCGGA CGCACCGGAC CCAACTACAG CACCGAAGAC GTGCGAGCCG CCCTGGACCT CGTCCGCGAG GCAGGCAAGC CGGAGCACCT GATCATCGAC GCCAGCCACG GCAACAGCGG CAAGGACCAC ACCCGCCAGA GCCTCGCCGT CCGCGAGATC GCGAACCGCC TCGCGGCCGG AGACACCGGC GTCGCGGGCA TGATGCTCGA GAGCTTCCTG GTCCCGGGAC GCCAGGAGCC GGGCCCGCTC GAGGGACTGC GCTACGGGCA GAGCGTGACG GACGCGTGTA TCGGCTGGGA GGAGACCGAG GAACTGCTCC AGGTCATGGC CACGGCGGTC CGCGACCGGC GCACGGCGCG GAGCTGA
|
Protein sequence | MPSPVPSSPA EPAETTETTS DLRVTSFQPL IPPADLRAEL PLGEKRAALV RESRRTVRDI LAGADDRLLV VVGPCSVHDP AAALEYAHRL AAAAAEHRDD VFVVMRVYFE KPRTTVGWKG LINDPGMDGT HDVPRGLRLA RQVLLDVLDA GLPTGCEFLE PTSPQYIADT VSWGAIGART PESQVHRQLA SGMSMPVGFK NATDGAIQPA IDGCRAAASA QSFFGMDEQG RGAVVSTTGN PDCHIILRGG RTGPNYSTED VRAALDLVRE AGKPEHLIID ASHGNSGKDH TRQSLAVREI ANRLAAGDTG VAGMMLESFL VPGRQEPGPL EGLRYGQSVT DACIGWEETE ELLQVMATAV RDRRTARS
|
| |