Gene Information |
Locus tag | Acid345_0281 |
Symbol | |
ID | 4068825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 293829 |
End bp | 294884 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637982282 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_589360 |
Protein GI | 94967312 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
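
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for Acid345_0281.
length_bp = gene_length(293829, 294884)
print(length_bp)                  # 1056
print(protein_length(length_bp))  # 351
```

Under these assumptions the computed values agree with the Gene Length (1056 bp) and Protein Length (351 aa) fields of the record.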
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCC CGACCGAAGA CTTACGCATC CAGTGGACAA AAGTTGTTCT TCCGCCGGCG TTCCTCGACG AGGAACTGCC GACGACGGAG AACGCTTCAG CGACCGTCGC GAACGCGCGC AACACGGTGC GCGACATAAT TCGTGGAGAA GATTCGCGGC TGCTGGTGGT GCTGGGACCG TGTTCCATCC ATGACGTGAA AGCGGCGCGC GAATATGCGG CGCTGCTGAA AGACGCAATC ACCGAGCTTT CGAACGATCT CTTCCTGGTA ATGCGGGTGT ACTTCGAGAA GCCTCGCACC ACGATTGGGT GGAAGGGGCT GATCAACGAT CCGCACCTCG ACGAGTCGTT CAACATCAAC GACGGCCTGC GGATTTCGCG GCATTTGCTG CTGGATCTCG CGGAGATGGG TGTTCCCGCA GGCACGGAAT TTCTTGACAT GATTACGCCG CAGTATCTTG CGGGATTGGT GTGCTGGGGC GCGATCGGAG CGCGTACCAC CGAGAGCCAG ATCCATCGCG AGCTGGTGAG TGGACTGTCG TGTCCGGTGG GATTTAAGAA TGGGACGTCG GGAAATGTCG GCATCGCGAT TGAGGCGGTG CAGTCGGCAG CACATCCGCA TACGTTCCTC GGACACACGA AGTATGGGCA GTCGGCGATC TTCGCGACCA CGGGCAATCC GGATTGCCAT GTGATCCTGC GCGGCGGACG CAAGCTGGCA AATTACGACG CGGCGTCCGT GAAAGAAGCG TGTGGGCTGC TGGAGAAGGC TGGGTTGCCG CAGCGCTTGA TGATCGATTG CAGCCACGCC AACAGCAACA AGGACCACAC GCGTCAGGGC GCGGTGGCGC GCGATGTGGC TGGGCAGATT GCCGGCGGAA ACAAAGCGAT CATCGGCGTG ATGATTGAGA GCAACCTCGT CGGTGGCGCA CAGAAGTTCG TGAAGGGCAA GCCGCTGGTC TACGGGCAGA GCATTACCGA CGCTTGCATT GACTGGAAAG AAACGCGGGG GCTGCTGGGG GAACTGGCGG CTGCGGTGCG CTCTCGCCGG AAGTAG
|
Protein sequence | MIRPTEDLRI QWTKVVLPPA FLDEELPTTE NASATVANAR NTVRDIIRGE DSRLLVVLGP CSIHDVKAAR EYAALLKDAI TELSNDLFLV MRVYFEKPRT TIGWKGLIND PHLDESFNIN DGLRISRHLL LDLAEMGVPA GTEFLDMITP QYLAGLVCWG AIGARTTESQ IHRELVSGLS CPVGFKNGTS GNVGIAIEAV QSAAHPHTFL GHTKYGQSAI FATTGNPDCH VILRGGRKLA NYDAASVKEA CGLLEKAGLP QRLMIDCSHA NSNKDHTRQG AVARDVAGQI AGGNKAIIGV MIESNLVGGA QKFVKGKPLV YGQSITDACI DWKETRGLLG ELAAAVRSRR K
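
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions: the sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAG even though the gene lies on the minus strand), whitespace between the 10-base groups is only formatting, and the codon assignments are those of NCBI translation table 11, which for internal codons are the same as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built from the NCBI amino acid string for translation table 11
# (bases ordered T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGATCCGCC CGACCGAAGA CTTACGCATC"))  # MIRPTEDLRI
```

The ten residues produced from the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.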