Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1883 |
Symbol | |
ID | 4073344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2261497 |
End bp | 2262564 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637983892 |
Product | 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase |
Protein accession | YP_590958 |
Protein GI | 94968910 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0314468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAGTCG TGATGAACGC GCACGCCACC GAGGAACAGG TCGGCGCGGT CTGTAAACGG ATTGAGGAAC TTGGGTTCCG CGCGCATCCG ATCCCGGGAG CGCAGCGGAC TGCCATTGGT ATTACCGGCA ACCAGGGCGA AGTCGAACCT GGCGCCATCG AAGAACTGCC TGGCGTAGTG GAAGTCATCC GGGTCAGCAA GCCTTACAAG CTCGTAAGCC GAGATGTGAA GGAAGACAAC ACCGTCGTCC GCTTCGCGAA CGGCGCGACC ATCGGCAGCG AAGAACTGGC GGTCGTGGCA GGGCCGTGCG CGATCGAGAA CCACAAGCAG GCCTTTGCGA TCGCCGAGCA CGTTGCCAAA TCGGGAGTGC GGTTCTTCCG CGGCGGGGCG TATAAGCCGC GCACCTCGCC GTATTCGTTC CAGGGACTGG GCGAAGAGGG CCTGAAAATT ATGGCCGAGA TTCGCGACCA GTTCGGCTTG CTGATCGTCA CCGAGGCAGT GGACAACGAG TCGCTCGACC AGGTTGAGAA ATATGCCGAT GTGATCCAGA TCGGCGCGCG CAACATGCAG AACTTCTCGC TCCTCAAGCG TGCCGGACGC GCTCGCAAGC CGGTGCTGCT GAAACGAGGC ATGTCGGCGA CACTGGAAGA GTTCCTGATG GCTGCCGAGT ACGTAATGAG CGAGGGCAAC TACAACGTTG TGCTCTGCGA GCGCGGCGTG CGGACGTTCT CCGATTACAC ACGCAATACC CTCGACCTGA GCGTGGTGCC AGCAGTGCAT CGGTTGAGCC ATCTGCCGAT CCTTGTGGAT CCGAGCCATG GGACGGGGGT ACGCAGCAAG GTGACGCCGT TGTCCCGCGC ATCGGTCGCC GTTGGAGCGG ATGGGCTGAT CGTGGAAGTG CACAACGAAC CGGACCGCGC TCTCTCCGAT GGCAAGCAGT CTCTTTATCT CGAACAGTTC GACGAACTGA TGACACAGGT TCGGCAGATC GCGCCGGTGG TTCAGCGCAA AGTCGCCGAC CGTGGCCTGG CGCTCACGAC GCGATTGAAT TCGGCCAGCG CCCGATGA
|
Protein sequence | MLVVMNAHAT EEQVGAVCKR IEELGFRAHP IPGAQRTAIG ITGNQGEVEP GAIEELPGVV EVIRVSKPYK LVSRDVKEDN TVVRFANGAT IGSEELAVVA GPCAIENHKQ AFAIAEHVAK SGVRFFRGGA YKPRTSPYSF QGLGEEGLKI MAEIRDQFGL LIVTEAVDNE SLDQVEKYAD VIQIGARNMQ NFSLLKRAGR ARKPVLLKRG MSATLEEFLM AAEYVMSEGN YNVVLCERGV RTFSDYTRNT LDLSVVPAVH RLSHLPILVD PSHGTGVRSK VTPLSRASVA VGADGLIVEV HNEPDRALSD GKQSLYLEQF DELMTQVRQI APVVQRKVAD RGLALTTRLN SASAR
|
| |