Gene Information |
Locus tag | Veis_2189 |
Symbol | |
ID | 4692294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 2482580 |
End bp | 2483695 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639849951 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_996955 |
Protein GI | 121609148 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
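The length fields in the table above are consistent with each other, and the relationship can be sketched in a few lines. This is a minimal illustration, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    s = "".join(seq.split()).upper()
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


if __name__ == "__main__":
    length = gene_length(2482580, 2483695)
    print(length)                  # 1116
    print(protein_length(length))  # 371
```

Passing the Gene sequence field from the Sequence section to `gc_percent` gives a value that can be compared with the GC content row; the blocks of ten bases can be pasted as-is because whitespace is stripped first.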
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.398062 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCCC ACACCACCCC TGCCAGCGAT GCCTGGTACC GCAGCGTCGA GAAAACCAGC CAAACCGACG ACGAACGTAT CAAGGACATC ACCGTGTTGC CCCCTCCAGA GCATCTGATC CGCTTTTTCC CGATCCGTGG CACGCCGGTC GAGACGCTGA TCACCCAGAC CCGCAAGAAC ATCCACCAGA TCATGACCGG CAAGGACGAT CGCCTGCTGA TGGTGATCGG CCCCTGCTCG ATCCACGACC CGGCGGCCGC CGTCGAGTAC GCGCGCCGCC TGATGGCCGC CCGCACCCGC TACGCCGGCA CCCTGGAAAT CGTGATGCGC GTGTACTTCG AGAAGCCGCG CACCACGGTC GGCTGGAAAG GGCTGATCAA CGACCCCTAC CTGGACGAGA GCTACCGCAT CGACGAGGGG CTGCGCATCG CGCGCCAACT GCTGATCGAA ATCAACCGCC TGGGCATGCC CGCCGGCAGC GAGTTTCTCG ACGTGATCTC GCCCCAGTAC ATCGGCGACC TGATCAGTTG GGGCGCCATC GGTGCCCGCA CCACCGAAAG CCAGGTGCAC CGCGAGCTGG CCTCGGGCAT TTCGGCGCCG ATAGGCTTCA AGAACGGCAC CGACGGCAAC ATCCGCATCG CCACCGACGC CATACAGTCG GCCAGCCGGG GCCATCACTT CCTGTCGGTG CACAAAAATG GCCAGGTCGC CGTCGTCAAC ACCAAGGGCA ACAAGGATTG CCACGTCATC CTGCGCGGCG GCAAAACGCC CAACTACGAC GCCGCCAGCG TCGCCGCCGC CTGCCAAGAC CTGCAAGCGG CCAAACTGCC CGCGCTGCTG ATGGTCGATT GCAGCCATGC CAACAGTTGC AAGCAGCACG AAAAGCAGCT CGACGTGGCC CGCGATGTCG CGGCGCAACT GGCCGCCGGC TCGCGCAGCA TCTTCGGCCT GATGATCGAA AGCCATCTGC ACGCAGGCGC CCAGAAGTTC ACGCCCGGCA AGGACCAGCC CGGCGCGCTC GAATACGGCA AGAGCATCAC CGATCCCTGC CTGGGCTGGG ACGATTCGCT GCAAGCGCTG GCAGAGTTGT CCGCCGCCGT GCAGGCGCGC AGGTAA
|
Protein sequence | MTAHTTPASD AWYRSVEKTS QTDDERIKDI TVLPPPEHLI RFFPIRGTPV ETLITQTRKN IHQIMTGKDD RLLMVIGPCS IHDPAAAVEY ARRLMAARTR YAGTLEIVMR VYFEKPRTTV GWKGLINDPY LDESYRIDEG LRIARQLLIE INRLGMPAGS EFLDVISPQY IGDLISWGAI GARTTESQVH RELASGISAP IGFKNGTDGN IRIATDAIQS ASRGHHFLSV HKNGQVAVVN TKGNKDCHVI LRGGKTPNYD AASVAAACQD LQAAKLPALL MVDCSHANSC KQHEKQLDVA RDVAAQLAAG SRSIFGLMIE SHLHAGAQKF TPGKDQPGAL EYGKSITDPC LGWDDSLQAL AELSAAVQAR R