Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_0944 |
Symbol | |
ID | 7970119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 1038513 |
End bp | 1039631 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644791540 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002942861 |
Protein GI | 239813951 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGA ACACTGCCCC CGCCAGCGAC AGCTGGTATG CGAGCGTCGA AAAAACCAGC AAGACCGACG ACGAACGCAT CAAGGACATC AACGTGCTGC CCCCTCCCGA ACACCTGATC CGCTTCTTCC CGATCCGCGG CACGCCGGTC GAGACGCTGA TCGAAGGCAC CCGCCGCAGC ATCCACAACA TCATGGCGGG CAAGGACGAC CGGCTGCTGG TGGTCATGGG CCCGTGCTCG ATCCACGACC CGGCCGCGGC GCTCGAATAC GCCCGCCGCC TCAAGGTGGA ACGCGAAAAG TACGCCGGCA CGCTCGAGAT CGTGATGCGC GTGTACTTCG AGAAGCCGCG CACCACGGTC GGCTGGAAGG GCCTCATCAA CGACCCGTAC CTGGACGAGA GCTACCGCAT CGACGAAGGC CTGCGCATGG CGCGCCAGCT GCTGATCGAC ATCAACCGGC TCGGCCTGCC GGCGGGCAGC GAGTTCCTCG ACGTGATCTC GCCGCAGTAC ATCGGCGACC TGATCGCCTG GGGCGCGATC GGCGCGCGCA CCACCGAAAG CCAGGTGCAC CGCGAACTGG CCTCGGGCCT GTCGGCGCCC ATCGGCTTCA AGAACGGCAC CGACGGCAAC ATCCGCATCG CCACCGACGC CATCCAGGCG GCGGCGCGCG GCCACCACTT TCTCTCGGTG CACAAGAACG GCCAGGTCGC GATCGTGCAG ACCAACGGCA ACCGCGACTG CCACGTGATC CTGCGCGGCG GCAAGGCGCC CAACTACGAC GCGGCCAGCG TCGAGGCCGC CTGCAAGGAC CTCGAGGCCG CCAAGCTGCC GCCCACGCTG ATGGTCGACT GCAGCCACGC CAACAGCTCC AAGCAGCACC AGAAGCAGAT CGACGTGGCC AAGGACATCG CCGGCCAGAT CGCCGGGGGT TCGAACCGCG TCTTCGGCGT GATGGTCGAA AGCCACCTGC AGGCCGGCGC GCAGAAGTTC ACGCCGGGCA AGGACCAGCT CTCAGGCCTC GAATACGGCA AGAGCATCAC CGACGCCTGC CTGGGCTGGG ACGACTCGGT GCAGGTGCTG GACACCCTGT CGCAGGCGAT CAAGCAGCGC CGGGGCTGA
|
Protein sequence | MSTNTAPASD SWYASVEKTS KTDDERIKDI NVLPPPEHLI RFFPIRGTPV ETLIEGTRRS IHNIMAGKDD RLLVVMGPCS IHDPAAALEY ARRLKVEREK YAGTLEIVMR VYFEKPRTTV GWKGLINDPY LDESYRIDEG LRMARQLLID INRLGLPAGS EFLDVISPQY IGDLIAWGAI GARTTESQVH RELASGLSAP IGFKNGTDGN IRIATDAIQA AARGHHFLSV HKNGQVAIVQ TNGNRDCHVI LRGGKAPNYD AASVEAACKD LEAAKLPPTL MVDCSHANSS KQHQKQIDVA KDIAGQIAGG SNRVFGVMVE SHLQAGAQKF TPGKDQLSGL EYGKSITDAC LGWDDSVQVL DTLSQAIKQR RG
|
| |