Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_0751 |
Symbol | |
ID | 6200713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 841927 |
End bp | 843315 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641704748 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001831890 |
Protein GI | 182677744 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.326609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTCG AAAGCTGGTC GCCTGCGAGT TGGAGAAATA AGCCGATCGT GCAGGTGCCC GAATTTACCG ACGCGGCGAA ACTCGCCGAT GTCGAGGGCC AGCTCGCCGG TTTCCCTCCC CTCGTCTTCG CCGGCGAGGC GCGCAAGCTC AAGAGCCTGC TTGGCAAGGT TGCCGATGGG GAAGCTTTTC TCCTGCAGGG CGGCGATTGC GCGGAAAGCT TCGCCGAGCA TCATGCCGAT AATATTCGCG ATTTCTTCCG GGTCTTTTTG CAAATGGCTG TGGTCGCGAC CTTCGCCGCC GCTTTGCCGG TCGTGAAAGT CGGGCGCATC GCCGGCCAGT TTGCCAAGCC GCGTTCAGCG CCCAATGAAA AGGTTGGCGA CGTCGAATTG CCGAGCTATC GCGGCGATAT CGTCAATGAC ATTGCCTTTG ACAGCGCGGC GCGCGAACCA GATCCCGTGC GCCAGCTCAT GGCCTATCGG CAATCGGCGG CGACCTTGAA CCTGTTGCGC GCCTTCGCGA CCGGTGGCTA TGCCAATCTG GAAAATGCGC ATCGCTGGAT GCTTGGCTTC GTCTCTGATA GTCCTCAGTC GGCCCGCTAC CAGGAATTGG CCGACCGGAT CACTGAGACG CTCGGTTTCA TGCGGGCGAT CGGCCTTGAT CCGGAATCCC ATCATGAATT GCGGCAGACG GATTTCTTCA CCTCGCATGA GGCCTTGCTG CTTGGCTATG AAGAAGCCCT GACCCGCATC GATTCGACGA GCGGTGATTA TTATGCGACC TCGGGCCATA TGCTCTGGAT TGGCGATAGG ACCCGCGATC CGGGCCAGGC GCATGTGGAA TATGCCCGTG GCGTGAAAAA TCCGATCGGC CTCAAATGCG GCCCTTCGCT GAAGCCCGAC GATCTTCTGC GCCTGATCGA TCTGCTCAAT CCTGCGAACG AGGCAGGGCG GCTCACCTTG ATCTGCCGTT TCGGTGCCGA CAAGATCGGC GATCACCTGC CAGCCTTGAT CCGCGCTGTG CAACAGGAAG GGCGACGCGT TGTCTGGTCC TGCGATCCCA TGCATGGCAA CACGATCAAG GCGGCGTCCG GTTTCAAGAC GAGGCCTTTC GAAAAGATCA TGAGCGAGAT CCGCAGCTTC TTCGCGGTGC ACCGGGCCGA GGGCACCTAT GCCGGCGGCG TTCATCTCGA AATGACCGGC AAGAACGTCA CCGAATGCAC GGGCGGCGCG CGGGCCATCA CCGATGCCGA TCTGCATGAT CGTTATCACA CTTATTGCGA TCCGCGCCTC AATGCGGAAC AGTCGATCGA GGTGGCGTTC CTGGTGGCCG AATTGCTTAA GGAAGAGCGG ATTGGACGCG GCCAGCGTCT CATCCATGCG GCCGAATAA
|
Protein sequence | MAVESWSPAS WRNKPIVQVP EFTDAAKLAD VEGQLAGFPP LVFAGEARKL KSLLGKVADG EAFLLQGGDC AESFAEHHAD NIRDFFRVFL QMAVVATFAA ALPVVKVGRI AGQFAKPRSA PNEKVGDVEL PSYRGDIVND IAFDSAAREP DPVRQLMAYR QSAATLNLLR AFATGGYANL ENAHRWMLGF VSDSPQSARY QELADRITET LGFMRAIGLD PESHHELRQT DFFTSHEALL LGYEEALTRI DSTSGDYYAT SGHMLWIGDR TRDPGQAHVE YARGVKNPIG LKCGPSLKPD DLLRLIDLLN PANEAGRLTL ICRFGADKIG DHLPALIRAV QQEGRRVVWS CDPMHGNTIK AASGFKTRPF EKIMSEIRSF FAVHRAEGTY AGGVHLEMTG KNVTECTGGA RAITDADLHD RYHTYCDPRL NAEQSIEVAF LVAELLKEER IGRGQRLIHA AE
|
| |