Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSc2660 |
Symbol | aroG1 |
ID | 1221507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ralstonia solanacearum GMI1000 |
Kingdom | Bacteria |
Replicon accession | NC_003295 |
Strand | + |
Start bp | 2869768 |
End bp | 2870847 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637239061 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | NP_520781 |
Protein GI | 17547379 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.493931 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAGA ACACCGACGA CCTGCGCATT CGCGAACTGA AAGAGCTGAC CCCGCCGGCG CACCTGATCC GCGAATTCCC GTGCAACGAC GCCGTTTCCG AGCTGATCTA CCAGAGCCGC ACGGCGATGC ACCGCATCCT GCACGGCATG GACGACCGCC TGATCGTCAT CATCGGGCCG TGCTCGATCC ACGACACCAA GGCCGCCCTC GACTACGCGC GCCGCCTGGT CGAACAGCGC GAGCGCTTCA AGGCCGACCT GGAAATCGTC ATGCGCGTGT ACTTCGAGAA GCCGCGCACC ACGGTCGGCT GGAAGGGCCT GATCAACGAC CCGTACATGG ACGGCAGCTT CAAGATCAAC GACGGCCTGC GCACCGCGCG CGAGCTGCTG GTGAACATCA ACGAGCTGGG CGTGCCGGCG GGCACCGAAT ACCTCGACAT GATCAGCCCG CAGTACATCG CCGACCTGGT GAGCTGGGGC GCGATCGGGG CGCGCACGAC CGAATCGCAG GTGCACCGCG AACTCGCCTC CGGACTGTCG TGCCCGGTCG GCTTCAAGAA CGGCACCGAT GGCAACGTGA AGATCGCCGT GGACGCGATC AAGGCGGCGT CGCAGCCGCA CCACTTCCTG TCGGTCACCA AGGGCGGCCA CTCGGCCATC GTGTCGACGG CGGGCAACGA GGATTGCCAC ATCATCCTGC GCGGCGGCAA GGCGCCCAAC TACGATGCCG CCAGCGTGCA GGCGGCCTGC GAAGACATCG CCAAGTCCGG CCTGGCCGCG CGCCTGATGA TCGATGCCTC GCACGCCAAC AGCAGCAAGA AGCACGAGAA CCAGATCCCG GTCTGCGAAG ACATCGGCCG CCAGATCGCC GGCGGCGACG ACCGCATCGT CGGCGTGATG GTGGAATCGC ACATCAATGC CGGCCGCCAG GACCACGTGC AGGGCACCCC GGTCGAGGAC CTGAACTACG GCCAGAGCGT GACCGACGCC TGCATCGGCT GGGACGATTC GCTCAAGGTA CTGGAAACGC TGGCCGACGC CGTGCGCAAG CGCCGGCTGG TGCCCCGCAA CGGCAACTGA
|
Protein sequence | MPKNTDDLRI RELKELTPPA HLIREFPCND AVSELIYQSR TAMHRILHGM DDRLIVIIGP CSIHDTKAAL DYARRLVEQR ERFKADLEIV MRVYFEKPRT TVGWKGLIND PYMDGSFKIN DGLRTARELL VNINELGVPA GTEYLDMISP QYIADLVSWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GNVKIAVDAI KAASQPHHFL SVTKGGHSAI VSTAGNEDCH IILRGGKAPN YDAASVQAAC EDIAKSGLAA RLMIDASHAN SSKKHENQIP VCEDIGRQIA GGDDRIVGVM VESHINAGRQ DHVQGTPVED LNYGQSVTDA CIGWDDSLKV LETLADAVRK RRLVPRNGN
|
| |