Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_2280 |
Symbol | |
ID | 7293750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 2561365 |
End bp | 2563083 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643590684 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002488334 |
Protein GI | 220913025 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00000860305 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGAGA GCCAGATCGA AACAGAGAGC AAGCCTGACA TCAAGCCCCG CAGCCGGGTG GTTACCGACG GAATCCACGC CGCACCGGCG CGCGGCATGT TCCGTGCGGT CGGCATGGGC GACGACGACT TCGCGAAGCC CCAGGTCGGG GTTGCGAGTT CGTGGAACGA GATCACTCCC TGCAACCTCT CCCTGAACCG GCTGGCCCAG GGCGCCAAGG AGGGTGTCCA CGCCGGTGGC GGCTTCCCCA TGCAGTTCGG CACCATCTCC GTGTCCGACG GTATTTCCAT GGGCCACGAG GGCATGCACT TCTCCCTGGT CTCCCGCGAA GTGATCGCCG ATTCCGTCGA AACCGTCATG CAGGCCGAGC GGATTGACGG TTCGGTCTTG CTCGCCGGCT GCGACAAGTC CCTTCCCGGC ATGCTCATGG CCGCGGCCCG GCTCAACCTG TCCAGCGTGT TCCTCTACGC CGGCTCGATC ATGCCGGGCT GGGTAAAGCT CGAGGACGGC TCCGAGAAGG AAGTCACCCT CATCGATGCC TTTGAGGCCG TGGGCGCCTG CGCCGCCGGC AAGATGAGCA TGGAAGACCT CACGCGCATT GAAAAGGCCA TCTGTCCCGG CGAAGGCGCC TGCGGCGGCA TGTACACGGC CAACACCATG GCCTGCATCG GTGAGGCGCT GGGCATGTCC CTGCCCGGAT CTGCCGCCCC GCCCTCGGCA GACCGCCGTC GTGATGACTT TGCGCGCAAG TCCGGTGAAG CAGTGGTGAA CCTGCTGCGC AAGGGCATCA CCGCCCGCGA CATCATGACC AAGGAGGCCT TTGAAAACGC CATCGCCGTC ACCATGGCGT TCGGCGGGTC CACCAACGCG GTCCTGCACC TGCTGGCCAT TGCCCGCGAA GCCGAGGTAG AGCTTACCCT CGACGACTTC AACCGCATCG GTGACAGGAT CCCGCACCTC GGCGACCTCA AGCCGTTCGG CCGCTACGTC ATGACCGACG TCGACAAAAT CGGCGGCGTG CCCGTCATCA TGAAGGCACT GCTCGACGCC GGGCTGCTGC ACGGCGACTG CCTCACCGTC ACCGGCAAGA CCGTGGCGGA AAACCTGGAA GCCATCAACC CGCCGGATGT TGACGGCAAG ATCCTGCGCG CAATGGACAA CCCCATCCAC AAGACCGGCG GCATCACCAT CCTGCACGGC ACCATGGCGC CGGAAGGCGC CGTAGTGAAG ACTGCAGGAT TCGACGCCGA CGTCTTCGAG GGGACCGCCC GCGTGTTCGA CCGCGAGCAG GGCGCCCTGC AGGCGCTGGA CCAGGGCGAA ATCCATGCCG GCGACGTCGT GGTCATCCGC TACGAGGGGC CCAAGGGCGG CCCGGGCATG CGGGAGATGC TGGCCATCAC CGGCGCCATC AAGGGCGCGG GCCTCGGCAA GGACGTCCTG CTGCTGACCG ATGGCCGGTT CTCCGGCGGG ACCACCGGGC TGTGCATCGG CCACGTCGCG CCGGAAGCGG TGGACGGCGG CCCTATCGCC TTCGTCAGGG ACGGTGACCG GATCCGCGTG GACATCGCCG CGCGCAGCTT CGACCTGCTG GTCGATGACG CCGAGCTTGA GGCCCGCAAG GTTGGCTGGG AGCCGCTGCC GGCCCGCTAC ACCAAGGGCG TCCTGGCCAA GTACGCCAAG CTCGTGCACA GCGCGAGCAC CGGCGCATAC TGCGGCTGA
|
Protein sequence | MSESQIETES KPDIKPRSRV VTDGIHAAPA RGMFRAVGMG DDDFAKPQVG VASSWNEITP CNLSLNRLAQ GAKEGVHAGG GFPMQFGTIS VSDGISMGHE GMHFSLVSRE VIADSVETVM QAERIDGSVL LAGCDKSLPG MLMAAARLNL SSVFLYAGSI MPGWVKLEDG SEKEVTLIDA FEAVGACAAG KMSMEDLTRI EKAICPGEGA CGGMYTANTM ACIGEALGMS LPGSAAPPSA DRRRDDFARK SGEAVVNLLR KGITARDIMT KEAFENAIAV TMAFGGSTNA VLHLLAIARE AEVELTLDDF NRIGDRIPHL GDLKPFGRYV MTDVDKIGGV PVIMKALLDA GLLHGDCLTV TGKTVAENLE AINPPDVDGK ILRAMDNPIH KTGGITILHG TMAPEGAVVK TAGFDADVFE GTARVFDREQ GALQALDQGE IHAGDVVVIR YEGPKGGPGM REMLAITGAI KGAGLGKDVL LLTDGRFSGG TTGLCIGHVA PEAVDGGPIA FVRDGDRIRV DIAARSFDLL VDDAELEARK VGWEPLPARY TKGVLAKYAK LVHSASTGAY CG
|
| |