Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2540 |
Symbol | |
ID | 4444948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2848632 |
End bp | 2850353 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639690357 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_832019 |
Protein GI | 116671086 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0297342 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGG ACACCCAAAC CGCGACAGAA AACAAGCCGG ACATCAAGCC CCGCAGCCGG GTCGTAACCG ACGGAATCCA CGCCGCTCCC GCGCGAGGAA TGTTCCGGGC GGTCGGCATG GGCGACGATG ACTTTGCGAA GCCCCAGATT GGCGTGGCGA GTTCCTGGAA CGAGATCACT CCCTGCAACC TTTCCCTGAA CCGGCTGGCC CAGGGCGCCA AGGAAGGCGT CCACGCCGGC GGCGGGTTCC CCATGCAGTT CGGCACCATC TCGGTCTCCG ACGGCATCTC CATGGGTCAC GAGGGCATGC ACTTCTCCCT CGTTTCGCGC GAAGTCATTG CCGACTCCGT GGAAACCGTG ATGCAGGCCG AGCGGATTGA CGGCTCGGTG CTCCTGGCCG GCTGCGACAA GTCCCTCCCC GGAATGCTGA TGGCGGCCGC GCGCCTGGAC CTCGCCAGCG TGTTCCTCTA CGCCGGTTCC ATCATGCCCG GCTGGGTCAA GCTGGAGGAC GGTTCCGAAA AGGAAGTCAC CCTCATCGAC GCATTCGAGG CCGTGGGCGC CTGCGCCGCG GGCAAGATGA GCAGGGGAGA CCTTGACCGC ATCGAACGCG CCATCTGCCC CGGCGAAGGT GCCTGCGGCG GGATGTACAC GGCCAACACC ATGGCCTGCA TCGGCGAAGC CCTGGGCATG TCCCTGCCGG GCTCCGCCGC TCCTCCGTCG GCAGACCGCC GTCGTGATGA ATTCGCCCGC AAATCCGGAG AAGCAGTGGT CAACCTGCTC CGCCTCGGCA TCACTGCGCG CGACATCATG ACCAAGAAGG CATTCGAGAA CGCCATCGCC GTGACCATGG CATTCGGCGG CTCCACGAAC GCAGTGCTGC ACCTGCTGGC CATCGCCCGC GAAGCTGAAG TGGAACTGAC GCTCGATGAC TTCAACCGCA TCGGCGACAA GATTCCGCAC CTGGGCGACC TGAAGCCGTT CGGACGCTAC GTGATGACCG ACGTCGACAA GATCGGCGGC GTTCCGGTCA TCATGAAGGC ACTGCTCGAC GCCGGGCTGC TGCACGGCGA CTGCCTGACC GTCACCGGCA AGACCCTGGC GGAAAACCTT GCATCCATCA ACCCGCCGGA CCTGGATGGC AAGATCCTGC GTGCCCTGGA CAACCCGATC CACAAGACCG GCGGCATCAC CATCCTGCAC GGTTCCATGG CACCTGAAGG CGCCGTCGTG AAGAGCGCGG GCTTCGACGC CGACGTTTTC GAAGGCACGG CCCGCGTGTT CGAGCGCGAG CAGGGCGCCC TTGACGCGCT GGACAACGGC AAAATCAACA AGGGCGACGT CGTGGTCATT CGCTATGAAG GGCCGAAGGG CGGCCCGGGC ATGCGCGAAA TGCTCGCTAT CACCGGCGCC ATCAAGGGTG CCGGGCTGGG CAAAGATGTG CTGCTTCTCA CGGATGGCCG CTTCTCCGGC GGTACCACCG GCCTGTGCAT CGGCCACGTC GCGCCTGAAG CCGTCGACGG CGGTCCTATC GCCTTCGTCA AGGACGGTGA CCGCATCCGC GTTGACATTG CCGCCCGCAG CTTCGACCTG CTGGTGGACG AGGCTGAGCT CGAGTCCCGC AAGGTCGGCT GGGAGCCGCT CCCGGCCAAG TTCACCAAGG GCGTCCTGGC CAAGTACGCC AAGCTGGTGC ACAGCGCCTC CACCGGCGCA TACTGCGGGT AG
|
Protein sequence | MSEDTQTATE NKPDIKPRSR VVTDGIHAAP ARGMFRAVGM GDDDFAKPQI GVASSWNEIT PCNLSLNRLA QGAKEGVHAG GGFPMQFGTI SVSDGISMGH EGMHFSLVSR EVIADSVETV MQAERIDGSV LLAGCDKSLP GMLMAAARLD LASVFLYAGS IMPGWVKLED GSEKEVTLID AFEAVGACAA GKMSRGDLDR IERAICPGEG ACGGMYTANT MACIGEALGM SLPGSAAPPS ADRRRDEFAR KSGEAVVNLL RLGITARDIM TKKAFENAIA VTMAFGGSTN AVLHLLAIAR EAEVELTLDD FNRIGDKIPH LGDLKPFGRY VMTDVDKIGG VPVIMKALLD AGLLHGDCLT VTGKTLAENL ASINPPDLDG KILRALDNPI HKTGGITILH GSMAPEGAVV KSAGFDADVF EGTARVFERE QGALDALDNG KINKGDVVVI RYEGPKGGPG MREMLAITGA IKGAGLGKDV LLLTDGRFSG GTTGLCIGHV APEAVDGGPI AFVKDGDRIR VDIAARSFDL LVDEAELESR KVGWEPLPAK FTKGVLAKYA KLVHSASTGA YCG
|
| |