Gene Arth_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2540 
Symbol 
ID4444948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2848632 
End bp2850353 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content65% 
IMG OID639690357 
Productdihydroxy-acid dehydratase 
Protein accessionYP_832019 
Protein GI116671086 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0297342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAGG ACACCCAAAC CGCGACAGAA AACAAGCCGG ACATCAAGCC CCGCAGCCGG 
GTCGTAACCG ACGGAATCCA CGCCGCTCCC GCGCGAGGAA TGTTCCGGGC GGTCGGCATG
GGCGACGATG ACTTTGCGAA GCCCCAGATT GGCGTGGCGA GTTCCTGGAA CGAGATCACT
CCCTGCAACC TTTCCCTGAA CCGGCTGGCC CAGGGCGCCA AGGAAGGCGT CCACGCCGGC
GGCGGGTTCC CCATGCAGTT CGGCACCATC TCGGTCTCCG ACGGCATCTC CATGGGTCAC
GAGGGCATGC ACTTCTCCCT CGTTTCGCGC GAAGTCATTG CCGACTCCGT GGAAACCGTG
ATGCAGGCCG AGCGGATTGA CGGCTCGGTG CTCCTGGCCG GCTGCGACAA GTCCCTCCCC
GGAATGCTGA TGGCGGCCGC GCGCCTGGAC CTCGCCAGCG TGTTCCTCTA CGCCGGTTCC
ATCATGCCCG GCTGGGTCAA GCTGGAGGAC GGTTCCGAAA AGGAAGTCAC CCTCATCGAC
GCATTCGAGG CCGTGGGCGC CTGCGCCGCG GGCAAGATGA GCAGGGGAGA CCTTGACCGC
ATCGAACGCG CCATCTGCCC CGGCGAAGGT GCCTGCGGCG GGATGTACAC GGCCAACACC
ATGGCCTGCA TCGGCGAAGC CCTGGGCATG TCCCTGCCGG GCTCCGCCGC TCCTCCGTCG
GCAGACCGCC GTCGTGATGA ATTCGCCCGC AAATCCGGAG AAGCAGTGGT CAACCTGCTC
CGCCTCGGCA TCACTGCGCG CGACATCATG ACCAAGAAGG CATTCGAGAA CGCCATCGCC
GTGACCATGG CATTCGGCGG CTCCACGAAC GCAGTGCTGC ACCTGCTGGC CATCGCCCGC
GAAGCTGAAG TGGAACTGAC GCTCGATGAC TTCAACCGCA TCGGCGACAA GATTCCGCAC
CTGGGCGACC TGAAGCCGTT CGGACGCTAC GTGATGACCG ACGTCGACAA GATCGGCGGC
GTTCCGGTCA TCATGAAGGC ACTGCTCGAC GCCGGGCTGC TGCACGGCGA CTGCCTGACC
GTCACCGGCA AGACCCTGGC GGAAAACCTT GCATCCATCA ACCCGCCGGA CCTGGATGGC
AAGATCCTGC GTGCCCTGGA CAACCCGATC CACAAGACCG GCGGCATCAC CATCCTGCAC
GGTTCCATGG CACCTGAAGG CGCCGTCGTG AAGAGCGCGG GCTTCGACGC CGACGTTTTC
GAAGGCACGG CCCGCGTGTT CGAGCGCGAG CAGGGCGCCC TTGACGCGCT GGACAACGGC
AAAATCAACA AGGGCGACGT CGTGGTCATT CGCTATGAAG GGCCGAAGGG CGGCCCGGGC
ATGCGCGAAA TGCTCGCTAT CACCGGCGCC ATCAAGGGTG CCGGGCTGGG CAAAGATGTG
CTGCTTCTCA CGGATGGCCG CTTCTCCGGC GGTACCACCG GCCTGTGCAT CGGCCACGTC
GCGCCTGAAG CCGTCGACGG CGGTCCTATC GCCTTCGTCA AGGACGGTGA CCGCATCCGC
GTTGACATTG CCGCCCGCAG CTTCGACCTG CTGGTGGACG AGGCTGAGCT CGAGTCCCGC
AAGGTCGGCT GGGAGCCGCT CCCGGCCAAG TTCACCAAGG GCGTCCTGGC CAAGTACGCC
AAGCTGGTGC ACAGCGCCTC CACCGGCGCA TACTGCGGGT AG
 
Protein sequence
MSEDTQTATE NKPDIKPRSR VVTDGIHAAP ARGMFRAVGM GDDDFAKPQI GVASSWNEIT 
PCNLSLNRLA QGAKEGVHAG GGFPMQFGTI SVSDGISMGH EGMHFSLVSR EVIADSVETV
MQAERIDGSV LLAGCDKSLP GMLMAAARLD LASVFLYAGS IMPGWVKLED GSEKEVTLID
AFEAVGACAA GKMSRGDLDR IERAICPGEG ACGGMYTANT MACIGEALGM SLPGSAAPPS
ADRRRDEFAR KSGEAVVNLL RLGITARDIM TKKAFENAIA VTMAFGGSTN AVLHLLAIAR
EAEVELTLDD FNRIGDKIPH LGDLKPFGRY VMTDVDKIGG VPVIMKALLD AGLLHGDCLT
VTGKTLAENL ASINPPDLDG KILRALDNPI HKTGGITILH GSMAPEGAVV KSAGFDADVF
EGTARVFERE QGALDALDNG KINKGDVVVI RYEGPKGGPG MREMLAITGA IKGAGLGKDV
LLLTDGRFSG GTTGLCIGHV APEAVDGGPI AFVKDGDRIR VDIAARSFDL LVDEAELESR
KVGWEPLPAK FTKGVLAKYA KLVHSASTGA YCG