Gene Achl_0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_0143 
Symbol 
ID7291569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp156438 
End bp158198 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content65% 
IMG OID643588542 
ProductDihydroxy-acid dehydratase 
Protein accessionYP_002486235 
Protein GI220910926 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones100 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGACC CGAATTACAT GGATCTCCGC AGCGCACGCT GGTTCGCGCC GCACGATCTC 
ACCGGTTTCG TGCACCGTAC CGCCATCCAG GCCGAAGGTT TCTCGCGCTT CGCCATCAAG
GACCGGCCGG TGATCGGCAT CGCCAACTCC TGGTCGGAGC TGGTCAACTG CAACATCCAT
TTCAAGCTGC TTGCCGAGGC TGTGAAGCGC GGCGTCCTAA TGGCCGGGGG CCTACCCTTG
GAGTTCCCCA CCATCTCCCT GGGGGAGAGC CTGATGAAGC CCTCAGCCAT GCAGTTCCGC
AACCTCATGG CCATGGACGT GGAGGAATCC ATCCGCGCGT ATCCGCTGGA TGCGATCGTG
CTGCTGGGCG GCTGCGACAA GACCGTTCCT GCCCAGCTCA TGGGCGCCGC CAGCGCCGAT
ATTCCCACCA TCATGCTCAC CGGCGGCCCC CAGGAGCCGG CCCACTTCCG GGGCAAGCAG
CTTGGCGTCG GAACGGACAC CTGGAAGTAC GCAGACGAGC TGCGGGCCGG TAAGATCACC
GAGGCCGACT TTGACGAGCT CGAATCCGCG GCCAAGCCTT CCGCTGGCCA CTGCAGCGAA
ATGGGCACGG CGTCCACCAT GACGTCCCTC GTTGAGGCCT TGGGCATGTG TCTGCCCGGC
AGCGCTTCCA TTCCGGCCGT CGATTCACGC CGCGGCCAGG CAGCAGAGGC CACGGGACGC
CGGGCCGTGG AAATGGCATT GTCCCAGGGG CCGAAGCCCA GCGAAATTCT GACCAAGGAA
GCGTTCGATA ACGCCATTAC GCTCCTCATG GCCGTGGGCG GATCCACCAA CGCCGTGGTC
CACCTCCTGG CGTTGGCGCG AAGGGTGGGC TACGAACTGC AGCTTGACCG CTTCCACGAA
ATTTCGCAGC GGACCCCGCG CATCGTAAAC GTCCGTCCTT CCGGCGAGTA CCTCGTGAAG
CAACTCTTCG AGGTTGGCGG CATTCCCACC GTGCTCAAGG CCCTTGACCC CCTGCTGAAC
CGGGACGCCA TAACCGTCAC CGGCGAGTCC CTCGAGAAGG GCTACATCCA CGCGCCCGAG
GCGGATGGAG TCGTCGTGAG CTCGCTTGAG GCGCCCTTCG ACGCCTCCGG TGGCATCGCC
GTCGTCCGTG GTTCCCTGGC TCCGAACGGT GCGGTGATTA AGCGCAGTGC AGCTTCTAAG
GACCTGCTGC AGCACAAGGG CTCGGCCATT GTCTTCGACG ACATCTACGA TCTCGGACGG
CGGATCGACG ATCCGGACCT GGACATCACC GAGGATTCGG TCCTGGTGCT CCGTAACAGC
GGGCCCGTCG GCGCGCCCGG CATGCCCGAG TGGGGCATGC TGCCAATCCC GCAGAAGCTG
CTGCGCAGGG GCATCCGGGA CATTGTGCGC ATCTCCGATG CCCGCATGAG CGGCACCGCA
TTCGGCACGA CCGTGCTCCA TGTCTCGCCC GAGGCTGCGG TAGGTGGTCC GCTGGCGATC
GTCCGTGACG GCGATCCGAT AGTGCTGGAT GTCGAGAACC AGCGGCTGGA CCTTGATCTC
CCCGAGGAAG AGATCGAGGC CAGGCTTGCG GAGTTGAAGC TGCCCGAGCC CAAGTACCGC
CGCGGCTATG GACGCCTGTT CCTCGACCAC GTCAACCAAG CGCACGAAGG CTGCGACTTC
GACTTCCTCA AAGGCCTGCC GGATGAGGAG CCCCAGCGGC TGCCCTACGG CCTGATGAGC
GGCTGGCAAG GCGGCTGGTA G
 
Protein sequence
MPDPNYMDLR SARWFAPHDL TGFVHRTAIQ AEGFSRFAIK DRPVIGIANS WSELVNCNIH 
FKLLAEAVKR GVLMAGGLPL EFPTISLGES LMKPSAMQFR NLMAMDVEES IRAYPLDAIV
LLGGCDKTVP AQLMGAASAD IPTIMLTGGP QEPAHFRGKQ LGVGTDTWKY ADELRAGKIT
EADFDELESA AKPSAGHCSE MGTASTMTSL VEALGMCLPG SASIPAVDSR RGQAAEATGR
RAVEMALSQG PKPSEILTKE AFDNAITLLM AVGGSTNAVV HLLALARRVG YELQLDRFHE
ISQRTPRIVN VRPSGEYLVK QLFEVGGIPT VLKALDPLLN RDAITVTGES LEKGYIHAPE
ADGVVVSSLE APFDASGGIA VVRGSLAPNG AVIKRSAASK DLLQHKGSAI VFDDIYDLGR
RIDDPDLDIT EDSVLVLRNS GPVGAPGMPE WGMLPIPQKL LRRGIRDIVR ISDARMSGTA
FGTTVLHVSP EAAVGGPLAI VRDGDPIVLD VENQRLDLDL PEEEIEARLA ELKLPEPKYR
RGYGRLFLDH VNQAHEGCDF DFLKGLPDEE PQRLPYGLMS GWQGGW