Gene Arth_2539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2539 
Symbol 
ID4444947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2846349 
End bp2848250 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content65% 
IMG OID639690356 
Productacetolactate synthase 1 catalytic subunit 
Protein accessionYP_832018 
Protein GI116671085 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0377208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAG GATCGCCGAT CAGCCCCTCG CTGATGGCTA CAAAGTCCGC TGGAGCCCCC 
AAGGCTCCGG AACGCGCCGA CCGCACGGCC GACGCCGGCG TCGAGCACGC TGCTGCTGTC
TCTCCTGTCC TTGGACCGAA CAACGTCGTA CCCCCGACGG TGATGACCGG TTCGCAAGCA
ATTGTCCGCT CGCTCGAAGA ACTCGGCGTG GACGATATTT TTGGTTTGCC CGGTGGCGCG
ATCCTGCCCA CCTATGACCC CTTGATGGCC TCAAGCATGA ATCACGTGCT GGTCCGTCAC
GAACAGGGAG CCGGCCACGC CGCGCAAGGC TACGCCATGG TTACCGGGCG GGTTGGCGTT
TGTATTGCCA CCTCGGGTCC CGGGGCCACC AACCTCGTTA CCGCCATCAT GGATGCGCAC
ATGGACTCCG TGCCGCTCGT GGCCATCACC GGCCAGGTGT CCAGCGGAGT CATTGGTACG
GACGCTTTCC AGGAAGCGGA CATCGTGGGC ATCACCATGC CGATCACCAA GCACTCCTTC
CTGGTGACCG ACCCCAACGA CATACCGCAT GTCATGGCCG AGGCGTTCCA CCTTGCTTCC
ACGGGCCGGC CCGGCCCGGT GCTGGTGGAT GTCGCCAAGG ATGCCCAGCA AGGCCAGATG
ACCTTCTCCT GGCCGCCGAA GATCGACCTC CCCGGATACC GCCCCGTGCT CCGCGGCCAC
AACAAGCAGG TTCGCGAAGC AGCGAAGCTG ATTGCCGCGG CCAGCAAGCC CGTCCTGTAC
GTTGGCGGCG GCGTTGTGAA GGCGCACGCT TCCGCGGAGC TGCGGGAACT GGCCGAGATC
ACCGGCGCGC CCGTGGTCAC CACCCTGATG GCGCGGGGCG TGTTCCCTGA CTCGCACCCG
CAGCACGTCG GCATGCCCGG CATGCACGGC ACCGTCTCCG CCGTGACCGC GCTGCAGCAG
TCCGACCTGC TGATCACGCT CGGCGCCCGT TTCGACGACC GCGTAACCGG TGTCCTGAAG
ACGTTCGCCC CGAACGCGAA GGTGATCCAC GCGGACATCG ATCCTGCCGA GATCTCCAAG
AACCGCACAG CGGACGTTCC GATCGTGGGC TCCGTCAAGG AGATCATTCC GGAACTCACC
GAGGCCGTGA AGACGCAGTT TGCGGCGTCC GGCAAGCCGG ACTTGGAGAA CTGGTGGACG
TTCCTTAACA ATCTGAAGGA AACGTATCCG CTGGGATGGA CCGAGCCGGA GGACGGCCTC
ACCGCACCGC AGCGCGTCAT TGAGCGCATC GGTGCCCTGA CCGGCCCCGA AGGGATCTAC
GTTGCGGGCG TTGGCCAGCA CCAGATGTGG GCGGCGCAGT TCATCAAGTA CGAACGCCCC
CACGCCTGGC TGAACTCGGG CGGAGCCGGC ACCATGGGCT ACGCCGTCCC CGCAGCCATG
GGCGCCAAAG TGGGCGCGCC CGACCGCGTG GTCTGGGCCA TCGACGGCGA CGGCTGCTTC
CAGATGACCA ATCAGGAACT GGCCACCTGC GCCATCAACA AGATCCCCAT CAAGGTTGCC
GTCATCAACA ACTCCTCGCT GGGCATGGTG CGCCAGTGGC AGACCCTCTT CTACGAAGGC
CGCTACTCGA ACACCGACCT GAACACCGGC CACCAAACCG TCCGGATCCC GGACTTCGTG
AAACTGGGCG AGGCCTACGG CTGCGCATCC TTCCGGTGCG AACGCGCTGA GGACATCGAC
GCCACCATCC AGAAAGCCCT TGAAATCAAT GACCGCCCCG TCGTGATCGA CTTCGTGGTG
AGCCCAGACT CCATGGTGTG GCCGATGGTG CCCGCCGGAG TGAGCAACGA CCAGATCCAG
GTCGCCCGCA ACATGACCCC GGAATGGGAA GAGGAGGACT GA
 
Protein sequence
MSKGSPISPS LMATKSAGAP KAPERADRTA DAGVEHAAAV SPVLGPNNVV PPTVMTGSQA 
IVRSLEELGV DDIFGLPGGA ILPTYDPLMA SSMNHVLVRH EQGAGHAAQG YAMVTGRVGV
CIATSGPGAT NLVTAIMDAH MDSVPLVAIT GQVSSGVIGT DAFQEADIVG ITMPITKHSF
LVTDPNDIPH VMAEAFHLAS TGRPGPVLVD VAKDAQQGQM TFSWPPKIDL PGYRPVLRGH
NKQVREAAKL IAAASKPVLY VGGGVVKAHA SAELRELAEI TGAPVVTTLM ARGVFPDSHP
QHVGMPGMHG TVSAVTALQQ SDLLITLGAR FDDRVTGVLK TFAPNAKVIH ADIDPAEISK
NRTADVPIVG SVKEIIPELT EAVKTQFAAS GKPDLENWWT FLNNLKETYP LGWTEPEDGL
TAPQRVIERI GALTGPEGIY VAGVGQHQMW AAQFIKYERP HAWLNSGGAG TMGYAVPAAM
GAKVGAPDRV VWAIDGDGCF QMTNQELATC AINKIPIKVA VINNSSLGMV RQWQTLFYEG
RYSNTDLNTG HQTVRIPDFV KLGEAYGCAS FRCERAEDID ATIQKALEIN DRPVVIDFVV
SPDSMVWPMV PAGVSNDQIQ VARNMTPEWE EED