Gene Pars_1724 details


Gene Information

Locus tag: Pars_1724
Symbol:
ID: 5054723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1552849
End bp: 1554495
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 54%
IMG OID: 640469267
Product: acetolactate synthase, large subunit, biosynthetic type
Protein accession: YP_001153927
Protein GI: 145591925
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID: [TIGR00118] acetolactate synthase, large subunit, biosynthetic type
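
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic; the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1552849, 1554495)
print(length, protein_length_aa(length))
```

With the values from this record, the result agrees with the listed 1647 bp and 548 aa.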


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.00248016
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTCTTT ACGATGCGCT GTACAACCAA GATATTCAGC CCATTATGTT TAGACACGAG 
CAAGGCGCCA TACACGCCGC GGAGGCCTTC GCAAGAGTCT CGGGAAGACC CGCCGTGGTG
GCCGTCACCA GCGGGCCTGG AGCCACAAAC CTAGTTACTG GCCTGGCTAA CGCGTACATG
GATTCGGCAC CTGTAGTGGC TATCACAGGC CAGGTGCCCA CTAGCGTTTT CGGCAGAGAT
GGCTTCCAGG AAACAGACAT ATTGGGCGTG GTTACTCCTA TTACTAAATT CGCATACCAA
GTCAAGAAGG CCGAAGAGGC AGTGGCTGCT TTTAAGACTG CATACGAGAT TTCGATAATC
GGAAGGCCGG GACCCACCTT AGTCGATATA CCGAGAGATA TACAGCTCGC GGCCGCTCCA
GATCGCGAGG AAAAAATTCC AGTAAACAGG GAGAAGTTTA TACCTCCGCC ACCCGACGAG
GATAAACTTA GGCTGGCCGC CAAGTATTTA ATAGAGGCTA GAAGACCGGT GGTATTAGTC
GGCGGCGGCG TGTTGTGGTC AGGCGCGACA CCGGAGGTCT TGGAGTTATC CCGCATGCTG
TACGCCCCTA TCGTGTCGAC GTTGCCGGGC AAGGCGGCGG TGCCGCATGA CTATCCTCTC
TACATGGGCC CCGCCGGGAT GCACGGAAGA GCTGAGGCAG ATGCCGCGTT GGCAAACGCA
GATGTCATCT TGGCAGTCGG CACGAGATTC AGCGACAGAA CATGGGGCAG GTTTAAAGAA
CTCCAAGAAA GCGTGAGGAG TGGGAGAGTA AAGCTGATAC ACATAGACAT AGACAAGAGC
GAGATTGGGA AAAACGTGAA GCCCACAATA GGCGTCGTCG CCGATGCCAA AGAGGCGCTT
GGGACCCTGC TCAAATTTGT AGACGCCGGC GCCCAGAGAG ATGAAAAATT CATGTCATGG
TTACTGGAGA TACGGAAGAA GTATGAGGAG GCAATGTCTA AAGTTGCCGA CACGTCTAAG
GCCTTTCATC CCTGGCGTGT GCTTAAAGTA CTCAGAAGGG CCGCGCCGAG GAACACCATC
ACGACGACGG GCGTTGGGAG CCACCAGATG TGGGCGGAGG TGGCATGGGA GGTGTTCGAG
CCTGGCACTT TCATCACCTC TGCAGGCCTC GGTACAATGG GCTTCTGCGT GCCGGCGGCA
CTGGGGGCAA AACTGGCCGA CCCGACAAGG CCGGTTCTGT GCATTGACGG GGATGGGTCG
TTCCAGATGA CCATGAACAA CTTAGCCTTA GTGAGGGAGT ACGACCTACC CATAGTTGTG
ACAATCTTCG ACAACAGAGC CCTGCAACTG GTAAAACAAT GGCAGATATA CCTCTACAAG
CGGAGGATTA TAGCCACGGA GTTTGGCAAA ATGCCCGACT TCATGAAAAT AGCTGAGGCA
TACGACATAG AGGGGGTAAA GCCGGAGAGC TACGACCAGT TAGAAAAGGC TGTGGCCAAG
GCTTTGAGAA ACAACGAGGC GCTGATAGTA GACTTGACGA TTGACAGCGA AGAGGACATA
GTGCTACCCT GGGTCAAGCC AGGCGACTGG CTCACATCGG CGTTACTCCC AGAGGGCGTG
AATACGAAGT TGGTATATGA GAATTAA
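
The GC content field (54% in this record) can be checked against the gene sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the whole sequence; the function name is hypothetical, and the page does not state how its own value was computed or rounded.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring the spaces and line breaks
    used to group the sequence into blocks of ten."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, as a short usage example.
print(gc_percent("ATGGCTCTTT ACGATGCGCT"))
```

Passing the full gene sequence, with its spacing left in place, gives a value to compare with the listed figure.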
 
Protein sequence
MALYDALYNQ DIQPIMFRHE QGAIHAAEAF ARVSGRPAVV AVTSGPGATN LVTGLANAYM 
DSAPVVAITG QVPTSVFGRD GFQETDILGV VTPITKFAYQ VKKAEEAVAA FKTAYEISII
GRPGPTLVDI PRDIQLAAAP DREEKIPVNR EKFIPPPPDE DKLRLAAKYL IEARRPVVLV
GGGVLWSGAT PEVLELSRML YAPIVSTLPG KAAVPHDYPL YMGPAGMHGR AEADAALANA
DVILAVGTRF SDRTWGRFKE LQESVRSGRV KLIHIDIDKS EIGKNVKPTI GVVADAKEAL
GTLLKFVDAG AQRDEKFMSW LLEIRKKYEE AMSKVADTSK AFHPWRVLKV LRRAAPRNTI
TTTGVGSHQM WAEVAWEVFE PGTFITSAGL GTMGFCVPAA LGAKLADPTR PVLCIDGDGS
FQMTMNNLAL VREYDLPIVV TIFDNRALQL VKQWQIYLYK RRIIATEFGK MPDFMKIAEA
YDIEGVKPES YDQLEKAVAK ALRNNEALIV DLTIDSEEDI VLPWVKPGDW LTSALLPEGV
NTKLVYEN
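
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. The following is a rough sketch of that step, not the tool used to build this page. Table 11 assigns the same amino acid to every codon as the standard code and differs in the start codons it allows; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and newlines
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGGCTCTTT ACGATGCGCT GTACAACCAA"))
```

The first thirty bases translate to MALYDALYNQ, which matches the first block of the protein sequence.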