Gene Pars_2059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2059 
Symbol 
ID5054760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1840451 
End bp1841920 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content57% 
IMG OID640469608 
Product2-isopropylmalate synthase 
Protein accessionYP_001154257 
Protein GI145592255 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR02146] homocitrate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0773223 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTATATA GCTGGCTAAG CTTTTTAAAT AGTTGTTATC TGTGGTGGCA TGTCCCAGGG 
GAAAGCCGCG TCTCAATACT AGACTCAACG CTTAGAGAAG GAGAGCAGAC TCCAGGGGTT
GTATTCTCGG AGGAGTGGCG TGTTAGAATT GCCAAGGCCC TCTCGGATAT AGGCGTTGGG
ATGATAGAAG TGGGCGATCC CAGCGTGGCC CCCGACATAA AGTCGGCGAT AAAGAAGATC
GTTAAGTTGA AGAAGGACGG CGAGATTAGA AGCCATATCG TCGTCCACAG CAGGGCGGTG
AAGCAAGATC TGGAAAACGC CGCGTCTCTG GAGCCGGATA GGGTCGCGGT GTTCTACGGG
GTGAGCGACT TGCACTTGAA GCACAAGCAC AGGAAGACGA GGGAGGAGGC GCTGTCCATA
ATTGCAGAGA TGGTCTCCTT TGCCAAGTCA CACGGCGTTG CTGTTAGGTT CACCGCAGAG
GACGCATCGA GGGCCGATCT CGGCTACTTG ATAGAGGTCG TAAAGACCGC ATATGAGGCC
GGCGCGGATC GGGTCAGCAT AGCGGACACG GTAGGGGTCT TCACGCCGGA CAGAGCCAGG
GAGGTGTTCG CCAAGGTGAA GGCCGCCGTG CCGGGGGTTG GGCTGGACAT ACACGCACAC
AACGACTTCG GGATGGCCGT GGCCAACAGC TTGGCGGCGG TTGAGGGCGG GGCAGACGTA
GTCCACACCA CGGTGAACGG GCTTGGGGAG AGGGCAGGCA TCACGCCCCT GCAAGTCTTC
GCCGCGGCTT ACTACTACCA CAAGGGGGTT AAGCTAGTGG AGCTGGAAAA GCTACCCGAG
ATAACCGCCA TGGTAGAGGC GGCCAGCGGG ATAACGCTGA TGCCGACATA CCCTATTGTT
GGAGAAAACG CCTTTACGCA CAAGGCCGGG GTACACCAGG CAGGCGTCTT GGCCAACCCC
GAGACGTACG AGCCGATCCC GCCAGAGGTG GTGGGGAGGA CGCGCGACTT CTCTCTGGAC
AAGTACAGCG GGAGGAAGGC TATCCAGCAC CGCTTGGAGA AGCTAGGCGT CTCGCTCCAG
CCTGAGGCCT TGGACAAGGT GGTGGAGGAG GTTAAGAGGA TGAACGCGCC TAGGCTGAGG
GACGAGGATT TGTTAGAGAT AGTGGAAAAG GTGTCCGGCG TCAGGTACAG GGCTATCGTC
AACAGGCACA TAGAGGCCTA CATCTGGCTT AAGGTGGCGA ACAACGTGTA CACCACCTCA
GTGGCGAGGA GGGTCGCCGC CTTGAAAAAC GTCGTGTCGG TCTCCGAAAT AACTGGCGAG
TACGACATCG TCGTTAAACT TGTGGCGGAG AACACCGAGG AGCTGAACCA GGCGATTGAA
AGCATAAGGC AGATCAAAGG CGTCGCCTCG ACCTTCACCA GTATTGTGCT GAAGGAGTTG
CCGACGATCA GCATGCAGAC CAGGGCTTAA
 
Protein sequence
MLYSWLSFLN SCYLWWHVPG ESRVSILDST LREGEQTPGV VFSEEWRVRI AKALSDIGVG 
MIEVGDPSVA PDIKSAIKKI VKLKKDGEIR SHIVVHSRAV KQDLENAASL EPDRVAVFYG
VSDLHLKHKH RKTREEALSI IAEMVSFAKS HGVAVRFTAE DASRADLGYL IEVVKTAYEA
GADRVSIADT VGVFTPDRAR EVFAKVKAAV PGVGLDIHAH NDFGMAVANS LAAVEGGADV
VHTTVNGLGE RAGITPLQVF AAAYYYHKGV KLVELEKLPE ITAMVEAASG ITLMPTYPIV
GENAFTHKAG VHQAGVLANP ETYEPIPPEV VGRTRDFSLD KYSGRKAIQH RLEKLGVSLQ
PEALDKVVEE VKRMNAPRLR DEDLLEIVEK VSGVRYRAIV NRHIEAYIWL KVANNVYTTS
VARRVAALKN VVSVSEITGE YDIVVKLVAE NTEELNQAIE SIRQIKGVAS TFTSIVLKEL
PTISMQTRA