Gene Pars_2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2097 
Symbol 
ID5054218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1872651 
End bp1874141 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content62% 
IMG OID640469647 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_001154295 
Protein GI145592293 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.588766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGACG AGCTCGGCGT AGACTATATA GAGGGCGGAT GGCCCTACTC CAACCCCAAA 
GACCTGGACT TCTTCAAGGC GATGAGGGAA TACCCACTCG CAAAGGCCAA GCTAGCCGCT
TTTGGAAGCA CGAGGAGAAA GGGGGTGAAG CCTGAGAAAG ACGAAAACCT AAACGCGATA
GTAAAGGCGG ATGTCCCCGT TGCGGTTATC TTCGGCAAGA GCTGGACTCT CCACGTGGAG
AAGGTGCTGG AAGCCACCTG GGAGGAGAAC TTGGCTATGA TAGCGGAGAG CGTGGAGTAC
CTAAAATCCC ACGGCATGGA GGTGATCTAC GATGCCGAGC ACTTTTTCCA GGGGTATCAG
GAGGACCCGG AGCGGGCGCT GGCCTCTATA GAGGCTGCCT GGAGGGCGGG GGCTAGGGTT
GTGGTGCTGG CCGACACCAA CGGCGGGACT CCTACGCACG AGGTGTATAG AATAACAGCA
GAGGTGAAGA GGAGGTTCCC CGCGATGCCG CTGGGAGCCC ACATGCACAA CGACATCGGT
TGCGCCGTGG CTAACACCCT AATGGCAGTG GCCGCCGGGG CTAGGCACGT CCAGGGAACA
ATAAACGGAG TGGGTGAGCG GACGGGCAAT GCGGACCTGA CCGCGGTTTT GCCGACGCTG
GAGCTGAAGA TGGGCTTCAA GGTCCTGGGC GGCTCCCCGC CCCGGGTTAA GTTCGCCAAG
CTGAGGGAGG TGTCACGCTT CGTCTACGAG GCCTTGGGGA TGAGCCCAAA CCCATATCAG
CCCTACATCG GCGACTACGC CTTTGCCCAC AAGGGAGGGG TACACGCCGC GGCTGTGATG
AAGGTGCCCA GGGCATACGA GCACATAGAC CCCGAGCTGG TGGGCAACAG GAGGGTCTTC
GTCGTGTCGG AGATGGCCGG CGCCGCCAGC GTGGTGCTGA AGGCGGCGGA GGAGCTGGGG
ATATCGCTAG ACAAGCGCCA GGAGGCTGTG AGGGCGGCGC TGGAGGAGAT AAAGGCGCTG
GAGAGGCAGG GCTACTCCTT TGACTCGGCC CCGGCCTCCG CCATGCTGAT ACTGCTTAGG
CACATGGGGC TCTACCAGGA GAGGTTTAGG CTAGTGGAGT GGCGCGTGGT CACCGGCCCC
ACCAACACGT CCTACGCCGT GGTGAAGGTA TGGGTAAGCG GCGAGGTAAA GCTGGAGGCC
GGCGAGGGCG TCGGCCCCGT ACACGCCGTC GACGTTGCGC TGAGGCGCGC GCTGGTGTCA
GCCTTCCCGG AGCTGGCGGA GGTTAGGCTG AGGGACTACA AGGTGGTGCT CCCCACTGCG
GTAAGGAGCA CGGAGAGCGT GGTGAGGGTC ACCGTTGAGT TTACCGACGG CGGGAGGATA
TGGCGCACAG TCGGCGTATC CAGCAACGTC GTCGAGGCGT CGATCAAGGC GCTGGTCGAC
GGCTACGACT TCGCCCTACA GCAGAGGCAG TTGCAAAACC GCAAGGCCTA G
 
Protein sequence
MLDELGVDYI EGGWPYSNPK DLDFFKAMRE YPLAKAKLAA FGSTRRKGVK PEKDENLNAI 
VKADVPVAVI FGKSWTLHVE KVLEATWEEN LAMIAESVEY LKSHGMEVIY DAEHFFQGYQ
EDPERALASI EAAWRAGARV VVLADTNGGT PTHEVYRITA EVKRRFPAMP LGAHMHNDIG
CAVANTLMAV AAGARHVQGT INGVGERTGN ADLTAVLPTL ELKMGFKVLG GSPPRVKFAK
LREVSRFVYE ALGMSPNPYQ PYIGDYAFAH KGGVHAAAVM KVPRAYEHID PELVGNRRVF
VVSEMAGAAS VVLKAAEELG ISLDKRQEAV RAALEEIKAL ERQGYSFDSA PASAMLILLR
HMGLYQERFR LVEWRVVTGP TNTSYAVVKV WVSGEVKLEA GEGVGPVHAV DVALRRALVS
AFPELAEVRL RDYKVVLPTA VRSTESVVRV TVEFTDGGRI WRTVGVSSNV VEASIKALVD
GYDFALQQRQ LQNRKA