Gene Pars_2063 details


Gene Information

Locus tag: Pars_2063
Symbol: aksA
ID: 5056297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1844745
End bp: 1845914
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 55%
IMG OID: 640469612
Product: trans-homoaconitate synthase
Protein accession: YP_001154261
Protein GI: 145592259
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR02090] isopropylmalate/citramalate/homocitrate synthases
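
The reported gene and protein lengths can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1844745
end_bp = 1845914

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1170 389
```

Both values agree with the Gene Length and Protein Length fields.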


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.214289
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAGTTGT CCGCCTATGG CTTCGGGGCT CGTACAATAA GGATATTCGA CACAACTCTC 
AGAGACGGCG AGCAGATGCC CGGCGTCGCC TTGTCACCTT CTGAAAAGTT GCAAATCGCC
CTAGCTCTAG ACGAGGCAGG TGTAGACATG ATAGAGGCGG GTTTTGCCGC GGTCTCAAAA
GACGAACAGA TGGCCATTAG GCAGATCTCT AAAGAAGTGG CGACAGCCAA GGTGGTTAGC
CTTGCCCGCA TGGCGAAATC TGACGTCGAC GCGGCTCTCG ATGCCGATGT GGATATGATC
CACTTGTTCA TTGCCACGTC TGATATACAC CTGAAGTATA AGCTTGGCAT TACGAGGGAG
GAGGCCATTA GGCGGATAGA GGAGGTGGTC TCATACGCCA AATCGCACGG GGTCGACATA
TTGTTCAGTG CAGAAGACGC CACAAGGAGC GACCTAGATT TCCTGGTGGA GGCTTATAAG
ACAGCTATCA GCGCCGGCGC CGACGAGATC AACGTCCCAG ACACGGTGGG TGTGATGACC
CCTAGCCGGA TGGTCTATCT TATAGGCTAC CTAAAGCAGA GGCTCCCCCC GGTGCCTATG
CACGTCCACT GCCACGACGA CTTCGGCATG GCTGTGGCCA ATACAGTAAC CGCCATAGAA
AACGGCGCCG ACGTGGCGCA GGTGGTTGTT AACAACTTCG GCGAGAGGGC CGGCAACGCG
GCGCTGGAAG AGGTAGTAGC CGCAGTGCAC TACCTGCTTG GCTACAAGAC AAATATCAAG
TTGGAGAAAC TCTACGAGCT GTCGCAGTTA GTGTCTAAAC TATTCGGCAT CCCCGTGCCG
CCGAATAAGG CGGTGGTGGG GGAAAACGCC TTCAGCCACG AGGCCGGCAT CCACGTACAC
GGCGTTTTGA ACAACCCATT CACCTACGAG CCTATGAGGC CTGAAGACGT GGGCAATCGG
CGCAGGATAG TCCTCGGAAA GCATTCGGGG AGACACGCGG TGGTGTGGGC TTTGAAGAAC
ATAGGCGTCG AGCCCACAGA CGACTTGGTG GACTACGTCT TAAACGCCGT GAAGGAGCTG
GCTGTGAGGA AAGTAAAGGT GGACGAGTCT GTTCTTAGGC AAGTCGTAAA TGATTATAGG
AGGGGGGTAT TTGTACCCTA TGCCGTATAA
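
The gene sequence is displayed in blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample is only the first line of the sequence, so its GC fraction is not expected to equal the 55% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = "TTGAAGTTGT CCGCCTATGG CTTCGGGGCT CGTACAATAA GGATATTCGA CACAACTCTC"
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))
```

Passing all 20 lines of the gene sequence to `clean_sequence` should give a string whose length matches the Gene Length field.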
 
Protein sequence
MKLSAYGFGA RTIRIFDTTL RDGEQMPGVA LSPSEKLQIA LALDEAGVDM IEAGFAAVSK 
DEQMAIRQIS KEVATAKVVS LARMAKSDVD AALDADVDMI HLFIATSDIH LKYKLGITRE
EAIRRIEEVV SYAKSHGVDI LFSAEDATRS DLDFLVEAYK TAISAGADEI NVPDTVGVMT
PSRMVYLIGY LKQRLPPVPM HVHCHDDFGM AVANTVTAIE NGADVAQVVV NNFGERAGNA
ALEEVVAAVH YLLGYKTNIK LEKLYELSQL VSKLFGIPVP PNKAVVGENA FSHEAGIHVH
GVLNNPFTYE PMRPEDVGNR RRIVLGKHSG RHAVVWALKN IGVEPTDDLV DYVLNAVKEL
AVRKVKVDES VLRQVVNDYR RGVFVPYAV
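
The protein sequence starts with M although the gene sequence starts with TTG. Under translation table 11, TTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this with a hand-built codon table; `translate_cds` is a hypothetical helper, and reading the first codon as M unconditionally is a simplification that assumes the input is a complete CDS.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is a start codon and is read as Met,
        # even when it is an alternative start such as TTG.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate_cds("TTGAAGTTGT CCGCCTATGG C"))  # MKLSAYG
```

The output matches the first seven residues of the protein sequence shown above.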