Gene Pars_0795 details


Gene Information

Locus tag: Pars_0795
Symbol:
ID: 5054710
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 707997
End bp: 709106
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 58%
IMG OID: 640468356
Product: acetyl-CoA C-acyltransferase
Protein accession: YP_001153033
Protein GI: 145591031
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
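
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 707997
end_bp = 709106

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS ends with a stop codon that is not translated,
# so the protein has one residue fewer than the codon count.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under that assumption the computed values agree with the Gene Length and Protein Length fields.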


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGACG TATACGTAGT AGGCGGGGCC CTCCACCCCG CTGGGCGTCA TTACGACAAG 
AATATCGACG ATCTCGCTGC CGCCGTGTTA GACAAGGCGA TCGCAGACGC CCAGGCGGAT
ATAGAGGCCC TCTTTTTGGC CTCCTCCACC GCCGAACTCG GCAACAAGCA ACAACTACTC
GGCGTTTACG TGTTGGAGTC GCTTGGCTTG GATAAGATAC CGGTTTTTAG GATTGAAAAC
GGCGATGGAT CCGGCGGCGC CGCAGTGGTA GCGGCGTACC ACGCGTTAAG GGCCGGGGAG
TACAACTGCG TTGCCGTCGT GGGCGTGGAT AAGCCAAACG ACGTCTTGAG CAACCAGCAA
CAGGACATAT ACGCCACCAC GCTCGACACT CACTTCGAGC GGTACTTCGG CTTCACCCCA
CTCTCCTACG CTGCGCTTAT GGCGAAGATG TACTTAAAGA AGTACGAGTA CAAGTACGAG
GACTTGGCCA GGTGGGCTGT CCTAATGCAC GCTCACGGAG CTGGGAATCC CTACGCCTAT
TTCAGACGCC CCGTCAAGCT GGAAGACGCC GTGAACAGCG AAGTTGTCAG CGAGCCTCTC
CGCCTATACG ACGTAGGCCC CTTGGCCGAC GGGGCGGCGG CCGCCGTGTT GTGCAACAAC
AAAAAGAAAG ATGGGCCACG GATACTTTCA GTGACGACCT CGACAAATGC TGTGGGCTTC
AACGCGAGGA ACGAATACGA CGTCCTCTAC AGCCTCGAAG AAGCGGCGAG AAGCGCGCTG
AAAAAAGCCG GCGTTACGCC TAGGGACATC GCCGCGGCGG AGGTCCACGA CTCCTTCTCC
ATATTCGGCG CATTGGCGTT AGAGGGGCTT GGCATTGTGA AGAGGGGAGG CGCTCTGGCC
GCGTTGAGGG AAGGGGACTT GCCGGTGAAT CTCAGCGGCG GTTTTAAGGC CCGGGGGAAT
ATTCTAGGCG CCACCGGCGT GTACCAAGTG GTGGAGTTGG CGTGGCAACT CATGGGCCGG
GAGTTTAAAC GGGTTGAGGG CAACTACGGA GTTGTCCACA GCATGGGCGG CGTAGATAGG
GTTTCGACAG TTATTGTAGT AGGATTATGA
 
Protein sequence
MKDVYVVGGA LHPAGRHYDK NIDDLAAAVL DKAIADAQAD IEALFLASST AELGNKQQLL 
GVYVLESLGL DKIPVFRIEN GDGSGGAAVV AAYHALRAGE YNCVAVVGVD KPNDVLSNQQ
QDIYATTLDT HFERYFGFTP LSYAALMAKM YLKKYEYKYE DLARWAVLMH AHGAGNPYAY
FRRPVKLEDA VNSEVVSEPL RLYDVGPLAD GAAAAVLCNN KKKDGPRILS VTTSTNAVGF
NARNEYDVLY SLEEAARSAL KKAGVTPRDI AAAEVHDSFS IFGALALEGL GIVKRGGALA
ALREGDLPVN LSGGFKARGN ILGATGVYQV VELAWQLMGR EFKRVEGNYG VVHSMGGVDR
VSTVIVVGL
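
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own tooling: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in permitted start codons). Whitespace is stripped so the 10-base blocks shown above can be pasted in directly.

```python
# Codon-to-amino-acid assignments, with codons ordered by base T, C, A, G
# at each position (TTT, TTC, TTA, TTG, TCT, ...). These assignments are
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Remove the spaces and line breaks used for display formatting.
    seq = "".join(dna.split()).upper()
    # Ignore any trailing bases that do not form a complete codon.
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[n:n + 3]] for n in range(0, usable, 3))


# First and last codons of the gene sequence listed in this record.
head = translate("ATGAAAGACG TATAC")
tail = translate("GTTTCGACAG TTATTGTAGT AGGATTATGA")
print(head, tail)
```

Applied to the first fifteen bases and the final thirty bases of the gene sequence, the sketch reproduces the start (MKDVY) and the end (VSTVIVVGL, followed by the stop codon) of the listed protein sequence.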