Gene Pars_0457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0457 
Symbol 
ID5055311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp402546 
End bp403721 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content58% 
IMG OID640468022 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001152707 
Protein GI145590705 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAG TCGCAGTTGT AGGTGTCGGG ACTAGTAAAT TTGGAAATAG GACAGACGTC 
TCCCTCCCAG AGCTTGCGTG GGAGTCAGTA AAGGAGGCGC TCGACGACGC GCGCCTCGGC
ACTGAGGATA TCGAGGCTTT TGTTGTGGGT AATGTGGGCG GGTGGTCTTC GGAAATGTTG
CCGGCGGTGG TGGTAGGCGA GTACTGCGGC CTCGTCCCGA AAAGCGGCGT GAGGGTGGAG
GCGGCTTGCG CCACCGGCTC CGCAGCGGTT AGAACAGCAT ACCACATGAT CGCCAGCGGG
GAGGCGGACA TAGTGATGGC AATTGGGGTG GAGAAAATGA ACGAGTCGCC CACCCCCACA
GTGGTGGAGT TCATAGGCAG AGCGGGGAAC TACTTCTGGG AATTCGAGAA CTTCGGGCTC
ACCTTCCCCG GCTACTACGC CCTATATGCC ACCGCCTACA TGCACAGATA CGGCGCCACC
GAGGAGGATC TGTGCAAAGT AGCCGTCAAG AACCACTACT ACGGCTCTCT CAACCCCAAG
GCGCAGTTCC AAAAGGCCAT TACCGTGGAG GAGTGTCTAA ACTCGCGCTA CGTCGCCTGG
CCGCTTAAGC TCTACGACAG TAGCCCAATT ACAGACGGCT CATCCGCCGT GATCCTGGCA
AGCGAAGAAG CGGCGAAGAA GATTACGGAC ACGCCGGTGT GGATAAAAGC CATAGGATAC
GCCAACGGCA CGGCAAATCT GAGCAAGAGG CTTGACTTCA TAGGCCTAGA GGCCGCCCAA
ATCGCAGCCC AGATGGCCTA CAAGAAGGCC GGCATAGACC CACAAGAGCC TGTTAAGTAC
CTAGACGTGG CCGAGGTACA CGACTGCTTC ACCATCGCAG AGATAATGGC CTACGAAGAC
TTGGGCTTTG CGAAGAGGGG CGAGGGCTAC AAGCTGGTGA GGGAGGGCCA GACCTACATC
GGTGGCCTAA TACCCGTAAA CGTCGACGGC GGTCTAAAGG CCAAGGGACA CCCAATAGGC
GCAACCGGCG TTTCGATGAT CGCGGAGCTT ACGAGGCAAC TGAGGCAACA AGTAGAGAAG
TCGAGGCAGG CGCCGATAAG GAAGGGGATG GCGTTGGCGC ACAACATAGG AGGCACAGGC
CACTACGCCT TTGTGACAAT CCTCAGCCTG AGCTGA
 
Protein sequence
MRKVAVVGVG TSKFGNRTDV SLPELAWESV KEALDDARLG TEDIEAFVVG NVGGWSSEML 
PAVVVGEYCG LVPKSGVRVE AACATGSAAV RTAYHMIASG EADIVMAIGV EKMNESPTPT
VVEFIGRAGN YFWEFENFGL TFPGYYALYA TAYMHRYGAT EEDLCKVAVK NHYYGSLNPK
AQFQKAITVE ECLNSRYVAW PLKLYDSSPI TDGSSAVILA SEEAAKKITD TPVWIKAIGY
ANGTANLSKR LDFIGLEAAQ IAAQMAYKKA GIDPQEPVKY LDVAEVHDCF TIAEIMAYED
LGFAKRGEGY KLVREGQTYI GGLIPVNVDG GLKAKGHPIG ATGVSMIAEL TRQLRQQVEK
SRQAPIRKGM ALAHNIGGTG HYAFVTILSL S