Gene Arth_2986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2986 
Symbol 
ID4444508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3354284 
End bp3355483 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content69% 
IMG OID639690809 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_832465 
Protein GI116671532 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACT CCCCAGACAA CAATGATGTT GTCATCCTCG CTGCGGCCCG CACGCCGCAG 
GGGCGCCTGA ACGGCCAGCT AGCCGGCTTC ACGGCGGTGG AGCTCGGGGC GCACGCCATC
AAGGCGGCCC TGGCTGCGAG CGGCGTTGCC GCGGAGCAGG TGGATGCGGT CATCATGGGC
CAGGTCCTGC AGGCGGGAGC GGGCCAGAAC CCCGCGCGGC AGAGCGCCAT CGGCGCCGGC
ATCGGCTGGA ACGTCCCCAC GGTCACTATC AACAAAGTGT GCCTTTCCGG CCTCACGGCC
GTGATCGACG CCGCCCGCAT GATCCGCAGC GGTGACGCCG CCGTCGTCGT CGCCGGCGGT
CAGGAATCCA TGTCCCGGGC GCCGCACATC CTGCCGGGTT CCCGGCAGGG TTGGACCTAC
GGGACTGTCC AGGCGCTGGA CGTGGCCGCG CATGACGGCC TGACCGACGC CTTCGACGGA
CAATCCATGG GGCTGTCCAC GGAAAGCAAG AACCTGGTTC TGGGCATCGA CCGGACCTCG
CAGGACAACG TGGCAGCCCA GTCCCACCAG CGCGCCGCCC TGGCCGCGAA GAACGGAGTT
TTCGACGACG AAATCGCCCC GATCAGCGTC AAACAGCGGA GGGGGGACCC GGTGGTGGTG
GCCACCGACG AAGGCGTGCG CCCGAACACG TCGGTCGAGT CGCTGGCCGG TCTCCGTGCC
GCGTTCGTCA GCGACGGCAC CATCACGGCA GGCAACTCCT CTCCCCTGTC CGACGGCGCT
GCCGCCCTGG TATTGACCAC CCGGAAGTTC GCGGAAGACA ACGGCCTGGA CTACCTCGCA
GTTGTGGGCA AGCCGGGCCA GGTTGCGGGC CCGGACAATT CGCTGCACTC GCAGCCGTCC
AATGCAATCA AGAGCGCCTT GGACCGTGCC GGATGGACCA CCGCGGACCT CGACTTCATT
GAGATCAACG AGGCCTTCGG TTCCGTTGCC GTCCAGTCGC TCAAGGACCT CCAGTACCCG
CTGGAGAAGT GCAACATCCA TGGCGGCGCC ATCGCGCTCG GGCACCCCAT CGGGGCCTCA
GGCGCCCGCC TGGCCGGACA TGCCGCGCAC GAGCTGAAAC GCCGCGGCTC CGGCAAGGCC
GCTGTATCCC TGTGCGGCGG CGGCGGGCAG GGCGAAGCCC TCCTCCTCTA CCGGGACTGA
 
Protein sequence
MSNSPDNNDV VILAAARTPQ GRLNGQLAGF TAVELGAHAI KAALAASGVA AEQVDAVIMG 
QVLQAGAGQN PARQSAIGAG IGWNVPTVTI NKVCLSGLTA VIDAARMIRS GDAAVVVAGG
QESMSRAPHI LPGSRQGWTY GTVQALDVAA HDGLTDAFDG QSMGLSTESK NLVLGIDRTS
QDNVAAQSHQ RAALAAKNGV FDDEIAPISV KQRRGDPVVV ATDEGVRPNT SVESLAGLRA
AFVSDGTITA GNSSPLSDGA AALVLTTRKF AEDNGLDYLA VVGKPGQVAG PDNSLHSQPS
NAIKSALDRA GWTTADLDFI EINEAFGSVA VQSLKDLQYP LEKCNIHGGA IALGHPIGAS
GARLAGHAAH ELKRRGSGKA AVSLCGGGGQ GEALLLYRD