Gene Arth_0489 details


Gene Information

Locus tag: Arth_0489
Symbol:
ID: 4447044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 519550
End bp: 520734
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 69%
IMG OID: 639688286
Product: acetyl-CoA acetyltransferase
Protein accession: YP_829988
Protein GI: 116669055
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
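
The start, end, gene length, and protein length listed above agree with one another if the coordinates are read as 1-based and inclusive. The Python sketch below checks that arithmetic. The function names are illustrative and not part of any database API, and the inclusive-coordinate convention is an assumption rather than something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(519550, 520734)
print(length)                  # 1185
print(protein_length(length))  # 394
```

Both values match the Gene Length and Protein Length fields of this record.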


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.201568
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCTGC GCGAACAGTT CGGTAAAGAT GTCCTGCTCA CCGGCTGGGG CCACAGCCGC 
TTCGGCAAAC TCACGGACGA CACCCTGGAG TCCCTGATCG TCCAGGTCGC CACGGAGGCG
ATCGGCAACG CCGGGATCGA CCCGGGCCAG ATCGATGAGA TCTACCTGGG CCAGTTCAAC
TCCGGCATGA TGCCGCTGGC GTTCCCGTCC TCGCTGGCCC TGCAGGTCTC GGAGCAGCTG
GCCAACGTCC CCTCCACCCG GGTGGAAAAC GCCTGCGCAT CCGGCTCGGC TGCGTTCCAG
CAGGGCACCA AGTCGCTGCT GGCCGGTACC GCGAAGACGG TCCTCGTGAT CGGCGCCGAA
AAGATGACCC ACGCAGGTGC GGACGTCGTC GGGGCGGCCC TGCTGGGTGC CGACTACGAC
ATGGCCGGCA AGGCCTCCAC CACAGGCTTC ACCGGCCTGT TCGCCGAGGT CGCCAAGCAC
TACGAGAAGC GCTACGGACC GGTGTCCGAT GTCCTGGGCA CCATCGCGGC AAAGAACCAC
CGCAACGGCG TCGACAACCC CTACGCCCAG CTCCGCAAGG ACCTCGGCGA GGAGTTCTGC
CGCACCGTTT CGGACAAGAA CCCGATGGTG GCGGACCCGC TGCGCCGCAC CGACTGCTCC
CCCGTGTCCG ACGGCGCCGC CGCGGTCGTG CTGTCTGTCT CGCCTACCGG CGGGGCCACC
GCCCCGGTAC GGCTCGCCGG CATCGGCCAC GCGAACGATT TCTTCCCGGC CGAAAGGCGG
GACCCCACCG CCTTCGCCGC AACCCGCGTC TCCTGGCAGC GCGCGCTGGG GATGGCCGGC
GTCGGGCTGG AGGACCTGGA CTTCGCCGAA GTGCATGACT GCTTCACCAT CGCCGAACTG
CTCATGTATG AGGCCATGGG ACTGACCGAA CCCGGCCAAG GTGCCCGCGC CGTCGAGGAA
GGCTGGGTCT TCAAGGACGG CAAGCTGCCC ATCAACGTGT CCGGCGGGCT CAAGGCCAAG
GGCCACCCCG TGGGTGCCAC CGGCGTCTCG CAGCACGTCA TCGCAGCCAT GCAGCTCACC
GGCACCGCGG GCGGCATGCA GCTCGCCAAC CCCCGCCGCG CCGCCGTGCA GAACATGGGC
GGGGTGGGCA TCGCCAACTA CGTGAGCGTC CTCGAGGCGG TCTAG
 
Protein sequence
MSLREQFGKD VLLTGWGHSR FGKLTDDTLE SLIVQVATEA IGNAGIDPGQ IDEIYLGQFN 
SGMMPLAFPS SLALQVSEQL ANVPSTRVEN ACASGSAAFQ QGTKSLLAGT AKTVLVIGAE
KMTHAGADVV GAALLGADYD MAGKASTTGF TGLFAEVAKH YEKRYGPVSD VLGTIAAKNH
RNGVDNPYAQ LRKDLGEEFC RTVSDKNPMV ADPLRRTDCS PVSDGAAAVV LSVSPTGGAT
APVRLAGIGH ANDFFPAERR DPTAFAATRV SWQRALGMAG VGLEDLDFAE VHDCFTIAEL
LMYEAMGLTE PGQGARAVEE GWVFKDGKLP INVSGGLKAK GHPVGATGVS QHVIAAMQLT
GTAGGMQLAN PRRAAVQNMG GVGIANYVSV LEAV
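
The gene and protein sequences above can be cross-checked with a short Python sketch. It strips the spacing used in the listing, computes the GC fraction, and translates codons with the standard amino-acid assignments, which translation table 11 shares with table 1 for non-initiator codons. The helper names are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases; this gene starts with ATG, so that limitation does not matter here.

```python
BASES = "TCAG"
# Standard codon table in NCBI order (first, second, third base each cycle through T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises ValueError on any base other than A, C, G, or T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
first_groups = "ATGAGCCTGC GCGAACAGTT CGGTAAAGAT"
print(translate(first_groups))  # MSLREQFGKD
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would extend the same check to the whole record.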