Gene Arth_2144 details

Gene Information

Locus tag: Arth_2144
Symbol:
ID: 4445221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2416918
End bp: 2418288
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 66%
IMG OID: 639689952
Product: acetyl-CoA acetyltransferase
Protein accession: YP_831624
Protein GI: 116670691
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
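
The start and end coordinates, gene length, and protein length in this record are mutually consistent. A minimal sketch of that check follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2416918
end_bp = 2418288

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1371
print(protein_length)  # 456
```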


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0404851
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTCCTTTA ACGGACAGTC CGCCACCGGA CCAGATGAAT CAGCCGCAGC CCCCGCAGCC 
ACACCCGGCG CAGGCCTGCT GCGCAAGGCT GTGGTCGTGG GCGGAAACCG GATTCCGTTT
GCCCGGACCG GCGGCGCGTA CACCAAGTCC TCGAACCAGG ACATGCTGAC CGCAGCCCTG
GACGGCCTGA TTGCGCGTTT CGGCCTCGCT GATGAACGGA TCGGCGAGGT TGCCGCAGGC
GCTGTGCTCA AACACTCCCG CGACTTTAAC CTCACCCGCG AGGCGGTGCT GGGCTCGGCA
CTCTCCGCGG AGACTCCTGC CTATGATCTC CAGCAGGCGT GTGCCACGGG ACTCGAGACG
GTCCTCGGCC TTGCCAACAA GATCAAGCTC GGACAAATCG ATTCGGCCAT TGCCGGTGGA
GTCGACTCCG CGTCCGATGC GCCCATCGCC GTGAGTGAGG GCCTGCGCGA GGTCCTGCTG
GACCTCAACC GCGCCAAGAC CCTGCCCCAG CGCCTGAAGG TCCTGGGCCG GCTTCGCCCC
AAGGACCTGG CGCCCGACGC ACCCAATACG GGGGAGCCCC GGACGGGGCT TTCGATGGGC
GAACACCAGG CCCTCACCAC CGCGCAGTGG AAGATTACCC GCGAGGCCCA GGATGAGCTC
GCATATAACA GCCACCGCAA CCTCGCGGCC GCTTACGATG CCGGCTTCTT CGACGATCTG
CTCACGCCGT ACCGCGGACT GAACCGGGAC TCGAACCTCC GCGCCGACAC AACACGTGAA
AAACTGTCCA CGCTCAAGCC CGTTTTCGGT AAGAACCTGG GCGCCGAAGC AACCATGACG
GCCGGCAACT CCACCCCGCT CACTGACGGC GCGTCCACCG TGCTGCTCGC CTCGGAGGAG
TGGGCGGACG CCCACGAACT TCCCAAGCTC GCGACGGTTG TTGACGGCGA GGCGGCCGCC
GTCGACTTCG TCCATGGCAA GGACGGACTG CTGATGGCGC CGGCCTTCGC GGTGCCCCGC
CTGCTGGCCC GCAACGGTCT GACGCTGGAT GACATCGATT TCTTCGAGAT CCATGAAGCA
TTTGCCGGCA CAGTCCTCAG CACGCTGGCA GCGTGGGAAG ACGAAGAATT CGGCCGCACC
CGCCTCGGCC TGGACGGTCC GCTGGGCAGC ATCGACCGGG CCAAGCTGAA CGTGAACGGT
TCCTCGCTGG CAGCCGGGCA CCCCTTCGCC GCGACCGGAG GGCGGATCGT GGCCACGCTG
GCCAAGATGC TTCACGACAA GGGGCAGGTG GACGGCCGAC CGGCCCGCGG CCTGATTTCC
ATCTGCGCTG CCGGCGGCCA AGGCGTCGTC GCCATTCTTG AAGCATCCTA G
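
The gene sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGTCCTTTA ACGGACAGTC CGCCACCGGA CCAGATGAAT CAGCCGCAGC CCCCGCAGCC"
seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```

Running the same two functions on the full 1371-base sequence gives a value that can be compared against the 66% GC content listed in the record.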
 
Protein sequence
MSFNGQSATG PDESAAAPAA TPGAGLLRKA VVVGGNRIPF ARTGGAYTKS SNQDMLTAAL 
DGLIARFGLA DERIGEVAAG AVLKHSRDFN LTREAVLGSA LSAETPAYDL QQACATGLET
VLGLANKIKL GQIDSAIAGG VDSASDAPIA VSEGLREVLL DLNRAKTLPQ RLKVLGRLRP
KDLAPDAPNT GEPRTGLSMG EHQALTTAQW KITREAQDEL AYNSHRNLAA AYDAGFFDDL
LTPYRGLNRD SNLRADTTRE KLSTLKPVFG KNLGAEATMT AGNSTPLTDG ASTVLLASEE
WADAHELPKL ATVVDGEAAA VDFVHGKDGL LMAPAFAVPR LLARNGLTLD DIDFFEIHEA
FAGTVLSTLA AWEDEEFGRT RLGLDGPLGS IDRAKLNVNG SSLAAGHPFA ATGGRIVATL
AKMLHDKGQV DGRPARGLIS ICAAGGQGVV AILEAS