Gene Bind_1756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1756 
Symbol 
ID6200767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1988965 
End bp1990101 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content59% 
IMG OID641705747 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001832875 
Protein GI182678729 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.404662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.415846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAAAG TTGTCATTGC CGGATACATT CGTTCGCCGT TCACCCTCGC CAAGAAAGGC 
GAACTGGCGA CCGTCCGCCC GGATGATCTT GCGGCCCAGG TGGTCAAGGG GCTCATTAAG
AAAACGGGTA TCCCGGCGGA GGATATCGAG GACTTGCTGC TTGGGTGCGC CTTTCCGGAG
GGGGAGCAGG GCTTCAATGT CGCGCGGCTC GTGAGCTTCC TTGCGGGGCT ACCTCTTTCG
GTCGGTGCGT CAACCGTCAA CCGCTTCTGC GGCTCCTCCA TGACCACCGT GCATATGGCT
GCCGGCGCAA TTCAAATGAA TGCGGGCAAT GCCTTCATCG CGGCCGGTGT CGAGAGCATG
TCCCGTGTGC CGATGATGGG GTTCAATCCT TTGCCCAATC CGGAGCTCGC GGCGACCATG
CCCGGCGCCT ATATGGGCAT GGGTGATACG GCTGAAAATG TCGCTGCTAA ATGGACGATT
TCCCGCAAGG AGCAGGAAGA GTTCGCGCTG CGGTCGCATC AGCGCGCCAC GGCGGCGCAG
AAAGAGGGCC GGTTGACCGG TGAAATCATC CCAATCACCG GCCGCAAAGG CACGATCACG
ACGGATGGCT GCATCCGCCC CGACACAACG CTTGAAGGGC TGGCGGAGTT GAAACCTGCC
TTCAGTGCAA ACGGTGTCGT TACAGCCGGT ACATCCTCGC CTTTAACCGA CGGGGCCGCT
GCAGTACTGG TGTGCAGTGA AGATTACGCT AAGCACCATC ATCTCGATGT GCTCGCTTCG
GTCAAAGCCA TCGCGGTCTC TGGTTGCAGC CCGGAAATCA TGGGCATCGG GCCTGTGGCG
GCTTCGCGCA AGGCTCTAGC TCGTGCCGGA CTCGAAGCCG GTCAGATCGA TATCGTCGAA
CTGAACGAAG CCTTCGCCTC TCAATCGATT GCCTGTATGC GCGAGCTGAA CCTTTCACCG
GATCGAGTGA ATATCGACGG CGGCGCCATT GCTCTTGGTC ATCCGTTAGG AGCCACCGGC
GCGCGTATCG TCGGTAAGGC CGCTTCTTTG TTGAAGCGTG AAAAAGGCAA ATATGCGCTT
GCGACGCAAT GTATTGGCGG CGGTCAAGGC ATCGCGACAG TTCTGGAGGC TTTCTGA
 
Protein sequence
MTKVVIAGYI RSPFTLAKKG ELATVRPDDL AAQVVKGLIK KTGIPAEDIE DLLLGCAFPE 
GEQGFNVARL VSFLAGLPLS VGASTVNRFC GSSMTTVHMA AGAIQMNAGN AFIAAGVESM
SRVPMMGFNP LPNPELAATM PGAYMGMGDT AENVAAKWTI SRKEQEEFAL RSHQRATAAQ
KEGRLTGEII PITGRKGTIT TDGCIRPDTT LEGLAELKPA FSANGVVTAG TSSPLTDGAA
AVLVCSEDYA KHHHLDVLAS VKAIAVSGCS PEIMGIGPVA ASRKALARAG LEAGQIDIVE
LNEAFASQSI ACMRELNLSP DRVNIDGGAI ALGHPLGATG ARIVGKAASL LKREKGKYAL
ATQCIGGGQG IATVLEAF