Gene B21_02110 details


Gene Information

Locus tag: B21_02110
Symbol: atoB
ID: 8113580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2215780
End bp: 2216964
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 53%
IMG OID: 644848318
Product: hypothetical protein
Protein accession: YP_002999891
Protein GI: 251785587
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
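
The coordinates and lengths listed above agree with each other, and a short sketch can make the arithmetic explicit. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2215780, 2216964)
print(length)                  # 1185
print(protein_length(length))  # 394
```

Both values match the Gene Length and Protein Length fields of the record.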


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATT GTGTCATCGT CAGTGCGGTA CGTACTGCTA TCGGTAGTTT TAACGGTTCA 
CTCGCTTCCA CCAGCGCCAT CGACCTGGGG GCGACAGTAA TTAAAGCCGC CATTGAACGT
GCAAAAATCG ATTCACAACA CGTTGATGAA GTGATTATGG GTAACGTGTT ACAAGCCGGG
CTGGGGCAAA ATCCGGCGCG TCAGGCACTG TTAAAAAGCG GGCTGGCAGA AACGGTGTGC
GGATTCACGG TCAATAAAGT ATGTGGTTCG GGTCTTAAAA GTGTGGCGCT TGCCGCCCAG
GCCATTCAGG CAGGTCAGGC GCAGAGCATT GTGGCGGGGG GTATGGAAAA TATGAGTTTA
GCCCCCTACT TACTCGATGC AAAAGCACGC TCTGGTTATC GTCTTGGAGA CGGACAGGTT
TATGACGTAA TCCTGCGCGA TGGCCTGATG TGCGCCACCC ATGGTTATCA TATGGGGATT
ACCGCCGAAA ACGTGGCTAA AGAGTACGGA ATTACCCGTG AAATGCAGGA TGAACTGGCG
CTACATTCAC AGCGTAAAGC GGCAGCCGCA ATTGAGTCCG GTGCTTTTAC AGCCGAAATC
GTCCCGGTAA ATGTTGTCAC TCGAAAGAAA ACCTTCGTCT TCAGTCAAGA CGAATTCCCG
AAAGCGAATT CAACGGCTGA AGCGTTAGGT GCATTGCGCC CGGCCTTCGA TAAAGCAGGA
ACAGTCACCG CTGGGAACGC GTCTGGTATT AACGACGGTG CTGCCGCTCT GGTGATTATG
GAAGAATCTG CGGCGCTGGC AGCAGGCCTT ACCCCCCTGG CTCGCATTAA AAGTTATGCC
AGCGGTGGCG TGCCCCCCGC ATTGATGGGT ATGGGGCCAG TACCTGCCAC GCAAAAAGCG
TTACAACTGG CGGGGCTGCA ACTGGCGGAT ATTGATCTCA TTGAGGCTAA TGAAGCATTT
GCTGCACAGT TCCTTGCCGT TGGGAAAAAC CTGGGCTTTG ATTCTGAGAA AGTGAATGTC
AACGGCGGGG CCATCGCGCT CGGGCATCCT ATCGGTGCCA GTGGTGCTCG TATTCTGGTC
ACACTATTAC ATGCCATGCA GGCACGCGAT AAAACGCTGG GGCTGGCAAC ACTGTGCATT
GGCGGCGGTC AGGGAATTGC GATGGTGATT GAACGGTTGA ATTAA
 
Protein sequence
MKNCVIVSAV RTAIGSFNGS LASTSAIDLG ATVIKAAIER AKIDSQHVDE VIMGNVLQAG 
LGQNPARQAL LKSGLAETVC GFTVNKVCGS GLKSVALAAQ AIQAGQAQSI VAGGMENMSL
APYLLDAKAR SGYRLGDGQV YDVILRDGLM CATHGYHMGI TAENVAKEYG ITREMQDELA
LHSQRKAAAA IESGAFTAEI VPVNVVTRKK TFVFSQDEFP KANSTAEALG ALRPAFDKAG
TVTAGNASGI NDGAAALVIM EESAALAAGL TPLARIKSYA SGGVPPALMG MGPVPATQKA
LQLAGLQLAD IDLIEANEAF AAQFLAVGKN LGFDSEKVNV NGGAIALGHP IGASGARILV
TLLHAMQARD KTLGLATLCI GGGQGIAMVI ERLN
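
As a rough sketch of how the gene and protein sequences relate, the snippet below translates space-separated nucleotide blocks like those shown above and computes a GC fraction. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), so the standard codon table is used. The helper names `clean`, `translate` and `gc_content` are illustrative, not part of the source record.

```python
# Codon table in the conventional TCAG ordering; the amino-acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, and the last 15 bases.
print(translate("ATGAAAAATT GTGTCATCGT CAGTGCGGTA"))  # MKNCVIVSAV
print(translate("GAACGGTTGA ATTAA"))                  # ERLN
```

The two outputs correspond to the first ten and last four residues of the protein sequence listed above. Applying `gc_content` to the full gene sequence should give a value comparable to the GC content field.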