Gene B21_04035 details

Gene Information

Locus tag: B21_04035
Symbol: ybl212
ID: 8115074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4334710
End bp: 4336251
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 53%
IMG OID: 644850185
Product: hypothetical protein
Protein accession: YP_003001758
Protein GI: 251787454
COG category: [I] Lipid transport and metabolism
COG ID: [COG4670] Acyl CoA:acetate/3-ketoacid CoA transferase
TIGRFAM ID:
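
The coordinates and lengths listed above agree with each other under two assumptions that the record does not state explicitly: the start and end positions are 1-based and inclusive, and the protein length excludes a single terminal stop codon. A minimal Python sketch of that arithmetic (variable names are illustrative only):

```python
# Values copied from the Gene Information fields.
start_bp = 4334710
end_bp = 4336251

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1542
print(protein_length)  # 513
```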


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.566302
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTAAGA TAACAACCGC CGAGGCACTG GCGGCGCAAA TTCAGGACGG TGCGACTATT 
GCTATTAGCG GTAACGGCGG CGGTATGGTA GAAGCCGACC ATATTCTGGC TGCTATTGAA
GCGCGTTTCC TGCAAACCGG ACACCCTCGC GACCTGACGC TAATCCACTC GCTGGGTATT
GGCGATCGCG ACTGCAAAGG CACTAATCGT TTTGCGCACG CAGAGATGCT CAAACGCATT
ATTGCCGGGC ACTTTACCTG GTCGCCCAAA ATGCAGGCGC TGGTAAAAAA TAACACGATT
GAAGCCTACT GCTTTCCTGG TGGCGTGATT CAGGCACTGC TACGGGAGAT CGGCGCCGGA
CGTCCGGGGC TTTTTACCCA CGTTGGGCTG GGATCGTTTG TTGATCCACG CAATGGCGGC
GGTAAGTCGA ATGAATGCAC TACCGACGAC CTGGTAGAAC TGATTGAAAT CGACGGTGAA
ACCAAACTTC GTTATCGCCC TTTCAAAGTG GATTACGCGA TTTTGCGTGG CACTTATGCC
GATCCTCGAG GCAACGTCAG CCTTGAAGAA GAAGCGATTG ATATGGATAG CTATTCCATG
GCGCTGGCAG CACACAACAG CGGCGGCAAA GTGTTCGTAC AGGTACGCGA TGTGCTGGAA
GCTGGCGCGA TTGAACCACG CCGGGTCAAA TTACCGGGCA TTCTGGTTGA TGGCATCGTT
GAACACTGCG AACAACCGCA AACCTATCTT GGCGGTTACG ACCTGACCAT TAGCGGTCAA
CATCGCCGTC TGAGTTCTAA CGACGCTATT GAACTGGTTA GTCATCCGGT GCGCCGCCTG
ATTGCCCGTC GGGCAGCACG CGAACTGGTG GCTGGCGCTT CAACCAACTT TGGCTTTGGT
ATTCCTGGCG GTATTCCAGG CGTAGCGCTG CGCGAAGGCG TTCCTTACCA AAGTTTATGG
CTGAGCGTCG AGCAAGGCGT ACATAACGGC ATGATGCTTG ATGATGCTCT GTTTGGTTGT
GCCCGCAACG CCGATGCCAT TATTCCATCA CTCGATCAAT TTGAGTTCTA CAGTGGCGGA
GGGATCGATA TCACCTTCCT CGGCATGGGA GAGATGGATC AGTACGGTAA CGTCAACGTC
TCCCATCTCA ATGGCAATCT CATTGGCCCC GGCGGTTTTC TCGAAATTGC GCAAAACGCC
CGTAAAGTGG TGTTCTGCGG CACGTTCGAC GCCAAAGGTA GCAAGATTGA TGTAACGCCA
GATGGCTTGC ATATCGCCCA GTCAGGTCAA ATCCCTAAAC TGGTTACCCA GGTGGAAAAA
ATCACTTTTA GCGCCGCCTA CGCACAGCAA AGTGGTCAGG AAGTGTTGTA TATCACTGAA
CGCGCAGTAT TCCAGTTAAC GGCAGAAGGC GTTGAATTAA TTGAAATCGC ACCGAGTGTG
GAGATTGAGC GCGACATTCT GCCGTATATG GCCTTCCGTC CAATTATCAA GCATCCACGC
CTGATGGAAA GTAGCCTGTT TACGCCGATG GAGGATGCAT GA
 
Protein sequence
MRKITTAEAL AAQIQDGATI AISGNGGGMV EADHILAAIE ARFLQTGHPR DLTLIHSLGI 
GDRDCKGTNR FAHAEMLKRI IAGHFTWSPK MQALVKNNTI EAYCFPGGVI QALLREIGAG
RPGLFTHVGL GSFVDPRNGG GKSNECTTDD LVELIEIDGE TKLRYRPFKV DYAILRGTYA
DPRGNVSLEE EAIDMDSYSM ALAAHNSGGK VFVQVRDVLE AGAIEPRRVK LPGILVDGIV
EHCEQPQTYL GGYDLTISGQ HRRLSSNDAI ELVSHPVRRL IARRAARELV AGASTNFGFG
IPGGIPGVAL REGVPYQSLW LSVEQGVHNG MMLDDALFGC ARNADAIIPS LDQFEFYSGG
GIDITFLGMG EMDQYGNVNV SHLNGNLIGP GGFLEIAQNA RKVVFCGTFD AKGSKIDVTP
DGLHIAQSGQ IPKLVTQVEK ITFSAAYAQQ SGQEVLYITE RAVFQLTAEG VELIEIAPSV
EIERDILPYM AFRPIIKHPR LMESSLFTPM EDA
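
The gene and protein sequences above can be cross-checked against the listed GC content and translation table with a short script. The following is a sketch only, not the method the database used: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is the standard assignment, which translation table 11 shares (table 11 differs only in which codons may act as start codons). The space-separated 10-base blocks are joined before processing.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError on any codon containing a base other than A, C, G, or T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last twelve bases of the gene sequence shown above.
print(translate("ATGCGTAAGA TAACAACCGC CGAGGCACTG"))  # MRKITTAEAL
print(translate("GAGGATGCAT GA"))                     # EDA
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the listed protein sequence and GC content.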