Gene B21_00542 details

Gene Information

Locus tag: B21_00542
Symbol: entF
ID: 8114613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 574445
End bp: 578326
Gene Length: 3882 bp
Protein Length: 1293 aa
Translation table: 11
GC content: 57%
IMG OID: 644846820
Product: hypothetical protein
Protein accession: YP_002998393
Protein GI: 251784089
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
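
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration, not part of the database record: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. All variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 574445
end_bp = 578326
gene_length_bp = 3882
protein_length_aa = 1293

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which is not
# counted as an amino acid in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree: the coordinate span equals the stated gene length, and one third of that length, minus the stop codon, equals the stated protein length.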


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0422505
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGC ATTTACCTTT GGTCGCCGCA CAGCCCGGCA TCTGGATGGC AGAAAAACTG 
TCAGAATTAC CCTCCGCCTG GAGCGTGGCG CATTACGTTG AGTTAACCGG AGAGGTTGAT
TCGCCATTAC TGGCCCGCGC GGTGGTTGCC GGACTAGCGC AAGCAGATAC GCTGCGGATG
CGTTTTACGG AAGATAACGG CGAAGTCTGG CAGTGGGTCG ATGATGCGCT GACGTTCGAA
CTGCCAGAAA TTATCGACCT GCGAACCAAC ATTGATCCGC ACGGTACTGC GCAGGCATTA
ATGCAGGCCG ATTTGCAGCA AGATCTGCGC GTCGATAGCG GTAAACCACT GGTCTTTCAT
CAGCTCATTC AGGTGGCAGA TAACCGCTGG TACTGGTATC AGCGTTATCA CCATTTGCTG
GTCGATGGCT TTAGTTTCCC GGCCATTACC CGCCAGATCG CCAATATTTA CTGCACATGG
CTGCGTGGCG AACCAACGCC TGCTTCGCCG TTTACGCCTT TCGCTGATGT AGTGGAAGAG
TACCAGCAAT ACCGCGAAAG CGAAGCCTGG CAGCGTGATG CGGCATTCTG GGCGGAACAG
CGTCGTCAAC TGCCACCGCC CGCGTCACTT TCTCCGGCAC CTTTACCGGG GCGCAGCGCC
TCGGCAGATA TTCTGCGCCT GAAACTGGAA TTTACCGACG GGGAATTCCG CCAGCTGGCT
ACGCAACTTT CAGGTGTGCA GCGTACCGAT TTAGCCCTTG CGCTGGCGGC TTTGTGGCTG
GGGCGATTGT GCAACCGCAT GGACTACGCT GCCGGATTTA TCTTTATGCG TCGACTGGGC
TCGGCGGCGT TGACGGCTAC CGGACCCGTG CTCAACGTTT TGCCGTTGGG TATTCATATT
GCGGCGCAAG AAACGCTGCC GGAACTGGCA ACCCGACTGG CGGCTCAACT GAAAAAAATG
CGTCGTCATC AACGTTACGA TGCCGAACAA ATTGTCCGTG ACAGCGGGCG AGCGGCAGGT
GATGAACCGC TGTTTGGTCC GGTACTCAAT ATCAAGGTAT TTGATTACCA ACTGGATATT
CCTGGTGTTC AGGCGCAAAC CCATACCCTG GCAACCGGTC CGGTTAATGA CCTTGAACTG
GCCCTGTTCC CGGATGAACA CGGTGATTTG AGTATTGAGA TCCTCGCCAA TAAACAGCGT
TACGATGAGC CAACGTTAAT CCAGCATGCT GAACGCCTGA AAATGCTGAT TGCCCAGTTC
GCCGCGGATC CGGCGCTGTT GTGCGGCGAT GTCGATATTA TGCTGCCAGG TGAGTATGCG
CAGCTGGCGC AGCTCAACGC CACTCAGGTT GAGATTCCAG AAACCACGCT TAGCGCGCTG
GTGGCAGAAC AAGCGGCAAA AACACCGGAT GCTCCGGCGC TGGCAGATGC GCGTTACCTG
TTCAGCTATC GGGAAATGCG CGAGCAGGTG GTGGCGCTGG CGAATCTGCT GCGTGAGCGC
GGCGTTAAAC CAGGGGACAG CGTGGCGGTG GCACTACCGC GCTCGGTCTT TTTGACCCTG
GCACTCCATG CGATAGTTGA AGCTGGAGCG GCCTGGCTAC CGCTGGATAC CGGCTATCCG
GACGATCGCC TGAAAATGAT GCTGGAAGAT GCGCGTCCGT CGCTGTTAAT TACCACCGAC
GATCAACTGC CGCGCTTTAG CGATGTTCCC AATTTAACAA GCCTTTGCTA TAACGCCCCG
CTTACACCGC AGGGCAGTGC GCCGCTGCAA CTTTCACAAC CGCATCACAC GGCTTATATC
ATCTTTACCT CTGGCTCCAC CGGCAGGCCG AAAGGGGTAA TGGTCGGGCA GACGGCTATC
GTCAACCGCC TGCTTTGGAT GCAAAATCAT TATCCGCTTA CAGGCGAAGA TGTCGTTGCC
CAAAAAACGC CGTGCAGTTT TGATGTCTCG GTGTGGGAGT TTTTCTGGCC GTTTATCGCA
GGGGCAAAAC TGGTGATGGC TGAACCGGAA GCGCACCGCG ACCCGCTCGC TATGCAGCAA
TTCTTTGCCG AATATGGCGT AACGACCACG CACTTTGTGC CGTCGATGCT GGCGGCATTT
GTTGCCTCGC TGACGCCGCA AACCGCTCGC CAGAGTTGCG CGACGTTGAA ACAGGTTTTC
TGTAGTGGTG AGGCCTTACC GGCTGATTTA TGCCGCGAAT GGCAACAGTT AACTGGCGCG
CCGTTGCATA ATCTATATGG CCCGACGGAA GCGGCGGTAG ATGTCAGCTG GTATCCGGCT
TTTGGCGAGG AACTGGCACA GGTGCGCGGC AGCAGTGTGC CGATTGGTTA TCCGGTATGG
AATACGGGTC TGCGTATTCT TGATGCGATG ATGCATCCGG TGCCGCCGGG TGTGGCGGGT
GATCTCTATC TCACTGGCAT TCAACTGGCG CAGGGCTATC TCGGACGCCC CGATCTGACC
GCCAGCCGCT TTATTGCCGA TCCTTTTGCC CCAGGTGAAC GGATGTACCG TACCGGAGAC
GTTGCCCGCT GGCTGGATAA CGGCGCGGTG GAGTACCTCG GGCGCAGTGA TGATCAGCTA
AAAATTCGCG GGCAGCGTAT CGAACTGGGC GAAATCGATC GCGTGATGCA GGCGCTGCCG
GATGTCGAAC AAGCCGTTAC CCACGCCTGT GTGATTAACC AGGCGGCTGC CACCGGTGGT
GATGCGCGTC AATTGGTGGG CTATCTGGTG TCGCAATCGG GCCTGCCGTT GGATACCAGC
GCATTGCAGG CGCAGCTTCG TGAAACATTG CCACCACATA TGGTACCGGT GGTTCTGCTG
CAACTTCCAC AGTTACCACT TAGCGCCAAC GGCAAGCTGG ATCGCAAAGC CTTACCGTTG
CCTGAACTGA AGGCACAAGC GCCAGGGCGT GCGCCGAAAG CGGGCAGTGA AACGATTATC
GCCGCGGCAT TCTCGTCGTT GCTGGGGTGT GACGTGCAGG ATGCCGATGC TGATTTCTTC
GCGCTTGGCG GTCATTCGCT ACTGGCAATG AAACTGGCAG CGCAGTTAAG TCGGCAGGTT
GCCCGCCAGG TGACGCCGGG GCAAGTGATG GTCGCGTCAA CTGTCGCCAA ACTGGCAACG
ATTATTGATG CTGAAGAAGA CAGCACCCGG CGTATGGGAT TCGAAACCAT TCTGCCGTTG
CGTGAAGGTA ATGGCCCGAC GCTGTTTTGT TTCCATCCTG CGTCCGGTTT TGCCTGGCAG
TTCAGCGTGC TCTCGCGTTA TCTCGATCCA CAATGGTCGA TTATCGGCAT TCAGTCACCG
CGCCCCAATG GCCCCATGCA GACGGCGGCA AACCTGGATG AAGTCTGCGA AGCGCATCTG
GCAACGTTAC TTGAACAACA ACCGCGCGGC CCTTATTACC TGCTGGGGTA TTCGCTGGGC
GGTACGCTGG CGCAGGGTAT TGCGGCGCGG CTGCGTGCCC GTGGCGAACA GGTGGCATTT
CTTGGCTTGC TGGATACCTG GCCGCCAGAA ACGCAAAACT GGCAGGAAAA AGAAGCTAAT
GGTCTGGACC CGGAAGTGCT GGCGGAGATT AACCGCGAAC GCGAGGCCTT CCTGGCGGCA
CAGCAGGGAA GTACTTCAAC GGAGTTGTTT ACCACCATTG AAGGCAACTA CGCTGATGCT
GTGCGCCTGC TGACGACTGC TCATAGCGTA CCGTTTGATG GTAAAGCGAC GCTGTTTGTT
GCTGAACGCA CGCTTCAGGA AGGTATGAGC CCCGAACGCG CCTGGTCGCC GTGGATAGCC
GAGCTGGATA TCTATCGTCA GGATTGTGCG CATGTGGATA TTATCTCTCC AGGGGCGTTT
GAAAAAATTG GGCCGATTAT TCGCGCAACG CTAAACAGGT AA
 
Protein sequence
MSQHLPLVAA QPGIWMAEKL SELPSAWSVA HYVELTGEVD SPLLARAVVA GLAQADTLRM 
RFTEDNGEVW QWVDDALTFE LPEIIDLRTN IDPHGTAQAL MQADLQQDLR VDSGKPLVFH
QLIQVADNRW YWYQRYHHLL VDGFSFPAIT RQIANIYCTW LRGEPTPASP FTPFADVVEE
YQQYRESEAW QRDAAFWAEQ RRQLPPPASL SPAPLPGRSA SADILRLKLE FTDGEFRQLA
TQLSGVQRTD LALALAALWL GRLCNRMDYA AGFIFMRRLG SAALTATGPV LNVLPLGIHI
AAQETLPELA TRLAAQLKKM RRHQRYDAEQ IVRDSGRAAG DEPLFGPVLN IKVFDYQLDI
PGVQAQTHTL ATGPVNDLEL ALFPDEHGDL SIEILANKQR YDEPTLIQHA ERLKMLIAQF
AADPALLCGD VDIMLPGEYA QLAQLNATQV EIPETTLSAL VAEQAAKTPD APALADARYL
FSYREMREQV VALANLLRER GVKPGDSVAV ALPRSVFLTL ALHAIVEAGA AWLPLDTGYP
DDRLKMMLED ARPSLLITTD DQLPRFSDVP NLTSLCYNAP LTPQGSAPLQ LSQPHHTAYI
IFTSGSTGRP KGVMVGQTAI VNRLLWMQNH YPLTGEDVVA QKTPCSFDVS VWEFFWPFIA
GAKLVMAEPE AHRDPLAMQQ FFAEYGVTTT HFVPSMLAAF VASLTPQTAR QSCATLKQVF
CSGEALPADL CREWQQLTGA PLHNLYGPTE AAVDVSWYPA FGEELAQVRG SSVPIGYPVW
NTGLRILDAM MHPVPPGVAG DLYLTGIQLA QGYLGRPDLT ASRFIADPFA PGERMYRTGD
VARWLDNGAV EYLGRSDDQL KIRGQRIELG EIDRVMQALP DVEQAVTHAC VINQAAATGG
DARQLVGYLV SQSGLPLDTS ALQAQLRETL PPHMVPVVLL QLPQLPLSAN GKLDRKALPL
PELKAQAPGR APKAGSETII AAAFSSLLGC DVQDADADFF ALGGHSLLAM KLAAQLSRQV
ARQVTPGQVM VASTVAKLAT IIDAEEDSTR RMGFETILPL REGNGPTLFC FHPASGFAWQ
FSVLSRYLDP QWSIIGIQSP RPNGPMQTAA NLDEVCEAHL ATLLEQQPRG PYYLLGYSLG
GTLAQGIAAR LRARGEQVAF LGLLDTWPPE TQNWQEKEAN GLDPEVLAEI NREREAFLAA
QQGSTSTELF TTIEGNYADA VRLLTTAHSV PFDGKATLFV AERTLQEGMS PERAWSPWIA
ELDIYRQDCA HVDIISPGAF EKIGPIIRAT LNR
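
The gene and protein sequences above are related by translation table 11, which assigns amino acids to codons in the same way as the standard genetic code (the two tables differ in their permitted start codons). The sketch below is a minimal illustration, not the database's own pipeline. It strips the 10-base spacing used in the listing, translates codon by codon until the first stop codon, and computes GC content as a percentage. The function names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last rows of the gene sequence are used so the example stays short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; the amino acid
# assignments are those shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence as listed above.
first_row = "ATGAGCCAGC ATTTACCTTT GGTCGCCGCA CAGCCCGGCA TCTGGATGGC AGAAAAACTG"
last_row = "GAAAAAATTG GGCCGATTAT TCGCGCAACG CTAAACAGGT AA"

print(translate(first_row))
print(translate(last_row))
```

The first row translates to the first 20 residues of the protein sequence (MSQHLPLVAA QPGIWMAEKL), and the last row, which is in frame because every preceding row holds 60 bases, translates to the final 13 residues (EKIGPIIRAT LNR) before the TAA stop codon. Passing the full gene sequence to `gc_percent` would give a figure to compare with the GC content field.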