Gene B21_00550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00550 
SymbolentE 
ID8116577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp586358 
End bp587968 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content56% 
IMG OID644846828 
Producthypothetical protein 
Protein accessionYP_002998401 
Protein GI251784097 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATTC CATTCACCCG CTGGCCGGAA GAGTTTGCCC GTCGCTATCG GGAAAAAGGC 
TACTGGCAGG ATTTGCCGCT GACCGACATT CTGACTCGCC ACGCTGCGAG TGACAGCATC
GCGGTTATCG ACGGCGAGCG ACAGTTGAGT TACCGGGAGC TGAATCAGGC GGCGGATAAC
CTCGCGTGTA GTTTACGCCG TCAGGGCATT AAACCTGGTG AAACCGCGCT GGTACAACTG
GGTAACGTCG CTGAATTGTA TATTACCTTT TTCGCGCTGC TGAAACTGGG CGTTGCGCCG
GTGCTGGCGC TGTTCAGCCA TCAGCGTAGT GAACTGAACG CCTATGCCAG CCAGATTGAA
CCCGCATTGC TGATTGCCGA TCGCCAACAT GCGCTGTTTA GCGGGGATGA TTTCCTCAAT
ACTTTCGTCA CAGAACATTC CTCCATTCGC GTGGTGCAAC TGCACAACGA CAGCGGTGAG
CATAACTTGC AGGATGCGAT TAACCATCCG GCTGAGGATT TTACTGCCAC GCCATCACCT
GCTGATGAAG TGGCCTATTT CCAGCTTTCC GGCGGCACCA CCGGCACACC GAAACTGATC
CCGCGCACTC ATAACGACTA CTACTACAGC GTGCGTCGTA GCGTCGAGAT TTGTCAGTTC
ACACAACAGA CACGCTACCT GTGCGCGATC CCGGCGGCTC ATAACTACGC CATGAGTTCG
CCGGGATCGC TGGGCGTCTT TCTTGCCGGA GGAACGGTTG TTCTGGCGGC CGATCCCAGC
GCCACGCTTT GCTTCCCATT GATTGAAAAA CATCAGGTGA ACGTCACCGC GCTGGTGCCA
CCGGCAGTCA GCCTGTGGTT GCAGGCGCTG ACCGAAGGTG AAAGCCGGGC GCAGCTTGCC
TCGCTGAAAC TGTTACAGGT CGGCGGCGCA CGTCTTTCAG CCACCCTTGC GGCGCGTATT
CCCGCTGAGA TTGGCTGCCT GTTGCAGCAG GTGTTTGGCA TGGCGGAAGG GCTGGTGAAC
TACACCCGAC TTGATGATAG CGCGGAGAAA ATTATCCATA CCCAGGGTTA CCCAATGTGT
CCGGACGATG AAGTATGGGT TGCTGATGCT GAAGGAAATC CACTGCCGCA AGGGGAAGTT
GGACGCCTGA TGACGCGCGG GCCGTACACC TTCCGCGGTT ATTACAAAAG TCCGCAGCAC
AATGCCAGTG CCTTTGATGC CAACGGTTTT TACTGTTCCG GCGATCTGAT CTCTATTGAT
CCAGAGGGTT ACATCACCGT GCAGGGGCGC GAGAAAGATC AGATCAACCG TGGCGGCGAG
AAGATCGCTG CCGAAGAGAT CGAAAACCTG CTGCTGCGCC ATCCGGCGGT GATCTATGCC
GCACTGGTGA GCATGGAAGA TGAGCTGATG GGCGAAAAAA GCTGTGCTTA TCTGGTGGTA
AAAGAGCCGC TGCGCGCGGT GCAGGTGCGT CGTTTCCTGC GTGAACAGGG TATTGCCGAA
TTTAAATTAC CGGATCGCGT GGAGTGTGTG GATTCACTTC CGCTGACGGC GGTCGGGAAA
GTCGATAAAA AACAATTACG TCAGTGGCTG GCGTCACGCG CATCAGCCTG A
 
Protein sequence
MSIPFTRWPE EFARRYREKG YWQDLPLTDI LTRHAASDSI AVIDGERQLS YRELNQAADN 
LACSLRRQGI KPGETALVQL GNVAELYITF FALLKLGVAP VLALFSHQRS ELNAYASQIE
PALLIADRQH ALFSGDDFLN TFVTEHSSIR VVQLHNDSGE HNLQDAINHP AEDFTATPSP
ADEVAYFQLS GGTTGTPKLI PRTHNDYYYS VRRSVEICQF TQQTRYLCAI PAAHNYAMSS
PGSLGVFLAG GTVVLAADPS ATLCFPLIEK HQVNVTALVP PAVSLWLQAL TEGESRAQLA
SLKLLQVGGA RLSATLAARI PAEIGCLLQQ VFGMAEGLVN YTRLDDSAEK IIHTQGYPMC
PDDEVWVADA EGNPLPQGEV GRLMTRGPYT FRGYYKSPQH NASAFDANGF YCSGDLISID
PEGYITVQGR EKDQINRGGE KIAAEEIENL LLRHPAVIYA ALVSMEDELM GEKSCAYLVV
KEPLRAVQVR RFLREQGIAE FKLPDRVECV DSLPLTAVGK VDKKQLRQWL ASRASA