Gene ECD_02851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02851 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2990792 
End bp2992522 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content53% 
IMG OID 
Productputative saframycin Mx1 synthetase B 
Protein accessionACT44655 
Protein GI253978985 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000692091 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTGTATA TGTCTAATAA AATCTTTACG CATTCCCTAC CTATGCGCTA TGCCGATTTT 
CCAACGCTTG TTGATGCTTT GGACTACGCC GCTCTGAGTA GCGCCGGAAT GAATTTTTAT
GACAGACGTT GCCAACTTGA AGATCAACTG GAATATCAGA CGTTAAAAGC ACGTGCCGAA
GCTGGTGCGA AGCGGTTGTT ATCGCTGAAC CTGAAAAAAG GCGATCGCGT GGCACTGATT
GCCGAAACAA GTAGCGAGTT CGTAGAGGCT TTTTTTGCCT GCCAGTATGC CGGCTTAGTC
GCCGTCCCGT TGGCGATTCC AATGGGCGTT GGTCAGCGGG ATTCCTGGAG CGCCAAATTG
CAGGGTTTAC TGGCAAGTTG CCAGCCCGCA GCCATTATCA CTGGTGATGA GTGGTTGCCA
CTGGTCAATG CCGCGACGCA TGACAACCCC GAATTACATG TTTTAAGCCA CGCTTGGTTT
AAGGCATTAT CGGAAGCCGA TGTTGCGCTC CAGCGTCCAG TTCCGAACGA TATCGCCTAC
CTCCAGTACA CCTCCGGCAG CACCCGTTTT CCCCGTGGCG TCATTATCAC CCATCGCGAA
GTGATGGCTA ATCTACGTGC TATAAGCCAC GACGGCATTA AATTACGCCC TGGCGACCGC
TGCGCCTCCT GGCTGCCTTT CTACCATGAT ATGGGACTGG TCGGCTTTCT CCTGACCCCC
GTCGCCACGC AGCTTTCAGT AGATTATTTG CGCACTCAGG ATTTTGCCAT GCGTCCTCTG
CAATGGCTTA AATTGATCAG TAAAAATCGC GGCACCGTTT CCGTTGCGCC GCCGTTTGGC
TATGAATTGT GCCAGCGCCG CGTGAATGAA AAAGATCTCG CTGAACTGGA TCTTTCCTGC
TGGCGCGTCG CTGGTATTGG TGCAGAACCC ATCTCCGCAG AACAACTCCA TCAATTCGCT
GAATGTTTCC GTCAGGTTAA CTTTGACAAT AAAACTTTCA TGCCGTGCTA CGGACTGGCA
GAAAATGCGC TGGCTGTCAG CTTCTCTGAT GAAGCCTCCG GGGTTGTGGT TAACGAAGTG
GATCGCGACA TCCTCGAATA TCAGGGTAAA GCCGTCGCGC CGGGTGCAGA GACACGCGCC
GTATCGACTT TCGTCAACTG CGGCAAAGCG TTGCCGGAAC ATGGTATTGA AATCCGCAAT
GAAGCAGGTA TGCCGGTCGC GGAACGTGTG GTAGGCCATA TTTGCATCTC CGGTCCCAGT
CTGATGAGCG GTTACTTTGG CGACCAGGTT TCGCAAGACG AGATTGCCGC GACGGGCTGG
TTAGACACCG GCGACCTCGG TTATCTGCTG GACGGTTATC TGTATGTCAC CGGACGCATT
AAAGATCTGA TTATTATTCG TGGCCGTAAT ATCTGGCCGC AGGATATTGA ATATATAGCG
GAACAGGAAC CGGAAATTCA TTCTGGCGAT GCGATTGCTT TTGTTACCGC CCAGGAAAAA
ATCATTTTGC AGATCCAGTG TCGGATCAGC GACGAAGAAC GTCGCGGGCA GCTTATCCAC
GCGCTGGCGG CACGGATCCA AAGCGAATTT GGCGTGACCG CGGCTATCGC TCTGTTGCCG
CCCCACAGTA TTCCCCGAAC GTCCTCCGGC AAGCCTGCCC GTGCGGAAGC GAAAAAACGT
TATCAGAAGG CTTATGCTGC CAGTCTTAAT GTGCAGGAAT CCCTGGCATG A
 
Protein sequence
MVYMSNKIFT HSLPMRYADF PTLVDALDYA ALSSAGMNFY DRRCQLEDQL EYQTLKARAE 
AGAKRLLSLN LKKGDRVALI AETSSEFVEA FFACQYAGLV AVPLAIPMGV GQRDSWSAKL
QGLLASCQPA AIITGDEWLP LVNAATHDNP ELHVLSHAWF KALSEADVAL QRPVPNDIAY
LQYTSGSTRF PRGVIITHRE VMANLRAISH DGIKLRPGDR CASWLPFYHD MGLVGFLLTP
VATQLSVDYL RTQDFAMRPL QWLKLISKNR GTVSVAPPFG YELCQRRVNE KDLAELDLSC
WRVAGIGAEP ISAEQLHQFA ECFRQVNFDN KTFMPCYGLA ENALAVSFSD EASGVVVNEV
DRDILEYQGK AVAPGAETRA VSTFVNCGKA LPEHGIEIRN EAGMPVAERV VGHICISGPS
LMSGYFGDQV SQDEIAATGW LDTGDLGYLL DGYLYVTGRI KDLIIIRGRN IWPQDIEYIA
EQEPEIHSGD AIAFVTAQEK IILQIQCRIS DEERRGQLIH ALAARIQSEF GVTAAIALLP
PHSIPRTSSG KPARAEAKKR YQKAYAASLN VQESLA