Gene Snas_4134 details

Gene Information

Locus tag: Snas_4134
Symbol:
ID: 8885335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4413382
End bp: 4416687
Gene Length: 3306 bp
Protein Length: 1101 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: 6-deoxyerythronolide-B synthase
Protein accession: YP_003512878
Protein GI: 291301600
COG category:
COG ID:
TIGRFAM ID:
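
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information record.
START_BP = 4413382
END_BP = 4416687


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1  # drop the terminal stop codon


n_bases = gene_length(START_BP, END_BP)
print(n_bases)                  # 3306, matching "Gene Length"
print(protein_length(n_bases))  # 1101, matching "Protein Length"
```

Under these assumptions both derived values agree with the reported 3306 bp and 1101 aa.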


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAAGA CGAGCACCGA AGACAAACTC CGCGAATACC TCAAGCGCGC GACGGTTGAG 
CTGGGACAGA CCAGAAAACG TATGCAGGAG CTGGACCGGC GCGCCACCGA ACCGATCGCG
ATCGTGGGGA TGGCCTGCCG GTTTCCCGGC GGCGTGACAT CGCCGGAAGA CCTGTGGCGA
CTGGTTTCGC AGGGCACCGA CGCCGTCGGC GAGTTCCCGG AGGATCGCGG CTGGGACCTG
GATTCGCTGT TCTGTGACGA CACGTCCGCC CATGGGACGT CCTATGTGTC CGAGGCCGGG
TTCCTCGACG GCGCCACCGA GTTCGACGCC GGGTTCTTCG GCATCTCACC CCGTGAGGCG
TACACCATGG ACCCCCAGCA GCGGATCCTG CTGGAAGTGG CGTGGGAATC GCTGGAACGG
GCGGGGATCG ATCCGAAGAC ACTGACGGGC ACCGAGGTCG GGGTGTACGC CGGGTCCATC
GGCCAGGACT ACGGATACCG CCTGGGACAG TTCGACCAGT CCCTGGAGGG ACAGCTGATC
ACCGGGAACA CCGGCAGCGT CGTGTCCGGC CGGATCGCCT ACGCCCTCGG CTTGGAGGGA
CCGGCCGTCA CCGTCGACAC CGCGTGTTCG TCGTCGCTGG TCGCACTGCA TCTGGCGGCG
CGGGCCCTGC GTGCCGGTGA ATGCGACCTC GCCATGGCGG GCGGAGTGTA CATTGTGACC
TCTCCGGCGC CGTTCGTCGA GTTCAGCCGC CAGCGTGGCC TGGCCAAGGA CGGCCGCTGC
AAGTCGTTCG CGGACTCCGC CGACGGCACC AACTGGGCCG AGGGCGCCGG AATGCTGGTC
GTGGAACGGT TGTCGGACGC CCGACGGCTG GGGCACGAGG TGCTCGCGGT CGTACGCGGT
TCGGCGATCA ACCAGGACGG CGCCAGCAAC GGACTCACCG CGCCCAACGG CCGGGCCCAG
GAGCGCGTGA TCCGCCGCGC GCTCGACGAC GCCCGCCTCG CGCCATCCGA TGTAGAGCTC
GTCGAGGCGC ACGGCACTGG CACGACGCTG GGTGACCCGA TCGAAGCCCG GGCGCTGCTG
GCGGTGTACG GGCAGGGCCG CGAACCCGAC TCCCCGGTGT GGGTGGGTTC GCTGAAGTCG
AACATCGGGC ACGCCCAGGC CGCCGCCGGG GTCGGCGGCG TCATCAAGAC CGTGATGGCG
ATGCGAAACA AGACGATGCC GCCCACGCTG CACATCGACG AACCGTCCAC CCACGTGGAC
TGGTCGCAGG GCGAGGTCGC GCTGCTGACC GAATCCCGGG ACTGGACCGT CGCCGACGAA
CCGCGCCGCG CGGGCGTGTC GTCCTTCGGC ATCAGCGGCA CCAACGCGCA CGTGATCCTG
GAGGAGTCGC AGACCGACAC CATGTCCACA CCGGACGACG CGGTGACACC CGAAGCCGTC
GTCTGGCCGG TCTGTGGGCG CACCGACGCC GCCCTGGCGG GTCAGGCCGA GAAACTGCTG
TCGCACCTCG GCGAGGACTT CGATCCCGTC AGCGTGGGCT ATTCGCTGGC GGCGACCCGG
ACCCCGCTGG ACCGGCGCGC GGTGCTGGTC GGTGGCGACC GCGAAACCCT GCGGCGCGGG
CTCACCGCGC TGGCGAATGG CGACAACGCC CCGGGACTGG TGCGCGGCAA CGTCTCCGAG
GCTTCGGTGG CGTTCCTGTT CACCGGACAG GGCAGTCAGC GGCCCGGCAT GGGCCGACAG
CTGTACCGGC GCTACCCGGT CTTCGCCAAG ACGCTCGACG AGGTGTGCGA CGCCCTGGAT
CCGCACCTCG ACCGGCGCCT GCGGGACGTC CTGTTCGGCG CCGACACCGA AGCGGTCCAC
CAGACCGGCT ACACCCAGCC CGCCCTGTTC GCGGTCGAAC TGGCGCTGCA CCGGCTCGCC
GAGTCGTGGG GGCTGCGTCC GGGAGCAGTC GCGGGCCATT CGATCGGCGA ACTCGCCGCC
GCCCACGTCG CCGGAGTGTT CTCACTGCCG GACGCGTGCG CGCTGGTCGC GGCACGCGGG
CGCCTGATGC AGGCGCTGCC GCGCGGCGGT GCGATGGTCG CGATCCAGGC CACCGAGGCC
GAGGTCACGC CGCTACTGGA AGAGCGGGTG TCGCTGGCCG CGGTCAACGG TCCGTCGTCA
GTGGTCATCT CCGGCGATGC CGAAGCGGCA CAGATGATCG CGGCCAAGTT CGACGCCGAG
GGACGCAAGA CCAAGAACCT GACGGTCAGC CACGCGTTCC ACTCACCGCG CATGGACGAC
ATGCTGGCCG ACTTCGCCGA GGTGGCGGCC ACGATCGAAT ACCACCCGCC AAGACTGCCA
ATGGTGTCGA ACGTGACGGG AGACCTCGAG CGCGCTCGGG TCGCCTCGGC CGACTACTGG
GTCCGGCACG TGCGCCAGCC GGTGCGGTTC GCCGACGGGG TGCGGACCCT GCACGCCAAC
GGCACCACGA TGTTCGTCGA GATAGGACCA GACGCGGTGC TCAGCGGCAT CGGCGTCGAA
TCCGCGACCG ACGCCGTGTT CGTGCCGCTG CTGCGCGGGG CCCGTCCGGA AGAACGCACC
CTGGTGTCGG GCATGGCGCA GGCCTGGACC CGTGGGGCCC CCGTCGAACT GACGCGGGTG
TTCGACGGCA CCGCCGCGTC GCGGGTGGAC CTGCCGACCT ACGCCTTCCA GCACAAGCGT
TACTGGCCGA AGGCCGAGGC GTTCGCGAAC ACCGAACCGG CCGTCGGTGC CCCGACCGCG
ACCCTCCCGG CCGACGCCGG GACCCCGGCC CCGGACCTGG CGGGGCGGCT CGCGGTCATG
CCGGAGGCCG AACGACACCG GGTCCTGCTC AACCTGGTCC GCACCGAGAC CGCGGACGTG
CTGGCACATG ACACGATGGA CGATGTCACC GCCGACGAAC CGTTCAAGGA TCTGGGATTC
GACTCACTCA GCGGTGCTGA ACTCCGCGAA CGTCTGGCAT CATTGACCGG TCTTGAACTG
CCCATCACCC TGTCGTTCAC CTACCCGACT TCGCGGGCGC TGGCGGACTA TCTCGCCGAG
GAGATCCGTG CCGCCCAACC CGCGGACGCC GACCCGATCG ACGCACTACT GACCGAACTG
GACAACGCAC TGTCCGCGAC GTCCGACGAT CCCGACCGGC GCACTCGCGT GACCGGGCGC
CTGGAGTCGT TGCTGGCGAA GTGGGCTGTC GACCGCGAGG CCGCGACGAA CTCCGGCGAT
TCCTTCGAGG AGGCTTCGGA CGAGGAAATG TTCGCGATGC TCGACCGGCA ACTCGGGGAG
CGCTGA
 
Protein sequence
MDKTSTEDKL REYLKRATVE LGQTRKRMQE LDRRATEPIA IVGMACRFPG GVTSPEDLWR 
LVSQGTDAVG EFPEDRGWDL DSLFCDDTSA HGTSYVSEAG FLDGATEFDA GFFGISPREA
YTMDPQQRIL LEVAWESLER AGIDPKTLTG TEVGVYAGSI GQDYGYRLGQ FDQSLEGQLI
TGNTGSVVSG RIAYALGLEG PAVTVDTACS SSLVALHLAA RALRAGECDL AMAGGVYIVT
SPAPFVEFSR QRGLAKDGRC KSFADSADGT NWAEGAGMLV VERLSDARRL GHEVLAVVRG
SAINQDGASN GLTAPNGRAQ ERVIRRALDD ARLAPSDVEL VEAHGTGTTL GDPIEARALL
AVYGQGREPD SPVWVGSLKS NIGHAQAAAG VGGVIKTVMA MRNKTMPPTL HIDEPSTHVD
WSQGEVALLT ESRDWTVADE PRRAGVSSFG ISGTNAHVIL EESQTDTMST PDDAVTPEAV
VWPVCGRTDA ALAGQAEKLL SHLGEDFDPV SVGYSLAATR TPLDRRAVLV GGDRETLRRG
LTALANGDNA PGLVRGNVSE ASVAFLFTGQ GSQRPGMGRQ LYRRYPVFAK TLDEVCDALD
PHLDRRLRDV LFGADTEAVH QTGYTQPALF AVELALHRLA ESWGLRPGAV AGHSIGELAA
AHVAGVFSLP DACALVAARG RLMQALPRGG AMVAIQATEA EVTPLLEERV SLAAVNGPSS
VVISGDAEAA QMIAAKFDAE GRKTKNLTVS HAFHSPRMDD MLADFAEVAA TIEYHPPRLP
MVSNVTGDLE RARVASADYW VRHVRQPVRF ADGVRTLHAN GTTMFVEIGP DAVLSGIGVE
SATDAVFVPL LRGARPEERT LVSGMAQAWT RGAPVELTRV FDGTAASRVD LPTYAFQHKR
YWPKAEAFAN TEPAVGAPTA TLPADAGTPA PDLAGRLAVM PEAERHRVLL NLVRTETADV
LAHDTMDDVT ADEPFKDLGF DSLSGAELRE RLASLTGLEL PITLSFTYPT SRALADYLAE
EIRAAQPADA DPIDALLTEL DNALSATSDD PDRRTRVTGR LESLLAKWAV DREAATNSGD
SFEEASDEEM FAMLDRQLGE R
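
The gene and protein sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before the two can be compared. The following is a minimal sketch of that step, not the database's own tooling: `clean_sequence` and `translate` are hypothetical names, and the codon table is built from the standard NCBI amino acid ordering, whose codon-to-residue assignments are shared by translation table 11 (table 11 differs only in its permitted start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGGATAAGA CGAGCACCGA AGACAAACTC"))  # MDKTSTEDKL
# Final six bases of the gene sequence: one arginine codon, then the TGA stop.
print(translate("CGCTGA"))  # R
```

The first ten translated residues match the start of the listed protein sequence (MDKTSTEDKL), and the last codon before the stop matches its final residue (R).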