Gene Sare_0349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0349 
Symbol 
ID5708021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp390871 
End bp393861 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content71% 
IMG OID641269875 
Productlanthionine synthetase C family protein 
Protein accessionYP_001535270 
Protein GI159036017 
COG category[V] Defense mechanisms 
COG ID[COG4403] Lantibiotic modifying enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.179334 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGCCG CGGGCGGCGA GGTCGAGACC GAGCCGGGCT TCGCCGCGCG ACTCGCCGAC 
CTCGGGATGC CCCAGGATCC GCGCCTGGGT GAGTCTCCCG CCGAGCTCGC CGCCCGTATC
CGTCGACCTG CCTGGGCCGT CCTGGTGGAG GACGTCCTCG CCGCCGCCCG GCCGCTGCCC
TCCGACGCCA GGCCGGCGGC CGACTGGCGG GAGGCGTTCG CCCGGGTCCT CGCACCCTTC
GTCGACGCCG CGCTGACCCA GATTCGCCGG CACGATGGCC GGCACGTGGA TCTGGACCGA
GTCGCCGCCA CGGTTGGCGA CACGCTCGGG CCACGCCTGG TTGGCATCGC CGCCCGCGCC
CTGGTCACCG AGCTACATCG GTGGCGTGCC GAGGGCCGAT TGGCCGGAGG TGACAGCCAA
GCGCGCTTTC ACGACTTCGT CCGCCAACTC ACCGCACCCA CCGGTCTCGG TGAGGTGCTC
GCCCGATACC CTGTTCTGGC CCGTCTGCTC GCCCAGGACA CGGCCACCAC CGCCGATGCC
ACCGTGGAGC TGCTCGACCG GTTCGGCCTC GACCGCGATG CGCTGATTGC CACCCTGCTC
GGCGGCACCG ACCCAGGCCC GGTCACCTCG GTGCTCACCT CGCGGGGAGA CCGGCACGCC
GGAGGGCGCT CGGTCGCCTT TGTCGACTTC GACGACGGGC GACGAATCGT CTACAAGCCA
CGCGATCTCG CGCCGCACAC CCAGCTCACC GTGATTTTGG AATACCTGTC GACGGCTACA
CCGGGGTTGT TTCCCCGAAC CCCACGAACT CTCGCCCGCA CCGGCTACGG CTGGGCCGAG
CACATCGCCG CACTACCGCT GTTCACCTGG GAGGATGCTG ACCTCTTCTA CCGCCGCCAA
GGCGCGTTGC TGGCTCTGTT GCACCTCGTC CGCGCCGCCG ACGTGCACTA CGAGAACCTC
GTCGCGCATG GCGACCAGCC GGTTCTCGTC GATGTCGAGA CCCTGTTCCA CCCTGGACTC
ACTCCTGGCG GATCGGGTGA CCCTGCCGCC GATGCCCTGG CCGCATCCGT GCATCGCACC
GCACTGCTGC CGCTGGTCTT CGTCGGTGAG CAGGGCGTCG CTGACGTGTC CGGGCTTGGC
GGCGACCCCA GCGCGTCTCC GTCGACCGTC GTCGACTGGC TCGACGCGGG TACCGACCGG
ATGCGCCTGA CGCGCCGCGC CGCCGACATG GCCGCTGCGG CCAACCGACC GGTCCTCAAC
GGTCGGCTGG TTGAGCCGCA CGAGCACGAT CGTGCCATCG TCGGCGGATT CCGTCATGCC
TACGACACGT TCATCGCGCA CCGTGACGCG CTCACCGCGC TTGTCCGCGA CTGCGCCGAC
CTCGAAGTCC GCGTCGTCGT CAGGGCGACC TGGATGTACC AGACGTTGCT CGACGAAACC
ACCCACCCGG ATGTCCTGCG CGACGCTGTC GACCGAGACC GGGCGCTTTC CGTCCTCTAC
CACGGCAGGA CCGAGCAGCC ACTGCTCGCG CAGCTGTTGC GGCCGGAGCT GGCTGCCCTG
TGGGCAGGGG ACATACCGAT GTTCACCGCA TCGGTCGGGA CCGGCAGGAT TCGCACCACG
TCCGGCGCCG AGATGCCCGA ACTGCTGCCA CAGCCCGGGC TCGCCGCGGC GCTGGCCACC
CTCGCCTCGC TCGACGAGGT GAGCCGACGC GGTCAGGAGT GGATAATCTC AGCGACCCTG
GCCTCCCGGG CCCGTGTCGC CCCTCACCCC GACGCCGCCC CGATCGCGGC CCAACCCGAG
GGGGCAGCGG CGCACCCCGA TGAGCTGCTG GCCGCGGCCT GCGCCGTTGC CGACCAGCTC
GTCGCCGAGG CGAACGCCGG TGGCGGACGG GTCAACTGGC TGGGCTTAGA GGCCGTCGAG
GACCATCGGT GGCTGGTGCT TCCGCTCGGC GCCAGCCTCG GCAGTGGGTA CCTCGGTGTG
GCACTGTTCC TCGCCCAGCT CGCTGCGGTC ACTGGCATCC ATCGCTATGC CGAGCAGGCC
CGCGCCGCCA CCGCCGACCT GCCCCAGCTC GTCGCTGCGC TGGACAAACG TCCTGATCTC
GTCGCCGTCA TTGGGTGCGG CGGGCTGGAC GGGCTGGGCG GCATCGCCTA CGGTCTGGCC
CGAATCGGGA CCCTGCTCGA CGACTCCGCG CTCACCGACG CCGCCGCCCG CACCGTTCCT
CTCGCTATGG CGGCGGCGAC CCCTGGGGCG TCACTGGGCT GGTCCGCCGG CCTCGCCGGC
TGCCTAGCCG CGCTGACTGC GGTGCACACC GAACTGAAAC TCCCCGAGGC TGCTGACGCC
GCCCGCCGGT GCGCCGACCT CCTCGTCGCG CCATTGACCG AATCCGACGG CGTGCCTGGG
CTGATGTACC GAACGCACCA TGGGGTCTCG ACTGATCGGG CCGGTGGCGC CTGGCCAGCA
TCGGGCGGGT TCGCCGATGG GCTGGCCGGC GTCGGATGGG CATTGACCAC CATTGGACCC
GACGACCACC ATCACGCCGC TGGCCGTCGG TTGGCCACAC TGCTCGGCGA CCGAGGTGAG
CAGGTAGCGG CCGGCTGGTG TCGTGGCACC GCCGGAGCCG TCCTGGCCCG TACGGCCCTA
TCGGCTGCTA CTGGTCCACG CAGCCTGGCG GGCTGCGTGG AAACCCTGGC GAACGCACCC
GTACGGCGGG ATCTGAGCCT GTGTCACGGC GAGCTCGGGG TGACCGAGGT GCTCGCCCAG
CTCGCCGGCT CCGACCGACG CACCCTCGCG GCCCAAGCCC TACGCCGCCG GGCCGGACTG
GTACTCGATG TTCTGCGTCG GCACGGCTCA CTGTGCGGGG TGCCCGGTGG CGCTCGTTCC
CCGGGGCTCC TCACCGGGCT CGCCGGCATC GGCTACGGGC TGCTGCGTCA CGCCGCGCCC
CAGCGAGTGC CCTCCGTGCT GCTGCTACAG CACACAATGC TTGCCCCTTA A
 
Protein sequence
MPAAGGEVET EPGFAARLAD LGMPQDPRLG ESPAELAARI RRPAWAVLVE DVLAAARPLP 
SDARPAADWR EAFARVLAPF VDAALTQIRR HDGRHVDLDR VAATVGDTLG PRLVGIAARA
LVTELHRWRA EGRLAGGDSQ ARFHDFVRQL TAPTGLGEVL ARYPVLARLL AQDTATTADA
TVELLDRFGL DRDALIATLL GGTDPGPVTS VLTSRGDRHA GGRSVAFVDF DDGRRIVYKP
RDLAPHTQLT VILEYLSTAT PGLFPRTPRT LARTGYGWAE HIAALPLFTW EDADLFYRRQ
GALLALLHLV RAADVHYENL VAHGDQPVLV DVETLFHPGL TPGGSGDPAA DALAASVHRT
ALLPLVFVGE QGVADVSGLG GDPSASPSTV VDWLDAGTDR MRLTRRAADM AAAANRPVLN
GRLVEPHEHD RAIVGGFRHA YDTFIAHRDA LTALVRDCAD LEVRVVVRAT WMYQTLLDET
THPDVLRDAV DRDRALSVLY HGRTEQPLLA QLLRPELAAL WAGDIPMFTA SVGTGRIRTT
SGAEMPELLP QPGLAAALAT LASLDEVSRR GQEWIISATL ASRARVAPHP DAAPIAAQPE
GAAAHPDELL AAACAVADQL VAEANAGGGR VNWLGLEAVE DHRWLVLPLG ASLGSGYLGV
ALFLAQLAAV TGIHRYAEQA RAATADLPQL VAALDKRPDL VAVIGCGGLD GLGGIAYGLA
RIGTLLDDSA LTDAAARTVP LAMAAATPGA SLGWSAGLAG CLAALTAVHT ELKLPEAADA
ARRCADLLVA PLTESDGVPG LMYRTHHGVS TDRAGGAWPA SGGFADGLAG VGWALTTIGP
DDHHHAAGRR LATLLGDRGE QVAAGWCRGT AGAVLARTAL SAATGPRSLA GCVETLANAP
VRRDLSLCHG ELGVTEVLAQ LAGSDRRTLA AQALRRRAGL VLDVLRRHGS LCGVPGGARS
PGLLTGLAGI GYGLLRHAAP QRVPSVLLLQ HTMLAP