Gene Sare_4186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4186 
Symbol 
ID5703840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4753927 
End bp4755762 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content59% 
IMG OID641273610 
ProductABC transporter related 
Protein accessionYP_001538963 
Protein GI159039710 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.938404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0036434 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGCAA ATATGGGTGG CCCGGTCCGA ACAGGTGCTG CGGCGGTGCG CCTTGCTTGG 
CAAAGCGCGC CCGTTCTGTT GACGTCTTAC GCTGTGATTA GCGTCCTTAT GGCAAGTGTG
CCGGTCTGCA CGGCATGGGT GACTAAGCTG ACGATTGACG AGTTAGGTAG TCCAGCTGCC
ACCCAAGACC GCATAGTTGG GTTGGTCGTT ATTCTGGCAG CGTTCACCGT AGCGAGCGCG
ACGTTGCTGC CCTGCAGCAA GTATCTGCAA GCTCAGATGA AGCGTGTCAT TGCCCTGCAC
GCGCAGGATA ATCTATTCAG AGCAGTCGGT CGGCTGGGCG GACTCCGCCG GTTCGAGAAC
CCCGACTTTC AAGACCGGCT GCGTCTGGCG CAGGCAGCCG CGTCGAATGC CGCTAGCCAA
GTTGTGTCTA ATATCCTGAA TGTCGCGACA GTGCTGCTTA CGGTTACTGG TCTACTCACC
GCGCTGGCAG TGATCAGTCC AGTAATGACG ATCGTTGTGG TTGTCGCGGT TGCGCCGGTT
GCCCTGGCCG AGATCTCGGT TGCGCGACGG CGTGCCTCAG TGGCTTGGAC ATTGAGCCCG
ACGGAGCGGC GTGAAGCATT TTACGCTGGC CTCCTGACCA GTGTTGATGC AGCCAAGGAA
GTGCGGTTGT TTAACATTGT CGGGTTTCTC CGGAACCGAA TGTTCGAGGA GCGGTTTGCC
GCGAATTCTG CACATCGGAA ACTTGATCTT CGGGAGCTAC GTACCCAGAC GCTACTCGGC
ATGTTTACCG CTACGGTTTC GGGAGCAGGA CTACTTTGGG CCGGAATGCA GGTGTACGGT
GGCGGACTTA CGATTGGAGA CCTGACTCTG TTCGTGGCGG CAGTCGCGGG CATCCAGACC
GCGCTGCTGG GCGTTGCTGG TACTGTCGCG AGGACGTATC AGGAGTTGCT TCTGTTCGGC
CACTACCAGA CCGTTACGAC GGCAGGACCA GATCTGGCCA CTATATCCGG GCCGCGTGTG
GTGCCGCCGC TACGCCGTGG CCTTGAACTT CGGGATGTGT GGTTCCGGTA TGCGCCGGAC
CAGCCATGGG TGCTACGTGG CATCAACATG TTCATCCCAC ACGGCCAAGA GGTCGCCCTG
GTAGGGCGGA ACGGTAGCGG CAAGAGTACG CTGATAAAAA TTCTCTGCCG TCTGTACGAC
CCGGACCGGG GAGTCGTCTA TTGGGATGGG GTGGACATTA AAGAGCTAGA CTTAGACCAG
TTGCGAAATC GGATTAGCGC GGTCTTCCAG GATGCCATGC ACTACGATCT TTCCGCCGCC
GAGAATGTCG CCGTAGGTGA TATAGTGCAT TTGAAGAATC GGGAAAGGAT TCACAAAGCG
GCCCAGGTGG CGGGGGTGCA CGACACTCTT GTGCAGCTGC CTGTCGGTTA CGAGACCATG
CTCACGCGGA TGTTCTATGC ACCAAATGGA GGTGACGACC CGCAATCGGG CGTCCTGCTT
TCGGGGGGCC AATGGCAACG CCTGGCGCTG GCTCGGGCGC TGTTCCGGGA TCGACGGGAT
CTCATGATTT TGGATGAACC CGTCGGAGGA CTGGACGCAG TAGCCGAGCG CGAGGTGCAC
TCGGCGGTGC GGCGGCATAG GGCTGGCGCG ACAAGTCTGC TGGTCTCGCA CCGTATGGGC
TCACTTCGTG ATGCAGATCT GATTATCGTC ATTTCCGGTG GCGAGGTGGT CGAGCAGGGC
GACCATGACG AACTCATGGC GGCCCGAGGA CAATATGCCG AGTTGTTCAC CGCACAGGCG
CAGGGCTACG TTGAGTCGTC GGCGTCGGCT AGGTAG
 
Protein sequence
MIANMGGPVR TGAAAVRLAW QSAPVLLTSY AVISVLMASV PVCTAWVTKL TIDELGSPAA 
TQDRIVGLVV ILAAFTVASA TLLPCSKYLQ AQMKRVIALH AQDNLFRAVG RLGGLRRFEN
PDFQDRLRLA QAAASNAASQ VVSNILNVAT VLLTVTGLLT ALAVISPVMT IVVVVAVAPV
ALAEISVARR RASVAWTLSP TERREAFYAG LLTSVDAAKE VRLFNIVGFL RNRMFEERFA
ANSAHRKLDL RELRTQTLLG MFTATVSGAG LLWAGMQVYG GGLTIGDLTL FVAAVAGIQT
ALLGVAGTVA RTYQELLLFG HYQTVTTAGP DLATISGPRV VPPLRRGLEL RDVWFRYAPD
QPWVLRGINM FIPHGQEVAL VGRNGSGKST LIKILCRLYD PDRGVVYWDG VDIKELDLDQ
LRNRISAVFQ DAMHYDLSAA ENVAVGDIVH LKNRERIHKA AQVAGVHDTL VQLPVGYETM
LTRMFYAPNG GDDPQSGVLL SGGQWQRLAL ARALFRDRRD LMILDEPVGG LDAVAEREVH
SAVRRHRAGA TSLLVSHRMG SLRDADLIIV ISGGEVVEQG DHDELMAARG QYAELFTAQA
QGYVESSASA R