Gene Sare_4885 details


Gene Information

Locus tag: Sare_4885
Symbol:
ID: 5707537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5538786
End bp: 5540666
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 70%
IMG OID: 641274280
Product: ABC transporter related
Protein accession: YP_001539625
Protein GI: 159040372
COG category: [V] Defense mechanisms
COG ID: [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID:
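
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5538786
end_bp = 5540666

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1881 bp) and Protein Length (626 aa).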


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGTGGG AGACCGGGGT ACGGGCGCGG GCGCAGGCCG GGTTGTTCGC CGTCTTCTCC 
GAGCTACCCC GACTGGTGGG AGCCGCGCTG GCCGTGAGCT GGCGGGCCGA CCGGCTCCGC
ACGTCGATCG TGGCGGCCAC GACGGTGGGC GCCGGGGTGC TGGCCGCGTT TGGGCTGCTC
GCCGCGCAGC GGGTGCTGGT GGAACTGTTC GCCGGCGGGC CGACCGCCGA CAAGGTGATC
GCCGCGTTGC CCGCACTGGC CGCTCTCGCG GCGGTGACCG CACTACGCGC CAGCATGACC
GCCGCGATGG GATACGCCCA GAATGGCCTG AGCCCGAAGG TCGACCGGGA GGTGGAACGG
GGCCTGTTCG AGGTGACAAC GGCCGTGCGG CTGGAGGCGT TCGACGCTGA TGCCTTCGCC
GACGACATGG AGCGTGCCTC CCGTGGCGCG GACTCCACCA CCGGCCTGGT TCAGTCGTCG
ATAAACCTGC TCGCCGGCCT CACCGGCGTG CTCGCCGTCG CGGTCGCCGT CGTGGTGATC
CACCCACTGC TGCTGTTCGC CCTGCTGGTG GCCACGGTGC CGAACGGGTG GGCGTCGCTG
CGGGCCGGGC ATCTGCGCTA TCAGACGTAC GCGGCCGGGT CGGTGCGTCG GCGCCGGTTG
TGGCTGCTGC ACCGACTGAT GGCCGAACGC GACTCCGCCC CCGAACTGCG TACCTACGGG
CTGCGGGCCT TCCTGCTCGA CCAGTACGAC CGGGTCATGG GGGTGGAGAC CAGTATCCAA
CTCGCCCTGG CTCGCCGGGT CACCACGACC ACCACCGTCG GCGCGATGAT CGGCGGGATC
GCCACCGCCG TGGTGTACGT GCTGCTCGGC GTACTGCTCG TCGAAGGGCA GATTCCCCTC
GCCGCCGCCG CTACCGGTGT CATCGCCGTG CAGTCCGCGC AGCGATCCCT GGCGGTGGTC
ACCTTCCAGA TGGACCGGGT CTACACCGAG GGGCAACACT TCCGCGACTA CACCGGCTTC
ATGACCCGTG CCGCCGACTA CCTGCCCGAG CCGCGTACCG GCGACGTCGG CCGCCAGGCG
CCGCAGCGGT TGCGCACGAT CGTGGTGGAC GCGGTCAGCC TGCGCTATCC CGACCGTGAC
ACTGCCGCCG TCGACCAGGT CAGCCTCACC ATCGAGGCGG GACAGACGGT GGCGTTCGTC
GGGGAGAACG GCTCCGGCAA GTCCACTCTC GCCACGATGA TCGCCACCCT GCGTACACCC
ACCGGCGGGA CCATCCGCTA CAACGGCCGG CCCACCGACG ACTGGGACAC CGATGCGTTG
CGGGCCCGGA TCGCGGTGGT GACGCAGGAA TATCACAAGT GGCCGTTCAC GGCCGCCACG
AACATCGCCA TCGGCGACCT CGACGTCGGC TGCGAGCAGG ACCGCATCGA GGCCGCCGCC
GCCCGCGCCG TCGCCCACAA CATGATCAAC GAACTGCCGC ACGGCTACGA CACCCTGCTC
GACCGCACCT TCGCCGGTGG TCAGGACCTG TCCGGCGGAC AGTGGCAACG CATCACGGCA
GCCCGCGGCT TCCTTCGTGA CGCGGACCTG CTCATCATGG ACGAGCCCTC GTCGGCCCTC
GACCCCCGCG CGGAGGACGC CCTGTTTCAG GCCATCCGTG ACCGGCAGGG ACACGGGATC
ACCATCCTCA TCACTCACCG GCTCGCCAAC GTCCGCCACG CCGACCGCAT CTACGTCCTG
CACCACGGCC GGCTCGTCGA AGCCGGCACC CACGATGACC TACTCGCCGG CGGCGGACGG
TACGCCGAAC TGTTCACCCT CCAGGCGGCC GGCTACGACA CCCCCGTATC GCTACCCCGG
CAGACCGCCG CCCTCGGGTA G
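
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before any calculation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical, and the start-codon check is a simplification: translation table 11 also permits alternative start codons, which this check does not accept.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon at the end.

    Simplification: only ATG is accepted as a start codon here.
    """
    stops = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stops


# Usage on the first two display blocks only; paste the full listing to check the whole gene.
fragment = clean_sequence("ATGTGGTGGG AGACCGGGGT")
print(len(fragment), gc_fraction(fragment))
```

Run on the full listing, the same functions can be compared against the Gene Length and GC content fields.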
 
Protein sequence
MWWETGVRAR AQAGLFAVFS ELPRLVGAAL AVSWRADRLR TSIVAATTVG AGVLAAFGLL 
AAQRVLVELF AGGPTADKVI AALPALAALA AVTALRASMT AAMGYAQNGL SPKVDREVER
GLFEVTTAVR LEAFDADAFA DDMERASRGA DSTTGLVQSS INLLAGLTGV LAVAVAVVVI
HPLLLFALLV ATVPNGWASL RAGHLRYQTY AAGSVRRRRL WLLHRLMAER DSAPELRTYG
LRAFLLDQYD RVMGVETSIQ LALARRVTTT TTVGAMIGGI ATAVVYVLLG VLLVEGQIPL
AAAATGVIAV QSAQRSLAVV TFQMDRVYTE GQHFRDYTGF MTRAADYLPE PRTGDVGRQA
PQRLRTIVVD AVSLRYPDRD TAAVDQVSLT IEAGQTVAFV GENGSGKSTL ATMIATLRTP
TGGTIRYNGR PTDDWDTDAL RARIAVVTQE YHKWPFTAAT NIAIGDLDVG CEQDRIEAAA
ARAVAHNMIN ELPHGYDTLL DRTFAGGQDL SGGQWQRITA ARGFLRDADL LIMDEPSSAL
DPRAEDALFQ AIRDRQGHGI TILITHRLAN VRHADRIYVL HHGRLVEAGT HDDLLAGGGR
YAELFTLQAA GYDTPVSLPR QTAALG