Gene Sare_4526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4526 
Symbol 
ID5706016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5116971 
End bp5119067 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content69% 
IMG OID641273940 
Productoligopeptidase B 
Protein accessionYP_001539289 
Protein GI159040036 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000835874 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCACCG AGACCCCAGC GCCCGTCGCC AGGCGGATGC CGACCGAGCG AACCCACCAC 
GGCGACACCG TCGTCGACGA GTATGCCTGG CTCGCCGACA AGGACGATCC GGCCACGATC
GCCTACCTCA CCACCGAGAA CGCCTACACC GAGGCCCGGA CAGCCCACCT GACGGACCTG
CGCGCGCAAC TGTTCGAGGA GATCCGCCAG CGGACCCAGG AAACCGACCT GTCGGTTCCC
ACCCGCAAGG GTGGCCACTG GTACTACACC CGCACGGTCG AGGGGCAGCA GTACGGAGTG
CAGTGCCGCC GCGCCGTCCA CGACGGTGAA ACCGCCCCCC CGGTCAGCGG CGACGGCACC
CCCCTGACAG ACGAGGAGGT GCTGCTCGAC GGCAACCTCC TCGCTGAGGG ACACGACTTC
TTCGCGCTCG GGGCGTTCGA TGTGAGCCCG GACGGGCGCT GGCTGGCCTA CTCGACCGAC
TTCTCCGGCG ACGAGCGGTT CACCCTACGG GTCAAGGACC TCACCACCGG TGAGTTGCTG
CCCGACGAGG TGCCCGGCAC GTTCTACGGC ACGGCCTGGT CCGCTGACGC CTCGGTGCTC
TTCTACGTCA CCGTCGACGA CGCGTGGCGG CCGAACCGGG TCTGGCGGCA CACACTGGGC
ACTCCGGCCG GCGAGGACGT GGTGGTCCAC CAGGAGGACG ACGAGCGGTT CTGGGTCGGG
GTCGAACTGA CCCGCTCCGA AAAATTCGTA CTCATCGACA TACACAGCAA GTTGACCAGT
GAGATCCTGG CCATCCCCGC CGGCAACCCG ACCGGAGCCC CGGCCCCGGT GGCCCCCCGC
CGTCAGGGCG TGGAGTACAC GGTCGAGCAC CACGGCCACC GGTTCCTGAT CCTGCACAAC
GACGGCGCCG AGGACTTCGC CCTCGCGTAC ACCTCGGCCG ACGCCCCGGG CGACTGGGTG
CCACTCATCG AGCACTCCCC GGGCACCCGC CTGGAGGCGA TCGACGCGTT CGACAACCAT
CTGGTGGTCA CGTTACGCAG CAACGGGCTG ACCGGGCTGC GGGTGCTACC GGTCGGCGGT
GGCGACCCCC ACGACATCGA CTTCCCCGAA CCGCTGTACA GCGTCGGCCT GGACAGCAAC
CCGGAGTACC GCACCTCCCA GCTCCGCCTG CGCTACACCT CGTTGGTCAC CCCGGACTCG
GTGTACGACT ACGACCTGGT CACCCGTCGG ATGATCCGAC GCCGGCAGAA GCCGGTGCTA
CCCGGGCCAG ACGGTCGCCC GTACGACCCG GCCGGCTACG AGCAGCACCG GGAGTGGGCG
CTCGCCGACG ACGGCACCCG GGTGCCGATC TCGCTGGTCT GCCGGGCCGG CACGCCGCGC
GACGGCTCCG CGCCGTGCGT CATCTACGGG TACGGCTCCT ACGAGGCGAG CATGGACCCC
TGGTTCTCCG TTGCCCGGCT GTCCCTGCTG GACCGGGGTG TCGTCTTCGC CGTGGCGCAC
ATCCGCGGCG GCGGTGAACT GGGGCGCCGC TGGTACGACC AGGGGAAGCT ACTGGCCAAG
AAGAACACCT TCACCGACTT CGTTTCCTGT GCCCGGCACC TGGTCAAGGC CGGCTGGACG
GCGACCGACC GGCTGGTCGC CCGGGGCGCC TCGGCCGGTG GGCTGCTGAT GGGCGCGGTG
ACCAACCTCG CTCCGGACGC CTTCGCCGGG ATCGTCGCGC AGGTTCCCTT CGTCGACGCG
CTCACCTCGA TCCTCGACCC ATCGCTGCCG TTGACCGTCA CCGAGTGGGA GGAGTGGGGC
AACCCACTGG ACGACCCCGA GGTGTACGCG TACATGAAGT CGTACACGCC ATACGAGAAC
GTACGGGCCG TGGACTACCC GGCGATCCTC GCGGTGACCA GCCTCAACGA CACCCGTGTG
CTCTACCATG AGCCGGCGAA GTGGATCGCG CGACTGCGGG CCACCGCGCC GCAGGGTGAC
TACCTGCTCA AAACTGAGAT GGGCGCCGGG CACGGTGGGC CCAGCGGCCG GTACGACGCC
TGGCGGGAGG AGGCGTTCAT CAACGCCTGG CTGCTCAACC AACTCGACAG CGCCTGA
 
Protein sequence
MTTETPAPVA RRMPTERTHH GDTVVDEYAW LADKDDPATI AYLTTENAYT EARTAHLTDL 
RAQLFEEIRQ RTQETDLSVP TRKGGHWYYT RTVEGQQYGV QCRRAVHDGE TAPPVSGDGT
PLTDEEVLLD GNLLAEGHDF FALGAFDVSP DGRWLAYSTD FSGDERFTLR VKDLTTGELL
PDEVPGTFYG TAWSADASVL FYVTVDDAWR PNRVWRHTLG TPAGEDVVVH QEDDERFWVG
VELTRSEKFV LIDIHSKLTS EILAIPAGNP TGAPAPVAPR RQGVEYTVEH HGHRFLILHN
DGAEDFALAY TSADAPGDWV PLIEHSPGTR LEAIDAFDNH LVVTLRSNGL TGLRVLPVGG
GDPHDIDFPE PLYSVGLDSN PEYRTSQLRL RYTSLVTPDS VYDYDLVTRR MIRRRQKPVL
PGPDGRPYDP AGYEQHREWA LADDGTRVPI SLVCRAGTPR DGSAPCVIYG YGSYEASMDP
WFSVARLSLL DRGVVFAVAH IRGGGELGRR WYDQGKLLAK KNTFTDFVSC ARHLVKAGWT
ATDRLVARGA SAGGLLMGAV TNLAPDAFAG IVAQVPFVDA LTSILDPSLP LTVTEWEEWG
NPLDDPEVYA YMKSYTPYEN VRAVDYPAIL AVTSLNDTRV LYHEPAKWIA RLRATAPQGD
YLLKTEMGAG HGGPSGRYDA WREEAFINAW LLNQLDSA