Gene Strop_4105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4105 
Symbol 
ID5060587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4668299 
End bp4670395 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content70% 
IMG OID640476366 
Productoligopeptidase B 
Protein accessionYP_001160913 
Protein GI145596616 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.662092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCATCG AGACCCCAGC GCCCGTCGCC AAACGGATGC CGACCGAGCG AACCCACCAC 
GGCGATACGT TCACTGACGA GTACGCCTGG CTCGCCGGCA AGGACGATCC CGCCACGATT
GCCTACCTCA CCACCGAGAA CGCCTACACC GAGGCGCGGA CGGCCCACCT GGCGGACCTG
CGCGAGCAGC TGTTCGAGGA GATCCGCCAG CGGACCCAGG AGACCGACCT GTCGGTGCCC
GCCCGCAAGG GCGGCCACTG GTACTACACC CGCACGGTCG AGGGGCAGCA GTACGGAGTG
CAGTGCCGCC GCGCCGTCCA CGACGGCGAG ACCGACCCGC CGGTCAGCCA GGACGGTGCC
CCCCTCGCGG ACGAGGAGGT ACTGCTCGAT GGCAACGTCC TCGCCGCGGG GCACGACTTC
TTCGCGCTCG GGGCGTTCGA CGTGAGCCCG GACGGACGCT GGCTGGCGTA CTCGACCGAC
TTCTCCGGCG ACGAGCGGTT CACGCTCCGG GTCAAGGACC TCACCACCGG GGAACTGCTG
CCCGACGAGG TGCCCGGCAC GTTCTACGGA ACGGCCTGGT CCGCCGACGC CTCGGTGCTC
TTCTACGTCA CGGTTGACGA CGCGTGGCGG CCGAACCGGG TCTGGCGGCA CACCCTGGGC
ACCTCGGCCA GCGAGGACGT GGTGGTTTAC CAGGAGGACG ACGAGCGGTT CTGGGTCGGG
GTCGAGCTGA CCCGCTCGGA GAAGTTCCTC CTCATCGACA TTCACAGCAA GGTGACCAGC
GAAGTCCTGG CCATCCCCGC CGGCAACCCG ACCGGCGCCC CGGTTCCGGT GGCCCCCCGC
CGCCAGGGTG TGGAGTACAC GGTCGAGCAC CACGGCCACC GGTTCCTGAT CCTGCACAAC
GACGGCGCCG AGGACTTCGC CCTCGCGTAC ACCTCGGCCG ACGCCCCGGG CGACTGGGTG
CCGCTCATCG AGCACCGTCC CGGCACCCGT CTGGAGGCGG TCGACGCCTT CGAGAACCAT
CTGGTGGTCA CGTTGCGCGC CAACGGGCTG ACCGGGCTGC GGGTGCTGCC GATCGGGGGT
GGCGACTCAC ACGACATCGA CTTCCCCGAA CCGCTGTACA GCGTCGGCCT GGACAGCAAC
CCGGAGTACC GCACGGGTCA GCTCCGATTC CGCTACACCT CACTGGTCAC CCCGGACTCG
GTGTACGACT ACGACCTGGT CACCCGCCGG ATGATTCAAC GCCGGCAGCG GCCGGTGCTG
CCCGGGCCGG ACGGCCGCCC GTACGACCCC GCCGGCTACG AGCAGCACCG GGACTGGGCG
ATCGCCGACG ACGGCACCCG GGTGCCGATC TCGCTGGTCT GCCGGGCCGG CACCCCGCGC
GACGGCTCCG CGCCGTGCGT CATCTACGGC TACGGCTCCT ACGAGGCGAG CATGGACCCC
TGGTTCTCGG TCGCCCGGCT CTCCCTGCTG GACCGGGGTG TCGTCTTCGC CGTGGCACAC
ATCCGCGGCG GCGGCGAGCT GGGCCGGCGC TGGTATGACC AGGGCAAGCT GCTGGCCAAG
AAGAACACCT TCACCGACTT CGTGGCCTGC GCACGGCACC TGGTCGAGGC GGGTTGGACC
GCGACCGACC GGCTGGTCGC CCGGGGCGCC TCCGCCGGCG GGCTGCTGAT GGGGGCGGTC
GCCAACCTCG CCCCGGACGC GTTCACCGGG ATCGTCGCGC AGGTTCCCTT CGTCGACGCG
CTGACCTCGA TGCTCGACCC GTCGCTGCCG TTGACCGTCA CCGAGTGGGA GGAGTGGGGC
AACCCGCTGG ACGACCCCGA GGTGTACGCG TACATGAGGT CGTACACGCC GTACGAGAAC
GTGCGGGCCG TGGACTATCC AGCGATCCTC GCGGTGACCA GCCTCAACGA CACCCGGGTG
CTCTACCACG AGCCGGCGAA GTGGATCGCG CGACTGCGAG CCACCGCACC GCGGGGCGAC
TACCTGCTCA AGACCGAGAT GGGTGCCGGG CACGGCGGGC CGAGCGGTCG GTACGACGCC
TGGCGTGAGG AGGCCTTCAT CAACGCCTGG CTGCTCGACC AGCTCGGCCG CGCCTGA
 
Protein sequence
MTIETPAPVA KRMPTERTHH GDTFTDEYAW LAGKDDPATI AYLTTENAYT EARTAHLADL 
REQLFEEIRQ RTQETDLSVP ARKGGHWYYT RTVEGQQYGV QCRRAVHDGE TDPPVSQDGA
PLADEEVLLD GNVLAAGHDF FALGAFDVSP DGRWLAYSTD FSGDERFTLR VKDLTTGELL
PDEVPGTFYG TAWSADASVL FYVTVDDAWR PNRVWRHTLG TSASEDVVVY QEDDERFWVG
VELTRSEKFL LIDIHSKVTS EVLAIPAGNP TGAPVPVAPR RQGVEYTVEH HGHRFLILHN
DGAEDFALAY TSADAPGDWV PLIEHRPGTR LEAVDAFENH LVVTLRANGL TGLRVLPIGG
GDSHDIDFPE PLYSVGLDSN PEYRTGQLRF RYTSLVTPDS VYDYDLVTRR MIQRRQRPVL
PGPDGRPYDP AGYEQHRDWA IADDGTRVPI SLVCRAGTPR DGSAPCVIYG YGSYEASMDP
WFSVARLSLL DRGVVFAVAH IRGGGELGRR WYDQGKLLAK KNTFTDFVAC ARHLVEAGWT
ATDRLVARGA SAGGLLMGAV ANLAPDAFTG IVAQVPFVDA LTSMLDPSLP LTVTEWEEWG
NPLDDPEVYA YMRSYTPYEN VRAVDYPAIL AVTSLNDTRV LYHEPAKWIA RLRATAPRGD
YLLKTEMGAG HGGPSGRYDA WREEAFINAW LLDQLGRA