Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4526 |
Symbol | |
ID | 5706016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5116971 |
End bp | 5119067 |
Gene Length | 2097 bp |
Protein Length | 698 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273940 |
Product | oligopeptidase B |
Protein accession | YP_001539289 |
Protein GI | 159040036 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1770] Protease II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000835874 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCACCG AGACCCCAGC GCCCGTCGCC AGGCGGATGC CGACCGAGCG AACCCACCAC GGCGACACCG TCGTCGACGA GTATGCCTGG CTCGCCGACA AGGACGATCC GGCCACGATC GCCTACCTCA CCACCGAGAA CGCCTACACC GAGGCCCGGA CAGCCCACCT GACGGACCTG CGCGCGCAAC TGTTCGAGGA GATCCGCCAG CGGACCCAGG AAACCGACCT GTCGGTTCCC ACCCGCAAGG GTGGCCACTG GTACTACACC CGCACGGTCG AGGGGCAGCA GTACGGAGTG CAGTGCCGCC GCGCCGTCCA CGACGGTGAA ACCGCCCCCC CGGTCAGCGG CGACGGCACC CCCCTGACAG ACGAGGAGGT GCTGCTCGAC GGCAACCTCC TCGCTGAGGG ACACGACTTC TTCGCGCTCG GGGCGTTCGA TGTGAGCCCG GACGGGCGCT GGCTGGCCTA CTCGACCGAC TTCTCCGGCG ACGAGCGGTT CACCCTACGG GTCAAGGACC TCACCACCGG TGAGTTGCTG CCCGACGAGG TGCCCGGCAC GTTCTACGGC ACGGCCTGGT CCGCTGACGC CTCGGTGCTC TTCTACGTCA CCGTCGACGA CGCGTGGCGG CCGAACCGGG TCTGGCGGCA CACACTGGGC ACTCCGGCCG GCGAGGACGT GGTGGTCCAC CAGGAGGACG ACGAGCGGTT CTGGGTCGGG GTCGAACTGA CCCGCTCCGA AAAATTCGTA CTCATCGACA TACACAGCAA GTTGACCAGT GAGATCCTGG CCATCCCCGC CGGCAACCCG ACCGGAGCCC CGGCCCCGGT GGCCCCCCGC CGTCAGGGCG TGGAGTACAC GGTCGAGCAC CACGGCCACC GGTTCCTGAT CCTGCACAAC GACGGCGCCG AGGACTTCGC CCTCGCGTAC ACCTCGGCCG ACGCCCCGGG CGACTGGGTG CCACTCATCG AGCACTCCCC GGGCACCCGC CTGGAGGCGA TCGACGCGTT CGACAACCAT CTGGTGGTCA CGTTACGCAG CAACGGGCTG ACCGGGCTGC GGGTGCTACC GGTCGGCGGT GGCGACCCCC ACGACATCGA CTTCCCCGAA CCGCTGTACA GCGTCGGCCT GGACAGCAAC CCGGAGTACC GCACCTCCCA GCTCCGCCTG CGCTACACCT CGTTGGTCAC CCCGGACTCG GTGTACGACT ACGACCTGGT CACCCGTCGG ATGATCCGAC GCCGGCAGAA GCCGGTGCTA CCCGGGCCAG ACGGTCGCCC GTACGACCCG GCCGGCTACG AGCAGCACCG GGAGTGGGCG CTCGCCGACG ACGGCACCCG GGTGCCGATC TCGCTGGTCT GCCGGGCCGG CACGCCGCGC GACGGCTCCG CGCCGTGCGT CATCTACGGG TACGGCTCCT ACGAGGCGAG CATGGACCCC TGGTTCTCCG TTGCCCGGCT GTCCCTGCTG GACCGGGGTG TCGTCTTCGC CGTGGCGCAC ATCCGCGGCG GCGGTGAACT GGGGCGCCGC TGGTACGACC AGGGGAAGCT ACTGGCCAAG AAGAACACCT TCACCGACTT CGTTTCCTGT GCCCGGCACC TGGTCAAGGC CGGCTGGACG GCGACCGACC GGCTGGTCGC CCGGGGCGCC TCGGCCGGTG GGCTGCTGAT GGGCGCGGTG ACCAACCTCG CTCCGGACGC CTTCGCCGGG ATCGTCGCGC AGGTTCCCTT CGTCGACGCG CTCACCTCGA TCCTCGACCC ATCGCTGCCG TTGACCGTCA CCGAGTGGGA GGAGTGGGGC AACCCACTGG ACGACCCCGA GGTGTACGCG TACATGAAGT CGTACACGCC ATACGAGAAC GTACGGGCCG TGGACTACCC GGCGATCCTC GCGGTGACCA GCCTCAACGA CACCCGTGTG CTCTACCATG AGCCGGCGAA GTGGATCGCG CGACTGCGGG CCACCGCGCC GCAGGGTGAC TACCTGCTCA AAACTGAGAT GGGCGCCGGG CACGGTGGGC CCAGCGGCCG GTACGACGCC TGGCGGGAGG AGGCGTTCAT CAACGCCTGG CTGCTCAACC AACTCGACAG CGCCTGA
|
Protein sequence | MTTETPAPVA RRMPTERTHH GDTVVDEYAW LADKDDPATI AYLTTENAYT EARTAHLTDL RAQLFEEIRQ RTQETDLSVP TRKGGHWYYT RTVEGQQYGV QCRRAVHDGE TAPPVSGDGT PLTDEEVLLD GNLLAEGHDF FALGAFDVSP DGRWLAYSTD FSGDERFTLR VKDLTTGELL PDEVPGTFYG TAWSADASVL FYVTVDDAWR PNRVWRHTLG TPAGEDVVVH QEDDERFWVG VELTRSEKFV LIDIHSKLTS EILAIPAGNP TGAPAPVAPR RQGVEYTVEH HGHRFLILHN DGAEDFALAY TSADAPGDWV PLIEHSPGTR LEAIDAFDNH LVVTLRSNGL TGLRVLPVGG GDPHDIDFPE PLYSVGLDSN PEYRTSQLRL RYTSLVTPDS VYDYDLVTRR MIRRRQKPVL PGPDGRPYDP AGYEQHREWA LADDGTRVPI SLVCRAGTPR DGSAPCVIYG YGSYEASMDP WFSVARLSLL DRGVVFAVAH IRGGGELGRR WYDQGKLLAK KNTFTDFVSC ARHLVKAGWT ATDRLVARGA SAGGLLMGAV TNLAPDAFAG IVAQVPFVDA LTSILDPSLP LTVTEWEEWG NPLDDPEVYA YMKSYTPYEN VRAVDYPAIL AVTSLNDTRV LYHEPAKWIA RLRATAPQGD YLLKTEMGAG HGGPSGRYDA WREEAFINAW LLNQLDSA
|
| |