Gene Information |
Locus tag | Sare_3354 |
Symbol | |
ID | 5705860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3872287 |
End bp | 3873927 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641272780 |
Product | X-Pro dipeptidyl-peptidase domain-containing protein |
Protein accession | YP_001538147 |
Protein GI | 159038894 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
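The coordinate and length fields in this table agree with each other, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(3872287, 3873927)
print(length)                  # 1641
print(protein_length(length))  # 546
```

Both results match the Gene Length (1641 bp) and Protein Length (546 aa) fields reported above.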
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.417425 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0112773 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCTCGCGG CGCTGGCGAC CCGGGTGGCC ACGGCCGCGT TGCGCCTGCC GCCGTCACGT ACCCGCCGGG TCACCCTGAC CCGCGACATC CTGGTCCGGA CCCGCGACGG CGTGTCGCTG CGCACCGACC ACCACGCCCC GGACCGGCCG GCCGCACCCA CGGTGCTCAT CCGCACCCCG TACGGGCGGG GTGGGCCGAT GCGCCTGCTC GGCCGGCTCG CCGCCGAGCG GGGCTACCAC GTGGTGATCC AGTCCTGCCG GGGTACCGGT GGGTCCGGCG GGCTGTTCGA CCCGCTGGTG CACGAACGCG ACGACGGCCT GGACACCCTC GACTGGCTGC GCCGCCAGTC CTGGTGGAAC GGCACATTCG GCATGTTCGG GGCCAGCTAC CAGGGCTTCG TCCAGTGGGC CGTCGCCGCT GACGCCGGGG CCGACCTTCG CGCGATGGTC GCGGTGGTGA CCGCCTCCGG CACCCGCGAC TCGACGTATC CGGGCGAGTC CTTCGCCCTG GACACCGTGC TCACCTGGGC CGAACTGCTC CAGGCGCAGA CCGTCGGGTG GCTCGCCCGG CAGTGGGAAC TCAAGCGTGG CCAGCCCCGG CTCGCCGCCG GGTTGTCCCA CCTGCCGCTG GCCGAGGCGG ACCGGGTGGC CACCGGCGTC ACCGTGCCCT TCTTCCAGGA ATGGTTACGC CACCACACCC CGGACGCGGC GTACTGGCGG AGGCGGGTCT TCGGTGACCG GCTTGCGGAG GTCCACGCCC CCGTTTCCAT GATCAGCGGC TGGCACGACA TCTTTCTCCC CGCCCAGTTG CGGGACTTCG CGGCCCTGCG TGCCGCCGGT GCCGCGCCCC GGCTCACCGT CGGGCCGTGG ACGCACGGCA GCCCCGGGCT GTTCGTCGCC GCGCTCCGCG ACGGACTGGA CTGGATCGAC CAACATCTGG GCGGGTACCC GGGGCGTCAC CGCGCCCCGG TCCGCGTGCA CGTCGGCGGG GCCGGCGGCG GCTGGCGAGA TCTGCCAGAC TGGCCGCCAC CAGGCACGCC GACCGCCTGG CACCTGCACC CACACGGTGC GCTGCGGGCC ACGCCGCCGC CGGTGTCGAC CCCAGACGGT TTCTGGTACG ACCCGGCCGA TCCCACCCCC TCGGTGGGCG GCCCGCTGCT GGTGGCCCAA CAGGCCGGCA AGGTGGACAA CCGGCCCGTC GAGGCCCGCT CCGACGTGCT GACCTGGACC AGCGCGGCGT TGACCGCGGC AGTGGAAGTC ATCGGACCGG TCCAGGCCGA GATCTTCGTC CGCAGCGAGC TACCTCACCT GGACGTTTTC GTGCGGCTGT GCGACGTGGA CCGCCGGGGT CGCTCCTGGA ACGTCTGTGA CGGGCTGGTC CGGGTCAGGC CGCCCGCCTT CTCGCCCGAC CAGACGAGCG CGGTCCGCGT CGCGGTGCCG TTGTGGCCGG TGGCCCACCG GTTCGCCGCC GGTCACCGAC TGCGGGTGCA GATCTCCGGC GGGGCCCACC CCCGGTACGC GCGTAACCCC GGCACCGGCG AACCGCTCGG CACCGCGGTC ACCCTGCGCG CCGGATGGCG GGAGATCCTG CACGATCCGC AGCACCCGTC CGCGCTGGTG CTACCCACTG TCGAGGGTTG A
Protein sequence | MLAALATRVA TAALRLPPSR TRRVTLTRDI LVRTRDGVSL RTDHHAPDRP AAPTVLIRTP YGRGGPMRLL GRLAAERGYH VVIQSCRGTG GSGGLFDPLV HERDDGLDTL DWLRRQSWWN GTFGMFGASY QGFVQWAVAA DAGADLRAMV AVVTASGTRD STYPGESFAL DTVLTWAELL QAQTVGWLAR QWELKRGQPR LAAGLSHLPL AEADRVATGV TVPFFQEWLR HHTPDAAYWR RRVFGDRLAE VHAPVSMISG WHDIFLPAQL RDFAALRAAG AAPRLTVGPW THGSPGLFVA ALRDGLDWID QHLGGYPGRH RAPVRVHVGG AGGGWRDLPD WPPPGTPTAW HLHPHGALRA TPPPVSTPDG FWYDPADPTP SVGGPLLVAQ QAGKVDNRPV EARSDVLTWT SAALTAAVEV IGPVQAEIFV RSELPHLDVF VRLCDVDRRG RSWNVCDGLV RVRPPAFSPD QTSAVRVAVP LWPVAHRFAA GHRLRVQISG GAHPRYARNP GTGEPLGTAV TLRAGWREIL HDPQHPSALV LPTVEG
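The relationship between the two sequences above can be sketched in a few lines of Python. The gene sequence appears to be given in coding orientation (it begins with the GTG start codon and ends with TGA) even though the gene lies on the minus strand, so no reverse complement is applied here. The sketch assumes translation table 11, which uses the standard amino acid assignments but allows alternative start codons such as GTG to be read as methionine. The names `translate` and `gc_content` are illustrative helpers, not functions from any particular library.

```python
from itertools import product

# Standard amino acid assignments (shared by translation tables 1 and 11),
# listed in TCAG order for the first, second, and third codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in the first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence above.
print(translate("GTGCTCGCGG CGCTGGCGAC CCGGGTGGCC"))  # MLAALATRVA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` gives a value to compare against the reported GC content field.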