Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3084 |
Symbol | |
ID | 5706819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3490450 |
End bp | 3492786 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641272520 |
Product | hypothetical protein |
Protein accession | YP_001537888 |
Protein GI | 159038635 |
COG category | [S] Function unknown |
COG ID | [COG3472] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00108127 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAAGA CGTTTGAGAG TGGCGATGTT CCGCTGGCAG ACCTGCTGAA TCAAGCACGC CAGGGTGTGC TGCAGCTTCC TGACTTCCAA CGGGGATGGG TCTGGGACGA CGACCACATC GTGAGCCTGC TCGCCTCGGT CTCCCGTTCG TTTCCCATCG GTGCGGTGAT GACGCTGGAG ACCGGCAACG CTGAGGTCCG CTTTCGGCCC CGACCGTTGG AGGGAGTCCC GAGCGGCACA GCGGCTACCA ACCCGAAGTA CCTGCTGCTC GATGGCCAGC AGCGGACTAC CTCGCTCTAC CTGGCGCTTC GTTCTGGGGC ACCGGTCCGT ACCCGGGACA CCAGGAAGAA GGAGGTGGAG CGTTGGTACT TCGCCGACAT CGACGCCTGC ATCGACCCGG ACGCCGACCG GACCGAGGCC ATCCTGTCGC TCCCGGCTGA CAAGCGGCGG CTGAGCTCCC GAGGCGAGGT GCTGCTCGAC GCCTCCACGA CCGAGGCGCA GGTGCGAGCC GGCAAGGCAG GTCTCTTCCC GCTCGACCTC GTCCTCGATC AGAATGCCAC CCTTGACTGG CAATACGCCT ACCTGCAGGC GGGCCCGTCG ATGGATCAGC AGCTCCAGAT CTGGAAGCAG TTTCAGGAGT CGCTGATCTC GCCCTTCCTG CACTACAGCG TTCCTCGGAT CGCCCTCGAC CAGAAGACTT CCAAGGAGGC CGTTTGCCAG GTCTTCGAGA AGGTGAACAC CGGTGGTGTC GAGCTGACGG TCTTCGAACT GCTGACGGCG ACGTTCGCGG CGGAGAACTT CCGGTTGCGC GACGACTGGG AGCGGCGACA AGCGACCTGG GCTGATGAGC CGCTGCTCGC CGACCTGGAC GCCACGACCT TCCTGCAGAT CGTGACACTG CTCTGGACGC GAAGCCGTTG GGAGGAACGT ATGCGGGAGC GTGTTCGTGG TGACCGTGTC CCGGCGGTCT CCGCGAAACG TCGGGAGATG CTGTCGCTGC CATTGAGCGG CTACCGGGGC TGGGCGGATG CTGTTACCGA CACGCTGCAG CGGGTGGTGC GTTTCCTGCA CGGCGAGCGG ATCTTCCGAA GCCGTGATCT CCCGTACAGC ACGCAGTTGG TTCCGTTGAC CGCGATCCTT ACGCTGCTCG GGGAGGACGC CTTCACGCCA GGGCCCCGCG CGAAGCTTCG TCAGTGGTAC TGGTCCGGGG TGTTCGGAGA GCTGTACGGC GGCACCACCG ACACCCGTTT CGCCAACGAC CTGCAAGACG TGCTCGCCTG GATCCTGAAC GACGGCGAGG AACCCCGCAC CGTCCGCGAG TCTCAGTTCC AGGCCGAGCG GTTGCTGGGC CTCCGGACCC GGAACAGCGC CGCCTACAAG GGCCTGTACG CGCTCGCCAT GAAACGCGGC GGCCGAGATT TCCGGACCGG CGACACAATC GACGCCAAGG CCTACGCGGC CGACTCCATC GACATCCGCC ACGTCTTCCC GCAGAAATGG TGCGCGGCGA ACGGCATCGA CAGCAACTAC GCCAACTGCA TCGTGAACAA GACAGCCATC GACGGGCAGA CCTGGGGATA CATCAGCAAC AACGCGCCAA GCCAGTACCT GGCCGCGATC GAAAGTGACC TGCCGGTGAG CTCGCAGGAC CTGGACGCGA TCATCGCCAG TCACGACATC GACCCCGTCG CACTGCGGCA GGACGACTTC CGTGCCGTCT TCGACGCCCG CTACGAGCGC TTGATCCGAC AGATTGAGGA CGCGACGGGC CGGCCGGTGA ACCGAGGAGA CAGTCACGGC AGTCCCTTCG CCACGCATCA GGGCGGAGCC GCGCTGGCCC GTAGCATCCA GGCGCTCATC AGGGCCGGCG AGAGCAAGAT CGTCGAGTTC AACTCGACCG GGCGGAAGAA CCTTTCCACC GGCCAGAAAG ACCGCGAGAT CGAGTGGGCG GTCACCAAGA CGATCGCCGG CTTCATGAAC GGTCACGGCG GGACGCTGCT GGTCGGCGTT GAGGACGACG GAAAGGTCAT CGGTTTGGAA GAAGACCTGA CGATCTTCAC CAAGAAGAAC ACCGACGCCT GGGAACAGTG GCTCACTCAT CTGCTCATCC AAGATTTCGG CAAGGCCCCG ACGGCCAACG TGACCGTCAG GTTCGGCACC ATCGAGGACC GGACGGTTGC CCGGATCGAT GTCGCGCTCA CCTCGGAGCC GGTGTATACA CTTCGCACGA AGACCGGGAT GAAGGGGGCG GTCTTCCTCG TGCGCCTCAA CCACACGACG CAGGAGGTCG CTGGGCCCGA GGCTTACGCC TACCAGCACA ACCGATGGTT CAAGTGA
|
Protein sequence | MAKTFESGDV PLADLLNQAR QGVLQLPDFQ RGWVWDDDHI VSLLASVSRS FPIGAVMTLE TGNAEVRFRP RPLEGVPSGT AATNPKYLLL DGQQRTTSLY LALRSGAPVR TRDTRKKEVE RWYFADIDAC IDPDADRTEA ILSLPADKRR LSSRGEVLLD ASTTEAQVRA GKAGLFPLDL VLDQNATLDW QYAYLQAGPS MDQQLQIWKQ FQESLISPFL HYSVPRIALD QKTSKEAVCQ VFEKVNTGGV ELTVFELLTA TFAAENFRLR DDWERRQATW ADEPLLADLD ATTFLQIVTL LWTRSRWEER MRERVRGDRV PAVSAKRREM LSLPLSGYRG WADAVTDTLQ RVVRFLHGER IFRSRDLPYS TQLVPLTAIL TLLGEDAFTP GPRAKLRQWY WSGVFGELYG GTTDTRFAND LQDVLAWILN DGEEPRTVRE SQFQAERLLG LRTRNSAAYK GLYALAMKRG GRDFRTGDTI DAKAYAADSI DIRHVFPQKW CAANGIDSNY ANCIVNKTAI DGQTWGYISN NAPSQYLAAI ESDLPVSSQD LDAIIASHDI DPVALRQDDF RAVFDARYER LIRQIEDATG RPVNRGDSHG SPFATHQGGA ALARSIQALI RAGESKIVEF NSTGRKNLST GQKDREIEWA VTKTIAGFMN GHGGTLLVGV EDDGKVIGLE EDLTIFTKKN TDAWEQWLTH LLIQDFGKAP TANVTVRFGT IEDRTVARID VALTSEPVYT LRTKTGMKGA VFLVRLNHTT QEVAGPEAYA YQHNRWFK
|
| |