Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2590 |
Symbol | |
ID | 5707175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2951729 |
End bp | 2953621 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641272052 |
Product | hypothetical protein |
Protein accession | YP_001537422 |
Protein GI | 159038169 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.188065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.130609 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACG GTACGACCGC GGCCGACGAC CTGGTCCGTG CCGGGCTGGC CCGGCTGCTC GCCGAACGGG GCTGCGACGC GGACGTGGTC GCGGTGGGGG TGCGGGACGA ACTCAGCGCG ACCCCCCGCC CGGAGCGCGG ACCGCGGGTG CTGCTCTACG GGCAGCTGGC GCTGATCGGA CCGGTGCCGC CGGCCGCGGC CTGTTCGATC TGCCTCGCCC GTCGCTGGCA GGCGGTGCGC CACCGCGACC TGCGCGACGC GCTGGAACTG GGCGGTGACA CGGTCGCGGC GGGCCCCTGG CCGTACGCGA CCCCGCTCGT CGCGGACTTC CTGCATGCCC TGATCGCAGC CCGCCGTACC GCCGCCACCG GGCCGGCAGG CACCGGGACG GTGACGCCCA GCACGGTGTT TCAGGTCGAC CTGTCGACCC TGCGGGTGCA CCAGGTGCCG CTGCTGCCCG ACCCCGAGTG CCCGGCCTGC GGCGACATGG TCACCGATGC ACCGGAACTG GCGGCGATCG AGCTGCCCGC CACACCGAAA TCGGAGCCGG GCACGTTTCG CGGCCGGGAC CTGGACGACT ACCCGCTGAA CGTGGCGGCG TTCGCCAACC CCGTGTGCGG GGCGCTCGGC GCCAGTCTGT GGCAGGACGT CACCTCGCTG TCCACCTCCC CGGCGGTCGG CTCGTTCACC CTGCGATGTG GGCGCTTCCT GCGGGAGACG CTCTACGGTG GGCACACCGA CGGCTACCGG AGCAGCGCCC GGATCGCGGT GCTGGAAGGA CTGGAACGCG CCGCCGGCCT GCGCCCGCGG GGAAAACGCA CCGCGGTCAC CGCGACCCTG CGGGAACTGG GCGACGAGGC CCTGGACCCA CGCGAGTGTG GGCTCTACAC CGACGCCTTC TATCAGGCCG CTCCGTACCT GCACCGGTTC GACGTGGACC GGCCGATCAC GTGGGTGTGG GGCTGGTCGC TGCGCGACCA GCGCCCGCTG CTGGTGCCGG AGGTCCTCGC CTACTACCAC GCGGCGAGTG TCGAGGAGCG GTTCGTCCAG GAGACCTCGA ACGGCTGCGC CTCCGGCGGA TCGATGGTGG AGGCGATCTA CCACGGCCTG ATGGAGGCGA TCGAGCGCGA CGCGTTCCTG CTGGCCTGGT ACGGCGGCCG GTCCCTACCG GAGATCGACC CGGCCACCAT CGACCGACCA CGGACCCGGA TGATGGTCGA CCGGCTGGCG ATGTACGGCT ACCGGGCCCG ATTCTTCGAC ACCCGGATGA CCTTCGACAT CCCGGTGGTG ACCGCGGTGG CCGTCCGTGC GGACGGCGGT CTCGGTACCC TCGCCTTCGG CGGTGGGGCG AGCCTCGACC CGCAGGCCGC GATCACCGCG GCGCTCTGCG AAATCGCCAC CGACTCGGTG ATGGTCCGGG TCCGCGCCCG CGCCGACGAA ACCCGGTTAC GTCAGATGAC GACCGACTTC TCCCGGGTGC AGAGCCTGCA CGACCACCCG TTGCTCTACG GCCTGCCGGA GATGGCGCGG CACGCCGCGT TCCTGCTGGA ACACGGCAGG GCGCCGGTCC CGATGGCGCA CCTGTACGAG CGGGACCGTC CCGCCCCACC GGTCACCACC GACCTGCGCG ACGACCTCGA ACGCTGCCTG AAGCAGGTGA CCGCGCAGGG CTTCGACGTG ATCGCCGTCG ACCAGACCAC CCCCGAGCAG CGCGAGCTGG GCCTGACGAC GGTGAGCGTG GTGGTCCCGG GACTGCTGCC GATCGACTTC GGCTGGCTAC GCCAGCGCGC CCCGCACGCG CGGCGGCTGC GGACCGCGTT CCGCACCGCC GGGCTCCTGC ACCGCGACCT GCGCGACGAC GAAATCCACT CCGTTCCCCA CCCGTTCCCG TGA
|
Protein sequence | MNDGTTAADD LVRAGLARLL AERGCDADVV AVGVRDELSA TPRPERGPRV LLYGQLALIG PVPPAAACSI CLARRWQAVR HRDLRDALEL GGDTVAAGPW PYATPLVADF LHALIAARRT AATGPAGTGT VTPSTVFQVD LSTLRVHQVP LLPDPECPAC GDMVTDAPEL AAIELPATPK SEPGTFRGRD LDDYPLNVAA FANPVCGALG ASLWQDVTSL STSPAVGSFT LRCGRFLRET LYGGHTDGYR SSARIAVLEG LERAAGLRPR GKRTAVTATL RELGDEALDP RECGLYTDAF YQAAPYLHRF DVDRPITWVW GWSLRDQRPL LVPEVLAYYH AASVEERFVQ ETSNGCASGG SMVEAIYHGL MEAIERDAFL LAWYGGRSLP EIDPATIDRP RTRMMVDRLA MYGYRARFFD TRMTFDIPVV TAVAVRADGG LGTLAFGGGA SLDPQAAITA ALCEIATDSV MVRVRARADE TRLRQMTTDF SRVQSLHDHP LLYGLPEMAR HAAFLLEHGR APVPMAHLYE RDRPAPPVTT DLRDDLERCL KQVTAQGFDV IAVDQTTPEQ RELGLTTVSV VVPGLLPIDF GWLRQRAPHA RRLRTAFRTA GLLHRDLRDD EIHSVPHPFP
|
| |