Gene Information |
Locus tag | Sare_2583 |
Symbol | |
ID | 5707168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2941478 |
End bp | 2943301 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641272045 |
Product | hypothetical protein |
Protein accession | YP_001537415 |
Protein GI | 159038162 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain; [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
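The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, although both agree with the listed values. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2941478, 2943301)
print(length, protein_length_aa(length))  # 1824 607
```

Both results match the Gene Length (1824 bp) and Protein Length (607 aa) rows.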
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.11419 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGGCCC TGGGCGACAC CGACGCGCTC GCGGCGCCCG CCGCGGCCGA CCCCGACCGC CGCGACGCCA CCGTGCACCT CACCGCCGCC ACCGTGCTGA TCGGACCGTC CAGCAGCACG GCCGGCCCCG CCTGCGGACA CTGCCTGGCC ATCCGATGGC AGCGGCTGCG CACCCGCAGC CAACGCGACG CCCTGGAAGT CGGCGACCAG ACCATTGCCG TCGGCCCGTG GCCACAGCTG ACCGCCTACC AGCTCGACGC CGTCTGGGAG CTCTACCGCG CCAGCCACAC CGGCCCACCA CTGCCGCCGC CACCAAGCTG GGACCGGCAC AGCACCCCGC TGCCCCGGGT CAGCCAGCTC GACCTGGCCA GCCTGCGGAT CCGCACCTAC CCGGTGCTGG CCGACCCGCG CTGCCCCAGT TGCGCCCGGC AACGACCTGA CACCCCCGAC CCTCTGGTGC TCGCGCACCA ACCCCGGCCC AAGCCGCGCC CCGACACGTA CCGGCTGAGC ACTCCCGACG GCTATCCGCT GCCAGCGACC GCGCTGGTCA ACCCGGTCTG CGGCGCGCTG GGTGCGGGTA CCGCGCTCAC CATCACGTCA CCGACCACAG CACCCGTGAC CGGCAGCGTC TTCATCCGCG GCTACGGCGG GCTGCTCGAC GTCTCCTGGA GCGGCCAGTC CAGCGGCTAC GACGCGAGCC GCTCGCTGGC CTACCTCGAA GGGCTGGAGC GCTACGCCGG AACCCACCGG CGGCGCAACA CCATCCCGGT CGTCGCCGCA TACGCCGATC TCGACACCGA CGCACTGCAC CCGGACCGCT GCGGCAGCCA CCCCGACGAG GTGTACGACA CCGACCCGAT CCTGCGCCGG TTCGACCCAC AACGACCGAT CCCCTGGGTG TGGGGCCAGA ACCTGCACAC CGGCAAGCCC GTACTGGTGC CTCGCCGACT GTGCTTCTAC AGCTCGCCCG CCGCCGGCGA CACCTTCGTC CTCTCCTCCT CCAGCGGCTG CGCCACCGGC AGCTGCCTGG AGGAAGCCGC CCTGTTCGGC ATGCTGGAGC TGATCGAACG CGACGCGTTC CTGCTCGCCT GGTACGGCAA CCTGACCCTG CCCCGGATAG ACCTCGACAC CTGTCCGCCG GTCGTGCGCG CGCTGGTCGA TCGCGCCGAA CTGCAGGGCT ACCGTCTCTA CGCCTTCGAC AACCGGATCG ACCTCGACGT ACCCGTGGTC ACCAGTCTCG CCGTCCGCCA CGACGGCGGT CCCGGCCTGC TGTCGTTCGC CGCCGCGGCC CACCTCGACC CGCGGCAGGC GGTCACCGGC GCGCTCGCCG AGGCGCTCAC CTACATCCCA CACCAGCCCG CCACGGTGCG CCGACGTCGG GCCGAACTGG AGCGGATGGC CGACGACTAC ACCCTCGTCC GCCGGCTGCC GGACCACTCG GCACTGTTCG GCCTACCCCG AATGGCGGTG CACGCCGAAA GCTACCTCGA CGACCGAGGC ACGCTCCCCA TCGAGCACGC GTTCACCGGC TACCGGCCCC CCGGCACGCC GGATCTCCGC GATGACCTGC GCCGGGTGCT CGACCTGCTG GATGCGCGCG GGCTCGAGGC GATCATGGTG GACCAGACCA CACCAGAACA GGAGGCGGTC GGACTCCGCT CGGTCTGCAC GATCGTGCCC GGCCTGCTAC CGATCGACTT CGGCTGGATC CGACAACGGG CTCCGCACCT GCCGCGGCTG CGGACCGCGC CCGTGGTGGC CGGCCTCGCC GACACCGAAC TTACCGACGC CGACTTCCGC 
CTCGTTCCGC ACCCCTTCCC ATGA
|
Protein sequence | MVALGDTDAL AAPAAADPDR RDATVHLTAA TVLIGPSSST AGPACGHCLA IRWQRLRTRS QRDALEVGDQ TIAVGPWPQL TAYQLDAVWE LYRASHTGPP LPPPPSWDRH STPLPRVSQL DLASLRIRTY PVLADPRCPS CARQRPDTPD PLVLAHQPRP KPRPDTYRLS TPDGYPLPAT ALVNPVCGAL GAGTALTITS PTTAPVTGSV FIRGYGGLLD VSWSGQSSGY DASRSLAYLE GLERYAGTHR RRNTIPVVAA YADLDTDALH PDRCGSHPDE VYDTDPILRR FDPQRPIPWV WGQNLHTGKP VLVPRRLCFY SSPAAGDTFV LSSSSGCATG SCLEEAALFG MLELIERDAF LLAWYGNLTL PRIDLDTCPP VVRALVDRAE LQGYRLYAFD NRIDLDVPVV TSLAVRHDGG PGLLSFAAAA HLDPRQAVTG ALAEALTYIP HQPATVRRRR AELERMADDY TLVRRLPDHS ALFGLPRMAV HAESYLDDRG TLPIEHAFTG YRPPGTPDLR DDLRRVLDLL DARGLEAIMV DQTTPEQEAV GLRSVCTIVP GLLPIDFGWI RQRAPHLPRL RTAPVVAGLA DTELTDADFR LVPHPFP
|
| |
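The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool that produced the record: it assumes the listed gene sequence is already in coding orientation (it begins with GTG and ends with TGA even though the gene lies on the minus strand), and it uses translation table 11, in which an alternative start codon such as GTG is read as methionine at the first position. The names `translate_table11` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Amino acids for codons ordered with bases T, C, A, G (NCBI layout).
# Table 11 shares this mapping with the standard code; it differs in
# which codons may act as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate_table11(seq: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_table11("GTGGTGGCCC TGGGCGACAC CGACGCGCTC"))  # MVALGDTDAL
```

The printed residues match the first ten residues of the protein sequence. Passing the complete gene sequence to `gc_fraction` would give a value to compare against the GC content row.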