Gene Information |
Locus tag | Sare_3602 |
Symbol | |
ID | 5707592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4156483 |
End bp | 4157391 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641273026 |
Product | zinc finger SWIM domain-containing protein |
Protein accession | YP_001538391 |
Protein GI | 159039138 |
COG category | [S] Function unknown |
COG ID | [COG4279] Uncharacterized conserved protein |
TIGRFAM ID | |
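
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4156483
END_BP = 4157391


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 909 302
```

Under these assumptions the result agrees with the Gene Length (909 bp) and Protein Length (302 aa) fields.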
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00061687 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCCGC GGGCGACCGG GCGGGACAAG ACGTCCCGTA CGGCTGACCG GTTCGCCGAA TACGGTCCAC CTCGCAGGGT CGAGGGTGGG CTGCGGGCGC GCAGCACCCG TGGCGCGATC GGCCGGTCCT GGTGGTCCCG CCGTTTCCTG GAGGTGCTGG AGTCCTTTGC CCTGGGCACG CGCCTCACCC GCGGCCGGTC GTACGCCCGC GCCGGGCAGG TGATGCGCCT GGACGTGGCG CCGGGTGAGG TGACGGCTGT CGTGCAGGGC TCCCGATCCC GGCCGTACGA GGTCCACATC ACGCTGGCGA CCTTCCCGAT GGCGCTCTGG TCGCGGGTGG AGGAAGACTT GGCCACGCAG GCGTTTTTCA GCGCCCGCCT ACTCGCCGGC GATCTTCCCG CCGAGCTGGA GGAGCTGTTC GCCGCGGCCG GTGCGCCGTT GTTTCCCGCC GGCATGGCCG AGTTGGGCCA GCGGTGTAGC TGCCCCGACG CGGCGGTGCC GTGCAAACAC CTCGCTGCGA CCTTCTACCT CCTTGCCGAG GCGTTCGACG CCGACCCGTT CGCGTTGCTG CACTGGCGTG GGCGTAGCCG GGACGAGTTG CTCGATCAGT TGCGGGTGCG CCGGGGCACC GCCGTAGCAG TGCGGTCCGA CTCGGACAGT GCGCCTGACC TGCCGGTGGC CGGTGCCGCG CGGTCGGTCG CCGGGCTGCC ACCGGTGCCG CTGGCCGACG CGGTGCACCG GTTCTGGTCG CCGCCGGTGC CGCTGCCCGA CCGGCCGCCC ACTCTGGTGA CGGGGACTGA GCTGCTGCTG CGGCAGCTCG GCGCGCCCGC CCCGACCATC GGCGGTCCAG GACTGGTGGA ACGACTGCGG CGGGCGTATC GGCGGCTCGG TGAGCCGGAC GCTCGATGA
|
Protein sequence | MSPRATGRDK TSRTADRFAE YGPPRRVEGG LRARSTRGAI GRSWWSRRFL EVLESFALGT RLTRGRSYAR AGQVMRLDVA PGEVTAVVQG SRSRPYEVHI TLATFPMALW SRVEEDLATQ AFFSARLLAG DLPAELEELF AAAGAPLFPA GMAELGQRCS CPDAAVPCKH LAATFYLLAE AFDADPFALL HWRGRSRDEL LDQLRVRRGT AVAVRSDSDS APDLPVAGAA RSVAGLPPVP LADAVHRFWS PPVPLPDRPP TLVTGTELLL RQLGAPAPTI GGPGLVERLR RAYRRLGEPD AR
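
The relationship between the two sequences and the GC content field can be sketched in a few lines. This is an illustrative sketch only: it assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA even though the strand is "-"), and it relies on translation table 11 sharing its amino acid assignments with the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the record are used as input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; translation table 11 uses the
# same assignments as the standard code (it differs in permitted start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGAGTCCGC GGGCGACCGG GCGGGACAAG"
print(translate(prefix))  # MSPRATGRDK
print(f"{gc_fraction(prefix):.1%}")
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.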