Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1369 |
Symbol | |
ID | 5707288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1584181 |
End bp | 1585245 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641270880 |
Product | hypothetical protein |
Protein accession | YP_001536261 |
Protein GI | 159037008 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.791695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00309173 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGTCG GGGTCGCACT CGTCACCTCG GTGGTTCTGC TGGCGTTGAA CGGCTTCTTC GTCGCCGCCG AGTTCGCCCT CGTGGCGAGC AAGCGCTACC GGCTGGAGCA GGCCGCTGCC GGTGGTGGCC GGGCAGCCCG AGCCGCACTG GACGGCGTAC GGGAGTTGTC GCTCATGCTG GCCGGCGCGC AACTGGGCAT CACCCTGTGC ACGCTGGGCC TGGGCGCGCT GGCCGAACCG GCGATCGAGC GTCTGCTCAG CCCGTTGCTG CACGCCGTCG GGCTGCCCAC CGCGGCGAGC CACGTCATCG CGTTGATCTT CGCGCTGAGC TTGGTGACCT TCCTGCATCT GGTGGTGGGG GAGATGGCGC CGAAGTCGTG GGCGATCAGC GACGCCGAAC GGTCCGCGGT CTTGTTGGCG CTGCCGTTCC GCGCTTTCGC CCGGGTGTCC CGGCCGGTGT TGTCGGCACT GAACTCGATG GCGAACGGCA TCCTGCGCCT GTTCAAGGTC AAGCCGCAGG ATCAACTGGC CCAGGTGCAC GGCCCGGAGG AACTGCGCAT CCTGCTGGAG CAGTCCCGTG AACACGGGCT GCTCGGTGCC GAGCAGCACG AGTTGCTGAC CAGCATGCTG GAGCTGCAGG GCACGACGGT GGCCCAGGTG ATGGAGCCGT TCGATCGGAT CGTCACCGTG CGACGGCACG CGGACGTAGG CCGGATCGAG CAGGTCAGCC GCGACAGCGG GCGGTCCCGC CTGGCGGTGC TCGACGAGGC CGGTGACGTG TGTGGGCTGG TGCACGTGCG GGAGGCGGTC CGGGCCGCGG TCAGCCGTCC GACGGCGACC GCCGGGGAGC TGATGACGGC CGCGTTCACC CTGCCCGCGT CGGCGACGGT CACCGAGGCG GTGGCGGCGA TGCGGGCCCG ACGTTCGCAG CTGGCCTTGG TCCGTAACGG CGGGGGGCCG GCCCGTCCGG TCGGTTTCGT CGCGCTGGAG GACCTGCTGG AGGAGGTCAT CGGCGAGTTC GACGATGAGA CGGATCCGGT TCCTCGGGGG CGGCGGTTGC GCTGA
|
Protein sequence | MSVGVALVTS VVLLALNGFF VAAEFALVAS KRYRLEQAAA GGGRAARAAL DGVRELSLML AGAQLGITLC TLGLGALAEP AIERLLSPLL HAVGLPTAAS HVIALIFALS LVTFLHLVVG EMAPKSWAIS DAERSAVLLA LPFRAFARVS RPVLSALNSM ANGILRLFKV KPQDQLAQVH GPEELRILLE QSREHGLLGA EQHELLTSML ELQGTTVAQV MEPFDRIVTV RRHADVGRIE QVSRDSGRSR LAVLDEAGDV CGLVHVREAV RAAVSRPTAT AGELMTAAFT LPASATVTEA VAAMRARRSQ LALVRNGGGP ARPVGFVALE DLLEEVIGEF DDETDPVPRG RRLR
|
| |