Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3326 |
Symbol | |
ID | 5703818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3838573 |
End bp | 3839553 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641272753 |
Product | hypothetical protein |
Protein accession | YP_001538120 |
Protein GI | 159038867 |
COG category | [S] Function unknown |
COG ID | [COG1481] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00647] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00142775 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGATGA CGGCGGCGGT CAAGGACGAG CTGAGCCGAG TCGACGTGCC CAAGCCCTGC TGCCGGCGGG CGGAGATGGC GGCGCTGCTG CGCTTCGCCG GCGGGTTGCA CATCGTCTCC GGGCGGGTGG TCGTGGAAGC CGAACTGGAC ACCGGGGCGG TGGCCCGACG ACTGCGGCGC GAGATCGCCG AGGTGTACGG GTATCCCAGC GAGATTCATG TCCTCGCCTC CGGCGGCCTG CGCAAGGGCA GCCACTTCAT CGTGCGGGTG GTCAAGGACG GCGAGTTCCT CGCCCGGCAG ACCGGCCTGC TCGACGTCCG TGGGCGCCCG GTGCGGGGCC TGCCACCGCA CGTGGTCGCC GCCAACGTCT GCTGCGCGGT CTCGGCGTGG CGGGGCGCGT TCATGGCGCA CGGCTCGCTG ACCGAGCCGG GCCGCTCCAG CGCCCTGGAG ATCACCTGCC CCGGCCCGGA ATCGGCGCTG GCCCTGGTCG GTGCGGCCCG CCGGATCGGT ATCGCCGCGA AGAACCGCGA GGTGCGCGGC GTGGATCGGG TGGTCGTCAA GGACGGCGAC GCCATCGCCG CCCTGCTCAC CCGGATCGGT GCCCACGCCA GCGTGCTGGC CTGGGAGGAA CGCCGGGTCC GGCGGGAGGT GCGGGCCACC GCGAACCGGC TGGCCAACTT CGACGACGCG AACCTGCGCC GGTCGGCGCG GGCGGCGGTG GCCGCCGCCG CGCGGGTCAC CCGAGCCCTG GAGATCCTCG CCGACGACGC CCCGCATCAT CTGACCTCGG CCGGGCGGCT GCGGCTGGAA CATCGCCAGG CGTCGCTGGA GGAACTGGGT GCGCTCGCCG ACCCGCCGTT GACCAAGGAC GCGATCGCCG GGCGGATCCG GCGGCTGCTC GCGCTCGCCG ACAAGCGGGC CCGTGACCTC GGCATCCCGG ATACCGAAGC GGCAGTCACG CCGGACATGC TCGTGGTCTG A
|
Protein sequence | MAMTAAVKDE LSRVDVPKPC CRRAEMAALL RFAGGLHIVS GRVVVEAELD TGAVARRLRR EIAEVYGYPS EIHVLASGGL RKGSHFIVRV VKDGEFLARQ TGLLDVRGRP VRGLPPHVVA ANVCCAVSAW RGAFMAHGSL TEPGRSSALE ITCPGPESAL ALVGAARRIG IAAKNREVRG VDRVVVKDGD AIAALLTRIG AHASVLAWEE RRVRREVRAT ANRLANFDDA NLRRSARAAV AAAARVTRAL EILADDAPHH LTSAGRLRLE HRQASLEELG ALADPPLTKD AIAGRIRRLL ALADKRARDL GIPDTEAAVT PDMLVV
|
| |