Gene Information |
Locus tag | Sare_2053 |
Symbol | |
ID | 5704725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2348505 |
End bp | 2349515 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641271540 |
Product | hypothetical protein |
Protein accession | YP_001536911 |
Protein GI | 159037658 |
COG category | |
COG ID | |
TIGRFAM ID | |
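
The coordinate and length rows above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS length counts the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information rows of this record.
START_BP = 2348505
END_BP = 2349515
GENE_LENGTH_BP = 1011
PROTEIN_LENGTH_AA = 336


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2349515 - 2348505 + 1 = 1011 bp, and 1011 / 3 = 337 codons, of which one is the stop codon, leaving 336 residues.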
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.479662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000642876 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | TTGAGACACA CGGACTACTT CAAAAACTTA CTTGACAAAA CCGTCAACCT GTCCAAGTTC AAGCTTGAGT TGCTGACCGA GCGGGTAGAC AGCATTTACG GCGTCCTAAA GGACGACGAG CATCTCGGTC CGCTGATCAA GAAGAAGATC CCGCAGGGCT CTTGGCCTCA GAGGACCATC ATTAACCCGC AGAACGGGAA GCCGTTCGAC GCTGACTTTA TGGTGCAGAT GGTCGAGGAC CCCGACTGGG CCGACGACCT GAAGAAGTAC GGCGATGCGA TCTACAAGGT GATTCACAAC ATGTCGCCCT ACAAGGACAT GCCGCATGGC CGGAAGTGCC GGTGTGTCTA CGTCACCTAC GCGAAAAACG CGATGCACGT CGACATCGTT CCGTTCGTGG TCCGCGCTGA CGGGACTCAG TGGATCATTA ACCGAGACGC CAACATGTGG GAGCGCACTG ATACCGACGG TTTCACTAGC TGGATGAAGG AGAAGGACGC GATCGCGGGA AACCATCTTC GTGAAGTCAT CCGCATCATG AAGTTCTTGC GCGATCACAA GAACTCGTTC ACTGGAACAA AGTCGATCCT GCTCACGACA GTGCTGGGTA TGCAGGTCGA AGCCTGGCGC AAAGTCCTGG AACCCGGGTA TTACGCGGAC CTACCGACCG CGCTGCTGCA CATCGTCTCC GACCTCAACA AGTGGCTTCA GGCTAACCCG ACCAAGCCAT TGATCATGGC CCCGTCCGGT TCGGGCACCT CGTTCGACCA CCGGTGGTCG CAGGCGACCT ATGCGTATTT CCGGGACCGC ATCCACACTC ACGCCGCCGA GATCAAGGCC GCCTACGATG AGGCCGACTT CGACGAGAGC GTGAAGAAGT GGCAGAGCTT GTTCGGTGAC GGCTTCACGG CCCCGAGTTC CTCTACGTCG AGCGGGAAGT ACAGCACCGT AGGAACGGGT GCTGGCACGG CCGGCGCCGC AACCGTCAAC CTCTCCGGAA GAGCGGGGTG A
Protein sequence | MRHTDYFKNL LDKTVNLSKF KLELLTERVD SIYGVLKDDE HLGPLIKKKI PQGSWPQRTI INPQNGKPFD ADFMVQMVED PDWADDLKKY GDAIYKVIHN MSPYKDMPHG RKCRCVYVTY AKNAMHVDIV PFVVRADGTQ WIINRDANMW ERTDTDGFTS WMKEKDAIAG NHLREVIRIM KFLRDHKNSF TGTKSILLTT VLGMQVEAWR KVLEPGYYAD LPTALLHIVS DLNKWLQANP TKPLIMAPSG SGTSFDHRWS QATYAYFRDR IHTHAAEIKA AYDEADFDES VKKWQSLFGD GFTAPSSSTS SGKYSTVGTG AGTAGAATVN LSGRAG
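
The relationship between the two sequence rows can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it assumes the space-separated 10-character blocks are display formatting only, that the first codon of an annotated CDS is read as methionine (TTG is an alternative initiator under translation table 11), and that the amino acid assignments of table 11 match the standard code. The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical.

```python
# Standard codon table in the conventional TCAG ordering; translation
# table 11 uses the same amino acid assignments (assumption noted above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing between blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the initiator codon as Met (assumption)."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = ["M"]  # first codon (e.g. TTG) is treated as the start
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 18 bases of the Gene sequence row, with the display spacing kept.
    print(translate_cds("TTGAGACACA CGGACTAC"))
```

Passing the full Gene sequence row to `translate_cds` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content rows of this record.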