Gene Information |
Locus tag | Strop_2151 |
Symbol | |
ID | 5058614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 2435178 |
End bp | 2436668 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640474414 |
Product | cupin 4 family protein |
Protein accession | YP_001158980 |
Protein GI | 145594683 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
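The coordinate and length fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2435178, 2436668)
print(length_bp)                  # 1491
print(protein_length(length_bp))  # 496
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.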
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.247247 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.537762 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCCGAT GTGTCTCGGT CGAACCGGCG ACGTTCGCCG CCGCGCACTG GGGACAGACA CCGCTGCTGT CCCGCGCCCA CGAGCTGCCC AACCCGAGTG GCTTCCGCGA CCTACTCAGC CCGGCAGACG CCGACGACCT GCTCAGCCGG CGCGGCTTAC GTACCCCGTT CCTGCGGGTG GCGCAGGACG GCGTGTTGGT GCCGGCGGCC CGGTACACCG GCGGCGGGGG CGCAGGCGCC GAGATCACCG ACCAGGTCCT CGACGAGAAG ATCCTCGACC TGTACGCCGG TGGGGCCACC CTGGTGTTGC AGGGCCTGCA CCGGACCTGG CCGGCACTCA TTGACTTCGC CCGGGACCTC GGCCTGGCTG TCGGGCAGCC ATTGCAGGTC AACGCCTACC TGACTCCCGC CGGCAGTCAG GGATTTGCCA CCCACTACGA CACGCACGAC GTCTTCGTCC TTCAGGTCGA CGGTGGGAAG CACTGGCGAA TCCACCCGCC GGTGCTCCCC GATCCGCTGG AACGGCAGCC GTGGGGCGGC CGAGCTGACG AGGTCGTCGC GACCGCGACG GGCGCACCCG CCCTCGACGT GCTGCTCGCC CCCGGAGACG CGTTGTACCT GCCCCGTGGC TGGTTGCACA GTGCTGCGGC TCAGGAGCGC AGCTCACTCC ACCTGACCGT CGGCGTTCGG GCGCTGACTC GGTACACGCT GGTCGAGGAG TTGCTCGCCC TCGCTGCCGA GGATCAGCGG CTGCGGGCCA CCCTGCCATT CGGGATCGAC GTCTCCGCTC CGGAGGCGGT GGAGCCCGAG TTGACCGAGA CGGTGGAGAT ACTGCGCGAC TGGCTGCGCC GCGTCGATCC GACAGCGCTC GCCGCCCGGC TGCGGCAGCG TGCATGGCCT GCGGCCCGGC CGGCTCCGCT GCACCCCCTT GCCCAGGCGG CCGCGTTGGG CGCGCTCGGC CCGGACAGCC GGGTCACTCC CCGCCCGGGC CTGCGGTGGC AGCTCACCCC GGCGGGGGAG CGGGTGACGT TGCGAGTCTT CGACCGCACC ATCACCCTGC CGCAGATGTG CGCCCCGGCC CTACGCGCGT TGCTCTCCGG TGAGGTCAGC CGGGTGGGGG ACCTGCCCGG TCTGGCCGAC GACACCGACC GGGTCACCCT CGTGCGCCGC CTGCTCCGGG AGGCCGTCGC CGTACCTGCG CACGGCAGCT GTCCGAACCT GCCCGACGGG GGCGCGGCGG GTGACCCGCC GCCACCACCG ACCGGTTCAA CGGACCAGGG TCAGGAGAAG CACCGTCGTC AGCAGTCCGG CGACCACCAC CACGGCCATC GGGCGTGGGC CGAGCACCGC GTGTCGCCGG TCGGCCAGAA CCTTCGCCGG GGCCAGGCCG ACCAGCGGGC CGAGCAAGGT CACGACCAGC GCCAGCACCG GCAGCCCGAC CGCCAGGTCC CCGCCGTACC GGTCGCCGGC CGTCCGGGCG ACGGTGCCTA G
|
Protein sequence | MARCVSVEPA TFAAAHWGQT PLLSRAHELP NPSGFRDLLS PADADDLLSR RGLRTPFLRV AQDGVLVPAA RYTGGGGAGA EITDQVLDEK ILDLYAGGAT LVLQGLHRTW PALIDFARDL GLAVGQPLQV NAYLTPAGSQ GFATHYDTHD VFVLQVDGGK HWRIHPPVLP DPLERQPWGG RADEVVATAT GAPALDVLLA PGDALYLPRG WLHSAAAQER SSLHLTVGVR ALTRYTLVEE LLALAAEDQR LRATLPFGID VSAPEAVEPE LTETVEILRD WLRRVDPTAL AARLRQRAWP AARPAPLHPL AQAAALGALG PDSRVTPRPG LRWQLTPAGE RVTLRVFDRT ITLPQMCAPA LRALLSGEVS RVGDLPGLAD DTDRVTLVRR LLREAVAVPA HGSCPNLPDG GAAGDPPPPP TGSTDQGQEK HRRQQSGDHH HGHRAWAEHR VSPVGQNLRR GQADQRAEQG HDQRQHRQPD RQVPAVPVAG RPGDGA
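The gene sequence begins with TTG rather than ATG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is an accepted alternative initiation codon and is read as methionine at the start position. The sketch below illustrates this on the first 30 bases of the gene sequence. The codon dictionary is deliberately partial (only the codons needed here), and `translate_prefix` and `gc_fraction` are hypothetical helper names, not functions from any library.

```python
# Partial codon table: only the codons needed for this illustration.
# A complete table 11 has 64 entries.
CODONS = {
    "GCC": "A", "CGA": "R", "TGT": "C", "GTC": "V",
    "TCG": "S", "GAA": "E", "CCG": "P", "GCG": "A",
    "TAG": "*",
}
# Subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a CDS prefix, reading an initial start codon as Met."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODONS[codon]  # raises KeyError for codons outside the partial table
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring the spaces between 10-base groups."""
    dna = dna.replace(" ", "").upper()
    return sum(base in "GC" for base in dna) / len(dna)


print(translate_prefix("TTGGCCCGAT GTGTCTCGGT CGAACCGGCG"))  # MARCVSVEPA
```

The printed peptide matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence is one way to compare against the reported GC content field.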