Gene Information |
Locus tag | Strop_3469 |
Symbol | |
ID | 5059938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 3977634 |
End bp | 3978476 |
Gene Length | 843 bp |
Protein Length | 280 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640475718 |
Product | helix-hairpin-helix repeat-containing competence protein ComEA |
Protein accession | YP_001160278 |
Protein GI | 145595981 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region; [TIGR01259] comEA protein |
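The coordinate and length fields in the record above agree with one another, and a short sketch can confirm this. It assumes 1-based inclusive coordinates (length is end minus start plus one) and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API, and the string passed to `reverse_complement` is an illustrative fragment rather than data read from the replicon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def reverse_complement(seq: str) -> str:
    """Reverse complement, as needed to read a minus-strand feature 5' to 3'."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


length_bp = gene_length(3977634, 3978476)
print(length_bp, protein_length(length_bp))  # prints: 843 280
```

Because the strand is listed as "-", a coding-orientation sequence would be obtained by applying `reverse_complement` to the forward-strand slice between the start and end coordinates.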
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.389617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCCATACG ACGAGGAGAA GGTGGTCCGG GACCGCCTGC ACCGAGTGCT GCCAGCGGGC GAGCTGTCCG GCCCCGGCCT GCCGGTGGCG GAGGCCCCGG CGGTGCTGAC GCGACCGGAG GCGCCGGCCG GTCCGCTACG CACCCCGCCG GACGGGGCGG AGGAGGAACC GGTTTCGTCC ACCCAGCCGG ATGGAGAACG GTCGAGTCGG GTGTTGCCGG GGCCGGGCGC GTTCGATCCG GGGCGGCGCG GGGTACGGGC ACTGGCCGTC GTCGCCGTGC TGGTGGTGCT CGGGGCGGGC TTCTGGGCGT GGCAGTCCCG GCCACAGATC GAGCCGGTCG CGCCGGTCGC AGAGGTGACG CCGGTCGGCC CGCCCGCCTC CGTCGGCCCG ACGGAGGCGG GCGGTGAGCT GGTGGTGGCG GTCGCCGGTA AGGTCCGCCG GCCGGGACTT GTCCGGGTGC CGGCCGGTGC CCGGGTCGCC GACGCGGTGC AGGCGGCCGG TGGGGCGCTG CCCGGAGTCG ATGTCGCTCT GTTCAATCCG GCCCGGAAGG TAACCGATGG GGAACTCATC CTGATCGGTG TCACCGCGCC ACCGGGGGCA GCCCCCGCTG GCGGCGTCGC CCCCGGTGGG GAGGCGGGAG TGGGGCCCGG AGGCAAGGTC AACCTCAACA CCGCCAGCCT GGCGCAGCTC GACACGTTGC CCGGAGTCGG CCCGGTGCTC GCGCAGCGCA TCCTCGACCA CCGTCAACAG CACGGCGGCT TCCGATCGGT CAGCGACCTG CGCCAGGTCG GTGGCATCGG CGACACCCGG TACGAGCAGC TCAAGGACTT GGTGACGGTG TGA
Protein sequence | MPYDEEKVVR DRLHRVLPAG ELSGPGLPVA EAPAVLTRPE APAGPLRTPP DGAEEEPVSS TQPDGERSSR VLPGPGAFDP GRRGVRALAV VAVLVVLGAG FWAWQSRPQI EPVAPVAEVT PVGPPASVGP TEAGGELVVA VAGKVRRPGL VRVPAGARVA DAVQAAGGAL PGVDVALFNP ARKVTDGELI LIGVTAPPGA APAGGVAPGG EAGVGPGGKV NLNTASLAQL DTLPGVGPVL AQRILDHRQQ HGGFRSVSDL RQVGGIGDTR YEQLKDLVTV
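The relationship between the gene sequence, translation table 11, and the protein sequence can be illustrated with a minimal sketch. It builds the codon table from the standard amino acid ordering, which table 11 shares with the standard code, and treats an ATG, GTG, or TTG at the first position as methionine. That start codon set is a simplifying assumption covering the common bacterial initiators, not the complete table 11 list. All names here are hypothetical helpers.

```python
import itertools

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (same as the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(itertools.product(BASES, repeat=3), AMINO)
}
# Assumption: only the most common bacterial initiator codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "GTGCCATACG ACGAGGAGAA GGTGGTCCGG"
print(translate(first_blocks))  # prints: MPYDEEKVVR
```

The output matches the first block of the listed protein sequence, including the GTG start codon being read as M. Running `gc_fraction` on the full gene sequence would give a value to compare against the GC content field.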