Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3848 |
Symbol | |
ID | 5707926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4381341 |
End bp | 4382180 |
Gene Length | 840 bp |
Protein Length | 279 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641273270 |
Product | helix-hairpin-helix repeat-containing competence protein ComEA |
Protein accession | YP_001538632 |
Protein GI | 159039379 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region [TIGR01259] comEA protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00294158 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCAGACG ACGAGGAGAC GGTGGTCCGG GACCGCCTGC ACCGGGTGCT ACCGGTAGAA GACGAGTTGG CCGGCGCCGG CCGGGCGGTG CCCGAGGCCC AGGCGGTGGT GACGCGACGC GAGGCGCTGA GCAGTCCGCG ACCCCACCCA CCTGACGGGT CGGACGACGG CGCGGCGTCA TCCGTTCCGC CGGCTGGCGA GCCGTCAAGC CGGGTGCTGC CAGGACCGGG AGCGTTCGAC CCGGGGCGGC GCGGTGTGCG GGCTCTGGCC GTCGTCGCCG TGCTGGTAGT GCTCGGGGCC GGCTTCTGGG CGTGGCAGTC CCGACCGCAG GTCGAGCAGG TCGCGCCGGT CGCGGAGGCG GGACCGGTCG CCCTGTCCGT GTCCGGTGGG CCGACGGCGA CGCCGGGTGG TGAGCTGGTG GTGGCGGTCG CCGGTAAGGT CCGCCGGCCG GGACTGGTCC GGGTGTCGGC GGGCGCGCGG GTCGCTGACG CGGTGCAGGC GGCCGGTGGG GCGCTGCCCG GCGTCGATGT GGCGCTGTTC AACCCAGCCC GAAAGGTAGT TGACGGGGAA CTCATTCTGG TCGGCGTCCC CACGCCGCCG GGAGCGGCCC CCGTGGCCGC CGGCGGGGAG GCGGCAGCGG CGACCGGAGG CAAGGTCAAC CTGAACACCG CCACCCTGGC GCAGCTCGAC ACGCTGCCCG GAGTCGGCCC GGTGCTGGCG CAGCGCATCC TCGCTCACCG CCAGCAGCAC GGCGGCTTCC GTTCGGTGAG CGACCTGCGC CAGGTCGGCG GTATCGGCGA TACCCGATAC GAGCAGCTCA AGGATCTGGT GACGGTGTGA
|
Protein sequence | MSDDEETVVR DRLHRVLPVE DELAGAGRAV PEAQAVVTRR EALSSPRPHP PDGSDDGAAS SVPPAGEPSS RVLPGPGAFD PGRRGVRALA VVAVLVVLGA GFWAWQSRPQ VEQVAPVAEA GPVALSVSGG PTATPGGELV VAVAGKVRRP GLVRVSAGAR VADAVQAAGG ALPGVDVALF NPARKVVDGE LILVGVPTPP GAAPVAAGGE AAAATGGKVN LNTATLAQLD TLPGVGPVLA QRILAHRQQH GGFRSVSDLR QVGGIGDTRY EQLKDLVTV
|
| |