Gene Information |
Locus tag | Sare_3847 |
Symbol | |
ID | 5707925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4378924 |
End bp | 4381344 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641273269 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_001538631 |
Protein GI | 159039378 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
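The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(4378924, 4381344)
print(length_bp)                  # 2421
print(protein_length(length_bp))  # 806
```

Both results agree with the Gene Length and Protein Length fields.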
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00472645 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGAGCGACG GGCAGCCGAT GCGTTCCGGT CGCCGGGCAG CCCCGCCGCG CAGGGGTGGC GCACCTGACC CACCGGATTT GCGGCTGGCC GGCTTGGCCG TTGCCGCGTG GCTCGCCGCG TTGGCCGGGC TGCATTTGCC CGCCAGTTCC TCTTTGCTTG TTGCCGCGAT CGCCGCCGGG CTGGCTGGAC TGGGCGGGCT GTACCTGCTT GGACTGCTGG GTCGTCCACT CGCATCTGTC CGTCCGTACG GCTGGACGGC CATCGCCATC TTGCTTGGCG TGGTCTGCGG GGCGAGCGTC ACCGCAGCTC GGGTGACCGT GCGGGATGCC ACACCGGTGC GCGCCCTGGT GGAGGCGCGG GCCACCGTCG CCGCCGACCT GGTTGTCCGA GACGACCCCC GGTTGGTGCG CACCGCTTCT GGGAGACCCG CCATGTTCCT GGTGGCAACG GAGTCGACCC GAGTCACCGG GCCCGGCGGG CGTCGGGTCG AGGCGCGGGC CCGGATGCTG GTCCTCGCCA CCGACCCAGC CTGGCGGTAC CTGCTGCCGG GGCAGCGACT GACCGCCGAG GGGCGGCTCG CCGCTCCGCG GGGCGGCGAC CTCACCGCCG CGGTCCTCTG GTCGACCCGG GCCCCGGTAC CCCACGGGCC GCCACCGGGC TTTCAGCACG CCGCCGGCAC GCTCCGCGCC GGGCTTCAGG AAGCCTGCGA ACCACTACCG GACGAGCAGG GCGGCCTGCT ACCCGGTCTG GTGGTGGGCG ATACGAGTCG GTTGCCCGAT GCGGTGCGGG AGGATTTCCT CGCCACGGGC ATGACCCACC TGACGGCGGT CTCCGGATCC AACGTCGCGA TCATCGTGGG CGCCGTGCTG CTTCTCGCCC GCTGGGGGCG GGCCGGTCCC TGGCTCGCCG CCGGGCTCAG TGTGGTCGCA CTGGCAGGAT TCGTGATCTT GGTTCGTCCG TCGCCGAGCG TCGTGCGGGC GGCCACCATG GGAGCGATCG GGCTCGCCGC GCTCGCCGTC GGACGGCCGC GTGCGGCGTT GCCGGCCCTG GCCGCGGCGG TCACCGCCCT CGTGCTGTTC GATCCCGAGC TTGCCGGGGA CGTCGGCTTC GCCCTTTCCG TCCTCGCCAC CGGCGGGTTG CTGCTGCTCG CCCCGCGCTG GCGGGACGCG TTGCGGCGCC GCCGGGTGCC TGCGGGGGTC GCCGAGGCAC TTGCCGTGCC CGCCGCCGCG CAACTCGCCT GCGCGCCGGT CGTCGCGGGG ATCTCGGGCA CGGTCAGCCT GGTCGCGGTC CCGGCGAACC TGTTGGCGGT GCCAGCGATC GCGCCCGCAA CGGTGCTCGG CGTCGTGGCG GCGGCGCTTT CGCCCCTCTG GCCGGCGGGC GCTGGATTTC TGGCCTGGCT GGCCAGTTGG CCGGCATGGT GGCTGGTCGC CGTGGCGCAT CACGGGGCAC GGGTGCCGGC GGGCGCACTA CCCTGGCCGG ACGGCGTCGC TGGCGCGCTG TTGCTGACCG GGTTGACTCT GGCCCTGCTG GTGGCTGCCC GCCGCCGAGT GGTGCGCCGA CTTGTGGCGG TGACCGCCGT GGCGGCCGTG CTCGGCGCGT TGCCGGTGCG GCTGGTGGCC TCCGGCTGGC CACCGGTGGG TTGGGTGGCC GTGGCATGCG CGGTCGGTCA GGGCGATGCG ATTGTCCTGT CCGCCGGTCC GGGGCGGGCC GTGGTGGTGG ACGCCGGGCC GGAGCCGGGG GTTGTGGACC GCTGCCTGCG TCGAATCGGT GTCCGGGAGG TGCCGCTGCT GATAGTCAGC 
CACTTCCATC ACGACCACGT TGGTGGGGTG GCGGGCGTGT TCCGGGGGCG GCGGGTCACG ACCGTGCTCG CTCCGCCGTG GCCGGAGCCG GAGCACGGTC GTGATCTGGT CCGTGTCACG GCCGCGGCGG GCTCCGCCGA TGTGATCTCC GCCCCGGCCG GCTGGGGCTA CCGAACCGGT GGAGTGGAGC TGACCGTCAT CGGCCCACCA ACTCCGCTGC GGGGTACCCG CTCCGACCCG AACAACAACT CGCTCGTCCT GCTGGCCACG GTCAGCGGGG TGCGGATCCT GCTCACCGGT GACGCCGAGG CCGAGGAACA GCGCGCCCTG CTCGACCGCC CACCGGCCGG CGGGCTCCGC GTGCACGTGC TGAAGGTCGC CCACCACGGC TCGGCATACC AGGACTCCGC CTTCCTCGAC GTGGTCCGCC CGCTGGTCGC GGTCGTCCCG GTTGGCCGAG ACAACACCTA CGGGCACCCG GCTGCGTCTG TGCTCGGTCG CCTTGCCCGC GGTGGGGCTC GCGTTCTGCG AACCGACGTC GATGGGGACG TGGCTGTGGT GACCCGGCCG TCCGGTCTGG CCGTCGTCAC GCGGGGGCCT GAGAGCCCGA GCGATCGTTA G
Protein sequence | MSDGQPMRSG RRAAPPRRGG APDPPDLRLA GLAVAAWLAA LAGLHLPASS SLLVAAIAAG LAGLGGLYLL GLLGRPLASV RPYGWTAIAI LLGVVCGASV TAARVTVRDA TPVRALVEAR ATVAADLVVR DDPRLVRTAS GRPAMFLVAT ESTRVTGPGG RRVEARARML VLATDPAWRY LLPGQRLTAE GRLAAPRGGD LTAAVLWSTR APVPHGPPPG FQHAAGTLRA GLQEACEPLP DEQGGLLPGL VVGDTSRLPD AVREDFLATG MTHLTAVSGS NVAIIVGAVL LLARWGRAGP WLAAGLSVVA LAGFVILVRP SPSVVRAATM GAIGLAALAV GRPRAALPAL AAAVTALVLF DPELAGDVGF ALSVLATGGL LLLAPRWRDA LRRRRVPAGV AEALAVPAAA QLACAPVVAG ISGTVSLVAV PANLLAVPAI APATVLGVVA AALSPLWPAG AGFLAWLASW PAWWLVAVAH HGARVPAGAL PWPDGVAGAL LLTGLTLALL VAARRRVVRR LVAVTAVAAV LGALPVRLVA SGWPPVGWVA VACAVGQGDA IVLSAGPGRA VVVDAGPEPG VVDRCLRRIG VREVPLLIVS HFHHDHVGGV AGVFRGRRVT TVLAPPWPEP EHGRDLVRVT AAAGSADVIS APAGWGYRTG GVELTVIGPP TPLRGTRSDP NNNSLVLLAT VSGVRILLTG DAEAEEQRAL LDRPPAGGLR VHVLKVAHHG SAYQDSAFLD VVRPLVAVVP VGRDNTYGHP AASVLGRLAR GGARVLRTDV DGDVAVVTRP SGLAVVTRGP ESPSDR
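The relationship between the gene sequence and the protein sequence can be illustrated with a minimal pure-Python sketch of translation under table 11. This is not the pipeline that produced the record; the function name `translate` and its `first_is_start` parameter are hypothetical. The codon table uses the standard NCBI TCAG ordering, and the set of start codons is the one listed for table 11, where an initiating GTG is read as methionine. The two fragments are the first 30 and last 21 bases of the gene sequence above.

```python
from itertools import product

# Standard amino acid assignments in NCBI "TCAG" codon order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")  # any valid start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


head = "GTGAGCGACG GGCAGCCGAT GCGTTCCGGT"  # first 30 bases of the gene sequence
tail = "GAGAGCCCGA GCGATCGTTA G"           # last 21 bases, ending in the TAG stop
print(translate(head))                        # MSDGQPMRSG
print(translate(tail, first_is_start=False))  # ESPSDR
```

The outputs match the first ten and last six residues of the protein sequence, and the leading GTG shows why the protein starts with M even though the gene does not start with ATG.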