Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1703 |
Symbol | |
ID | 5704014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1968167 |
End bp | 1970560 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271206 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_001536581 |
Protein GI | 159037328 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.247477 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00353536 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCCTCT CTGTTCATCA GCGGATCGCC GAGGAACTCG GCGTCGCCGA GCGCCAGGTA CGCGCAGCCG TGGAACTACT CGACGGCGGC GCGACCGTGC CGTTCATCGC CCGCTACCGC AAGGAGGCCA CCGGCCTGCT CGACGACACC CAGCTGCGCA CCCTCGAGGA GCGGATGCGC TACCTGCGCG AGTTGGACCA GCGCCGGACT GCGGTCCTGG AGTCGATCCG GGGCCAGGGC AAGCTCGACG AGACCCTGAC GGCACAGATC ATGGCAGCCG ACTCGAAGTC TCGGCTGGAG GACATCTATC TGCCGTACAA GCCGAAGCGG CGGACCCGGG CACAGATCGC GCGCGAGGCT GGACTGGAGC CACTCGCCGA CACACTGCTC GACGATCCCG CCCAAGACCC ACGCGCGACG GCCGTCAGGT TCGTCGACCC GGACCGGGGC ATCGCCGACC CGTCCGCCGC ACTGGACGGT GCCCGCGCCA TCCTCGTCGA ACGGTTCGCC GAGGACGCCG ACCTGATCGG CACGCTACGC GAGCAGATGT GGTCACGGGG CCGGCTGGTG TCCCGGGTAC GCGATGGTCA GGCCACGGCC GGCGCCAAGT TCGCCGACTA CTTCGACTTC GCCGAGCCGT ACCCGAAACT GCCCTCGCAC CGGGTCCTCG CCGTGTTCCG GGGGGAGAAG GAGGGTGTGC TCGACCTGAC CATGGAGCCG GAGCAGCAGG AGAACCCGGA TCCAGCGACC ACCGGTCCGA CCCGGTACGA GGCGGCCGTC GCCGCCCGGT TCGGGGTCAG TGACCGGGGA CGGCCGGCCG ACCGGTGGCT CTCCGACACG GTGCGCTGGG CCTGGCGTAC CCGAATCCTG ATCCACCTCG GCGCGGACCT TCGCATGCGG TTGTGGCAGG CCGCCGAGCA GGAAGCGGTG CGGGTCTTCG CCACGAACCT GCGGGACCTG CTGCTGGCCG CCCCGGCCGG GGCCCGGACG ACGATGGGCC TGGATCCCGG CCTGCGCACC GGGGTGAAGG TCGCCGTCGT TGACGCGACG GGCAAGGTGG TCGCCACCGA CACCATCTAC CCGCACGAGC CGCGCCGGCA GTGGGACGCC TCGATCGAGA CCCTCGCCCG TCTCGCCACC GCGCACCAGG TCGAGTTGGT CGCGATTGGT AACGGCACCG CGAGCCGGGA GACCGACCGA CTCGCCGCAG AGCTGATCCG GCGCCACCCA CAGCTGAACC TCACCAAGCT CGTCGTGTCC GAAGCCGGCG CTTCGGTCTA CTCAGCGTCC GCGTACGCCG CGCAGGAGCT GCCGGGCCTG GACGTGTCGC TGCGGGGGGC GGTCTCCATC GCCCGTCGCC TCCAGGACCC ACTCGCCGAA CTGGTCAAGA TCGATCCCCG GTCCATCGGA GTCGGGCAGT ACCAACACGA CCTGTCCGAG GTGACGTTGT CCCGGTCGCT CGACGCGGTG GTCGAGGACT GCGTCAACGC GGTCGGCGTC GACGTCAACA CCGCCTCCGC GCCACTGCTG ACCCGGGTCT CCGGCATCGG TGCCGGACTG GCGGAGAACA TCGTGCTGCA CCGGGACGCC AACGGGCCCT TCCGAACCCG GGGCGACCTG CGACGGGTAC CCCGGCTTGG TCCGAAGGCA TTCGAGCAGT GCGCGGGCTT CCTGCGCATC CCCGACGGTG CCGACCCGCT GGACTCGTCG AGCGTGCACC CGGAGGCGTA CCCGGTGGTG CGGCGGATCC TCGCCGCCAC GAAGCAGGAA CTGCGGATGG TGATCGGCCG CAGCGCGGTC CTGCGCGGGC TGCGGGCCGC CGATTTCGTC GACGAGACCT TCGGGCTGCC GACGGTCACC GACATCCTCG CCGAGTTGGA GAAACCCGGC CGGGATCCGC GGCCGGAATT CCGCACCGCC ACGTTCACCG AGGGCGTGGA GACGATCACC GACCTGGTGC CCGGGCTGAT CCTCGAGGGC GTGGTCACCA ACGTCGCCGC CTTCGGCGCG TTCGTGGACG TCGGCGTGCA TCAGGATGGC CTGGTACATG TCTCGGCGAT GTCCCGCGCC TTCGTTCGCG ACCCTCGCGA GGTGGTGAAG TCCGGTGACG TGGTGAAGGT CAAGGTCCTC GACGTGGACG TGCCACGCAA GCGCATCTCG CTGACCCTTC GACTGAACGA TACCGAGGCC GGTCGCGGCG GAGCGCATGG CCAGCGGGAC CGCGGTGGCG ACCGGGAGGC CAACCGCGGC GAGTCCCGAG GCCGCGGTGG GCAGCAGGCC CGCGGTGGGC AGCAGGCCCG CGGTGGGCAG CCGCAACCCA GACGTGGCGG CGCCACGCCG CCCCCGGCCA ACGACGCGAT GGCCGATGCC CTGCGTCGCG CCGGCCTCGC CTGA
|
Protein sequence | MTLSVHQRIA EELGVAERQV RAAVELLDGG ATVPFIARYR KEATGLLDDT QLRTLEERMR YLRELDQRRT AVLESIRGQG KLDETLTAQI MAADSKSRLE DIYLPYKPKR RTRAQIAREA GLEPLADTLL DDPAQDPRAT AVRFVDPDRG IADPSAALDG ARAILVERFA EDADLIGTLR EQMWSRGRLV SRVRDGQATA GAKFADYFDF AEPYPKLPSH RVLAVFRGEK EGVLDLTMEP EQQENPDPAT TGPTRYEAAV AARFGVSDRG RPADRWLSDT VRWAWRTRIL IHLGADLRMR LWQAAEQEAV RVFATNLRDL LLAAPAGART TMGLDPGLRT GVKVAVVDAT GKVVATDTIY PHEPRRQWDA SIETLARLAT AHQVELVAIG NGTASRETDR LAAELIRRHP QLNLTKLVVS EAGASVYSAS AYAAQELPGL DVSLRGAVSI ARRLQDPLAE LVKIDPRSIG VGQYQHDLSE VTLSRSLDAV VEDCVNAVGV DVNTASAPLL TRVSGIGAGL AENIVLHRDA NGPFRTRGDL RRVPRLGPKA FEQCAGFLRI PDGADPLDSS SVHPEAYPVV RRILAATKQE LRMVIGRSAV LRGLRAADFV DETFGLPTVT DILAELEKPG RDPRPEFRTA TFTEGVETIT DLVPGLILEG VVTNVAAFGA FVDVGVHQDG LVHVSAMSRA FVRDPREVVK SGDVVKVKVL DVDVPRKRIS LTLRLNDTEA GRGGAHGQRD RGGDREANRG ESRGRGGQQA RGGQQARGGQ PQPRRGGATP PPANDAMADA LRRAGLA
|
| |