Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_5049 |
Symbol | |
ID | 5705324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5715948 |
End bp | 5719136 |
Gene Length | 3189 bp |
Protein Length | 1062 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274442 |
Product | UvrD/REP helicase |
Protein accession | YP_001539783 |
Protein GI | 159040530 |
COG category | [L] Replication, recombination and repair [S] Function unknown |
COG ID | [COG0210] Superfamily I DNA and RNA helicases [COG1379] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00375] conserved hypothetical protein TIGR00375 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0107382 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCTCCGA TCAGCGCCGC ACCCCCCGGT GGCCTTTCGT CCTTCGTCGC AGACCTGCAC ATCCACTCGA AATACTCCCG GGCGTGCAGC CGCGACCTCA CCCTGCCGAA CCTCGCGTGG TGGTCCCGGC GCAAGGGCAT CGCCGTGCTC GGTACCGGCG ATTTCACCCA CCCCGCCTGG TACGACCATC TCCGGGAGAC CCTTCACCCG GCAGAGCCGG GTCTGTACCG GCTGTCACCA GATGCCGAAC GGGACATCGC GCGCCGGCTG CCCCCGCGCC TCGCCAGCGA GGCGGAGAAC GCCGCGGTGC GGTTCATGCT CAGCGTGGAG ATCTCCACGA TCTACAAACG CGACGACCGT ACGCGCAAGG TGCACCACCT GGTCTACCTG CCGGACCTGG ACGCGGTGGC CCGGTTCAAC GCCGCGCTCG GACGGATCGG CAACCTCGGC TCGGACGGCC GGCCGATCCT CGGTCTGGAC TCGCGCGACC TACTGGAGAT CACGCTCGAG GCGAGCCCGG ACGGTTACCT TGTGCCAGCG CACATCTGGA CACCGTGGTT CTCTGCCCTC GGCTCCAAGT CCGGCTTCGA CGCGATCGCC GACTGCTACG CCGACCTGGC CGGGCACATC TTCGCGGTGG AGACCGGCCT TTCCTCGGAC CCGGAGATGA ACTGGCGGGT CGGCAGCCTC GACCACTATC GACTGGTGTC GAACTCCGAT GCCCACTCCC CGCCCGCGCT GGCTCGGGAG GCCACGGTCT TCGCCTGCGC CCGCGACTAC TTCGCCATTC GGGAGGCGCT GCGGACCGGT GACGGCCTGG CCGGAACGAT CGAGTTCTTC CCGGAGGAGG GGAAGTACCA CGCGGACGGT CACCGACTCT GCGGCGTTAA CTGGGCACCG GAGCAGACCC GGGCGACCGG TGGACGCTGC CCCGTGTGCG GCAAGCCGCT GACCGTCGGT GTCCTCAACC GGGTCGAGGA GCTGGCCGAC CGGCCCCCGG GGCACCGGCC GTCACACGCC CGCGACGTCA CTCACCTGGT GCCGCTGGCC GAGATCCTTG GTGAGATCAA CACCGTGGGC GCGCGGTCGA AGAAGGTCGG GGGCAAGCTC AACGACCTGG TCGCCGCGCT CGGTCCGGAG CTGGAGATCC TCACCCGGAC ACCGCTCGAG GACATTGGCC GGGCCGGCGG CGAGCTGCTT GCCGAGGGCA TCGGCCGGCT GCGCCGGGGT GCGGTCCACA GGGTCCCGGG TTTCGATGGC GAGTACGGCG TCATCACCCT CTTCGACCCG ACCGAACTCC GGCCCGGCGG GGGCGCGGCG GCGCAGGAGA CACTCTTCGA CGTGCCTGTC GTGCCGGCGC AGCGGCGGCC CGTGGAGCCG GCACCGAAGC CGAAGGCCCG CCGCCCGGCC GCGAAGCCGG AACCAAAGCG GGCCGGCCCA CCGCCACCGC CGATCGCACC ACGGCCGTCG CCGCATGAGC CGCTCGAGCC GATGCTCGCC GGCATGGAGG AGGTCGGTAC GGGCCTGCTC GACCGGCTGG ACGCGATGCA GCGGGTGGCC GCGTCCGCGC CCGGCGGGCC GCTGCTGATC GTGGCTGGTC CGGGCACCGG CAAGACCCGC ACGCTGACGC ACCGGATCGC GTACCTGTGT GCGGAGCTGA ACGTCTTCCC GGAGCACTGC CTGGCGATCA CCTTCACTCG ACGGGCCGCC GAGGAGCTGC GGCACCGGCT CGACGGGTTA CTCGGGCCGG TCGCCGAGGA CGTCACCGTG GGCACGTTCC ACTCCGTCGG CCTACAGATC CTGCGGGAGA ACCGCGAGGC CGCCGGTCTC CCTGTGGACT TCCGGATCGC CGACGACGCG GACCGCACCG CCGCCCGAGC AGAGGCCGGT GACGACCACG ACGCCTACCT GGCGCTGCTG CGCAAGCAGG ACCTGGTCGA CCTGGACGAA CTGCTCACCC TGCCGCTGGC GCTACTGCGA GCCGATCAAC GGCTGGCTGA CTCCTACCGT GACCGGTGGC GATGGATCTT CGTTGACGAG TACCAGGACG TCGACGAGGT GCAGTACGAG CTACTCCGAC TGCTGAGCCC CGCTGACGGA AACCTCTGCG CGATCGGTGA CCCGGACCAG GCGATCTACT CCTTCCGGGG CGCCGATGTC GGATACTTCC TGCGCTTCTC GCAGGACTTC ACCGACGCCC GGCTGGTACG GCTGAACCGC AACTACCGCT CGTCGGCGCC GATCCTGGCC GCCGCGGTAC AGGCGATCGC ACCTTCGTCG TTGGTCCCTG GCCGCCGGCT GGATCCGGCC CGGCTCGACC CGGAGGCCCC ACTGATCGGC CGGTACGCGG CCTCTTCCGT CGCGGACGAG GCCAGCTTCG TCGTCCGGAC GATCGACGAG TTGGTCGGCG GGCTCTCGCA CCGCTCGCTG GACTCGGGTC GAATCGACGG CCGTACCACT GCGCTCTCGT TCTCCGACGT TGCGGTGCTC TACCGCACCG ACGCGCAGGC CGCGCCGGTC GTCGACGCGC TCTCCCGAGC GAACATCCCG GTACAGAAAC GCTCCCACGA CCGGCTGCGG GACCGACCCG GTGTCGCCGA CATCGCCCGC GAACTGCGGC ACACCAGCGG CCTCGAGGAC GGCCTACCGG CCCGGGTTCG GCTCGCCGGG CAGGTGGTCT CCGAACGGTT CGCCATCCCC ACCCTCGATG GTTCGGGCGC GGTCCGTCCG GAGGACGTGC GCGTGGCGGT CGACCTCCTC ACCCCACTCG CTCGGCGCTG CGGTGACGAC CTGGAACTTT TTCTGTCCCA GCTGGCGACG GGCGCCGAGG TGGACGCGCT CGATCCGCGC GCGCAGGCGG TCACCCTGCT CACCCTGCAT GCCGCCAAAG GGCTGGAGTT CCCGGTGGTC TTTCTGGTCG GCGCCGAGGA CAGGCTGCTG CCGCTGCGCT GGCCCGGCAG TGAGCCGGAC GACGACGCGG TCGCTGAGGA GCGGCGGCTC TTCTTCGTCG GGGTGACCCG CGCGCAGGAC CGTCTCTACG TCAGCCACGC CGCCCGGCGG GTCCGACACG GCACCGAATA CGACTGTCGC CCGTCGCCGT TCCTGTCCAC AGTCGATCCG GGCCTCTTCG AGCACCTCGG TGACCTGGAG CCCCGCCGTC CCAAGGACCG TCAGCTCCGC TTGATCTGA
|
Protein sequence | MPPISAAPPG GLSSFVADLH IHSKYSRACS RDLTLPNLAW WSRRKGIAVL GTGDFTHPAW YDHLRETLHP AEPGLYRLSP DAERDIARRL PPRLASEAEN AAVRFMLSVE ISTIYKRDDR TRKVHHLVYL PDLDAVARFN AALGRIGNLG SDGRPILGLD SRDLLEITLE ASPDGYLVPA HIWTPWFSAL GSKSGFDAIA DCYADLAGHI FAVETGLSSD PEMNWRVGSL DHYRLVSNSD AHSPPALARE ATVFACARDY FAIREALRTG DGLAGTIEFF PEEGKYHADG HRLCGVNWAP EQTRATGGRC PVCGKPLTVG VLNRVEELAD RPPGHRPSHA RDVTHLVPLA EILGEINTVG ARSKKVGGKL NDLVAALGPE LEILTRTPLE DIGRAGGELL AEGIGRLRRG AVHRVPGFDG EYGVITLFDP TELRPGGGAA AQETLFDVPV VPAQRRPVEP APKPKARRPA AKPEPKRAGP PPPPIAPRPS PHEPLEPMLA GMEEVGTGLL DRLDAMQRVA ASAPGGPLLI VAGPGTGKTR TLTHRIAYLC AELNVFPEHC LAITFTRRAA EELRHRLDGL LGPVAEDVTV GTFHSVGLQI LRENREAAGL PVDFRIADDA DRTAARAEAG DDHDAYLALL RKQDLVDLDE LLTLPLALLR ADQRLADSYR DRWRWIFVDE YQDVDEVQYE LLRLLSPADG NLCAIGDPDQ AIYSFRGADV GYFLRFSQDF TDARLVRLNR NYRSSAPILA AAVQAIAPSS LVPGRRLDPA RLDPEAPLIG RYAASSVADE ASFVVRTIDE LVGGLSHRSL DSGRIDGRTT ALSFSDVAVL YRTDAQAAPV VDALSRANIP VQKRSHDRLR DRPGVADIAR ELRHTSGLED GLPARVRLAG QVVSERFAIP TLDGSGAVRP EDVRVAVDLL TPLARRCGDD LELFLSQLAT GAEVDALDPR AQAVTLLTLH AAKGLEFPVV FLVGAEDRLL PLRWPGSEPD DDAVAEERRL FFVGVTRAQD RLYVSHAARR VRHGTEYDCR PSPFLSTVDP GLFEHLGDLE PRRPKDRQLR LI
|
| |