Gene Sare_5049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_5049 
Symbol 
ID5705324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5715948 
End bp5719136 
Gene Length3189 bp 
Protein Length1062 aa 
Translation table11 
GC content71% 
IMG OID641274442 
ProductUvrD/REP helicase 
Protein accessionYP_001539783 
Protein GI159040530 
COG category[L] Replication, recombination and repair
[S] Function unknown 
COG ID[COG0210] Superfamily I DNA and RNA helicases
[COG1379] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00375] conserved hypothetical protein TIGR00375 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0107382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTCCGA TCAGCGCCGC ACCCCCCGGT GGCCTTTCGT CCTTCGTCGC AGACCTGCAC 
ATCCACTCGA AATACTCCCG GGCGTGCAGC CGCGACCTCA CCCTGCCGAA CCTCGCGTGG
TGGTCCCGGC GCAAGGGCAT CGCCGTGCTC GGTACCGGCG ATTTCACCCA CCCCGCCTGG
TACGACCATC TCCGGGAGAC CCTTCACCCG GCAGAGCCGG GTCTGTACCG GCTGTCACCA
GATGCCGAAC GGGACATCGC GCGCCGGCTG CCCCCGCGCC TCGCCAGCGA GGCGGAGAAC
GCCGCGGTGC GGTTCATGCT CAGCGTGGAG ATCTCCACGA TCTACAAACG CGACGACCGT
ACGCGCAAGG TGCACCACCT GGTCTACCTG CCGGACCTGG ACGCGGTGGC CCGGTTCAAC
GCCGCGCTCG GACGGATCGG CAACCTCGGC TCGGACGGCC GGCCGATCCT CGGTCTGGAC
TCGCGCGACC TACTGGAGAT CACGCTCGAG GCGAGCCCGG ACGGTTACCT TGTGCCAGCG
CACATCTGGA CACCGTGGTT CTCTGCCCTC GGCTCCAAGT CCGGCTTCGA CGCGATCGCC
GACTGCTACG CCGACCTGGC CGGGCACATC TTCGCGGTGG AGACCGGCCT TTCCTCGGAC
CCGGAGATGA ACTGGCGGGT CGGCAGCCTC GACCACTATC GACTGGTGTC GAACTCCGAT
GCCCACTCCC CGCCCGCGCT GGCTCGGGAG GCCACGGTCT TCGCCTGCGC CCGCGACTAC
TTCGCCATTC GGGAGGCGCT GCGGACCGGT GACGGCCTGG CCGGAACGAT CGAGTTCTTC
CCGGAGGAGG GGAAGTACCA CGCGGACGGT CACCGACTCT GCGGCGTTAA CTGGGCACCG
GAGCAGACCC GGGCGACCGG TGGACGCTGC CCCGTGTGCG GCAAGCCGCT GACCGTCGGT
GTCCTCAACC GGGTCGAGGA GCTGGCCGAC CGGCCCCCGG GGCACCGGCC GTCACACGCC
CGCGACGTCA CTCACCTGGT GCCGCTGGCC GAGATCCTTG GTGAGATCAA CACCGTGGGC
GCGCGGTCGA AGAAGGTCGG GGGCAAGCTC AACGACCTGG TCGCCGCGCT CGGTCCGGAG
CTGGAGATCC TCACCCGGAC ACCGCTCGAG GACATTGGCC GGGCCGGCGG CGAGCTGCTT
GCCGAGGGCA TCGGCCGGCT GCGCCGGGGT GCGGTCCACA GGGTCCCGGG TTTCGATGGC
GAGTACGGCG TCATCACCCT CTTCGACCCG ACCGAACTCC GGCCCGGCGG GGGCGCGGCG
GCGCAGGAGA CACTCTTCGA CGTGCCTGTC GTGCCGGCGC AGCGGCGGCC CGTGGAGCCG
GCACCGAAGC CGAAGGCCCG CCGCCCGGCC GCGAAGCCGG AACCAAAGCG GGCCGGCCCA
CCGCCACCGC CGATCGCACC ACGGCCGTCG CCGCATGAGC CGCTCGAGCC GATGCTCGCC
GGCATGGAGG AGGTCGGTAC GGGCCTGCTC GACCGGCTGG ACGCGATGCA GCGGGTGGCC
GCGTCCGCGC CCGGCGGGCC GCTGCTGATC GTGGCTGGTC CGGGCACCGG CAAGACCCGC
ACGCTGACGC ACCGGATCGC GTACCTGTGT GCGGAGCTGA ACGTCTTCCC GGAGCACTGC
CTGGCGATCA CCTTCACTCG ACGGGCCGCC GAGGAGCTGC GGCACCGGCT CGACGGGTTA
CTCGGGCCGG TCGCCGAGGA CGTCACCGTG GGCACGTTCC ACTCCGTCGG CCTACAGATC
CTGCGGGAGA ACCGCGAGGC CGCCGGTCTC CCTGTGGACT TCCGGATCGC CGACGACGCG
GACCGCACCG CCGCCCGAGC AGAGGCCGGT GACGACCACG ACGCCTACCT GGCGCTGCTG
CGCAAGCAGG ACCTGGTCGA CCTGGACGAA CTGCTCACCC TGCCGCTGGC GCTACTGCGA
GCCGATCAAC GGCTGGCTGA CTCCTACCGT GACCGGTGGC GATGGATCTT CGTTGACGAG
TACCAGGACG TCGACGAGGT GCAGTACGAG CTACTCCGAC TGCTGAGCCC CGCTGACGGA
AACCTCTGCG CGATCGGTGA CCCGGACCAG GCGATCTACT CCTTCCGGGG CGCCGATGTC
GGATACTTCC TGCGCTTCTC GCAGGACTTC ACCGACGCCC GGCTGGTACG GCTGAACCGC
AACTACCGCT CGTCGGCGCC GATCCTGGCC GCCGCGGTAC AGGCGATCGC ACCTTCGTCG
TTGGTCCCTG GCCGCCGGCT GGATCCGGCC CGGCTCGACC CGGAGGCCCC ACTGATCGGC
CGGTACGCGG CCTCTTCCGT CGCGGACGAG GCCAGCTTCG TCGTCCGGAC GATCGACGAG
TTGGTCGGCG GGCTCTCGCA CCGCTCGCTG GACTCGGGTC GAATCGACGG CCGTACCACT
GCGCTCTCGT TCTCCGACGT TGCGGTGCTC TACCGCACCG ACGCGCAGGC CGCGCCGGTC
GTCGACGCGC TCTCCCGAGC GAACATCCCG GTACAGAAAC GCTCCCACGA CCGGCTGCGG
GACCGACCCG GTGTCGCCGA CATCGCCCGC GAACTGCGGC ACACCAGCGG CCTCGAGGAC
GGCCTACCGG CCCGGGTTCG GCTCGCCGGG CAGGTGGTCT CCGAACGGTT CGCCATCCCC
ACCCTCGATG GTTCGGGCGC GGTCCGTCCG GAGGACGTGC GCGTGGCGGT CGACCTCCTC
ACCCCACTCG CTCGGCGCTG CGGTGACGAC CTGGAACTTT TTCTGTCCCA GCTGGCGACG
GGCGCCGAGG TGGACGCGCT CGATCCGCGC GCGCAGGCGG TCACCCTGCT CACCCTGCAT
GCCGCCAAAG GGCTGGAGTT CCCGGTGGTC TTTCTGGTCG GCGCCGAGGA CAGGCTGCTG
CCGCTGCGCT GGCCCGGCAG TGAGCCGGAC GACGACGCGG TCGCTGAGGA GCGGCGGCTC
TTCTTCGTCG GGGTGACCCG CGCGCAGGAC CGTCTCTACG TCAGCCACGC CGCCCGGCGG
GTCCGACACG GCACCGAATA CGACTGTCGC CCGTCGCCGT TCCTGTCCAC AGTCGATCCG
GGCCTCTTCG AGCACCTCGG TGACCTGGAG CCCCGCCGTC CCAAGGACCG TCAGCTCCGC
TTGATCTGA
 
Protein sequence
MPPISAAPPG GLSSFVADLH IHSKYSRACS RDLTLPNLAW WSRRKGIAVL GTGDFTHPAW 
YDHLRETLHP AEPGLYRLSP DAERDIARRL PPRLASEAEN AAVRFMLSVE ISTIYKRDDR
TRKVHHLVYL PDLDAVARFN AALGRIGNLG SDGRPILGLD SRDLLEITLE ASPDGYLVPA
HIWTPWFSAL GSKSGFDAIA DCYADLAGHI FAVETGLSSD PEMNWRVGSL DHYRLVSNSD
AHSPPALARE ATVFACARDY FAIREALRTG DGLAGTIEFF PEEGKYHADG HRLCGVNWAP
EQTRATGGRC PVCGKPLTVG VLNRVEELAD RPPGHRPSHA RDVTHLVPLA EILGEINTVG
ARSKKVGGKL NDLVAALGPE LEILTRTPLE DIGRAGGELL AEGIGRLRRG AVHRVPGFDG
EYGVITLFDP TELRPGGGAA AQETLFDVPV VPAQRRPVEP APKPKARRPA AKPEPKRAGP
PPPPIAPRPS PHEPLEPMLA GMEEVGTGLL DRLDAMQRVA ASAPGGPLLI VAGPGTGKTR
TLTHRIAYLC AELNVFPEHC LAITFTRRAA EELRHRLDGL LGPVAEDVTV GTFHSVGLQI
LRENREAAGL PVDFRIADDA DRTAARAEAG DDHDAYLALL RKQDLVDLDE LLTLPLALLR
ADQRLADSYR DRWRWIFVDE YQDVDEVQYE LLRLLSPADG NLCAIGDPDQ AIYSFRGADV
GYFLRFSQDF TDARLVRLNR NYRSSAPILA AAVQAIAPSS LVPGRRLDPA RLDPEAPLIG
RYAASSVADE ASFVVRTIDE LVGGLSHRSL DSGRIDGRTT ALSFSDVAVL YRTDAQAAPV
VDALSRANIP VQKRSHDRLR DRPGVADIAR ELRHTSGLED GLPARVRLAG QVVSERFAIP
TLDGSGAVRP EDVRVAVDLL TPLARRCGDD LELFLSQLAT GAEVDALDPR AQAVTLLTLH
AAKGLEFPVV FLVGAEDRLL PLRWPGSEPD DDAVAEERRL FFVGVTRAQD RLYVSHAARR
VRHGTEYDCR PSPFLSTVDP GLFEHLGDLE PRRPKDRQLR LI