Gene Sare_3337 details


Gene Information

Locus tag: Sare_3337
Symbol: uvrC
ID: 5708292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3849576
End bp: 3851558
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 70%
IMG OID: 641272764
Product: excinuclease ABC subunit C
Protein accession: YP_001538131
Protein GI: 159038878
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
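
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one stop codon at the end."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3849576, 3851558)
print(length_bp)                           # 1983
print(expected_protein_length(length_bp))  # 660
```

Both results agree with the Gene Length and Protein Length fields listed in the record.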


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0998705
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00117638
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCTGATC CCTCGTCCTA CCGTCCCGCG CCCGGCACGA TCCCGGAATC ACCGGGGGTC 
TACCGTTTTC GCGACGGCAC CGGCCGGGTG ATCTACGTTG GCAAGGCGCG AAACCTGCGC
AGCCGGCTGA ACTCGTACTT CGCTGATCCG GTCAATCTGC ACCAGCGCAC CCGGCAGATG
GTGTTCACCG CCGAATCGGT GGACTGGATC TCCGTCGCCA CCGAGGTCGA GGCGCTCCAG
CAGGAGTTCA CCGAGATCAA GCAGTACGAC CCGAGGTTCA ACGTCCGCTA CCGGGACGAC
AAGTCCTACC CGTACCTGGC GGTCACCGTC GACGAGGAGT TTCCCCGCCT CCAGGTCATG
CGCGGCGCGA AGCGTAGGGG CGTGCGTTAC TTCGGGCCGT ACTCGCATGC CTGGGCGATC
CGCGAGACGC TCGACCTGCT GCTTCGGGTC TTTCCGGCAC GCACCTGCTC GTCGGGAGTG
TTCAAACGAG CCGGTCAGGT CGGCCGCCCC TGCCTGCTGG GCTACATCGG CAAGTGCTCC
GCGCCCTGCG TCGGCAGTGT CTCCGCCGAG GAACACCGCG ACATCGTCAA CGGCTTCTGC
GACTTCATGG CCGGCCGGAC CGATGCCATG GTCCGCCGGT TGGAGCGGGA GATGGCCGAG
GCCAGCGCGG AGCTGGAGTT CGAGCGGGCC GCCCGGCTCC GCGACGACCT GGCCGCCCTA
CGCCGGGCGA TGGAGAAGCA GACCGTGGTG TTCGGCGACG GCACCGACGC CGACGTGGTC
GCCTTCGCCG ACGACCCGCT CGAAGCGGCC GTGCAGGTGT TCCACGTGCG TGACGGTCGA
ATCCGGGGCC AGCGCGGCTG GGTGGTCGAG AAAACCGAGG ACCTGACCGC CGGCGACCTC
GTCCACCACT TCTGCACCCA GGTGTATGGC GGGGAACACG GTGAGGCGCA CGTCCCCCGG
GAACTGCTCG TGCCCGAGTT GCCCGCGGAC GTCGAGGCGC TCGCCGACTG GCTCTCCGAG
CATCGTGGCA GCCGGGTCAC CCTGCGGGTG CCGCAGCGCG GCGACAAGCG TGCCCTGCTG
GAGACGGTCG CGCGTAACGC CACGGACGCC TTGGCCCGGC ACAAGCTCAA GCGCGCCGGT
GATCTGACCA CCCGGAGCAA GGCCCTCGAC GAGATCGCCG ACACGTTGGG CATGCGGACG
GCACCGCTGC GGATCGAGTG CTTCGACATC TCCCAGATCC AGGGCACCGA TGTGGTGGCC
AGCATGGTGG TCTTCGAGGA CGGCCTGCCT CGCAAGAGCG AGTACCGGCG GTTCATCATC
CGGGGCGCCA CCGACGACCT GTCGGCGATG TCGGAGGTAC TGCGGCGGCG TTTCGCCCGC
TACCTGGACG CCCGGGCGGA AACGGGGGAG GCTGGCGTCG AGTCGGCCGG CGACCCGGAC
GCCCCGGCCG GACCGGATGC GCCTGACGAG CCGCGGGTCG GCACCTTGGT CGACCCGACG
ACGGGTCGAC CGCGCAAGTT CGCCTATCCG CCGCAACTGG TGGTGGTCGA CGGGGGCGCG
CCGCAGGTCG CGGCGGCGGC GCAGGCCCTC GCCGAGTTGG GCGTCGACGA CGTGGCCCTG
TGCGGTCTGG CGAAGCGACT GGAGGAGGTG TGGCTGCCCG ACGACGATTT CCCCGCCATT
TTGCCCCGCA CATCCGAGGG CCTCTATTTG CTGCAACGCG TGCGTGACGA GGCGCACCGG
TTCGCCATCA CGTTCCACCG GCAGCGGCGT TCCCGCCGGA TGACCGAGTC GGCGTTGGAT
CGGGTGTCCG GGCTGGGTGA GGTGCGGCGC AAGGCGCTGC TGCGCCACTT CGGCTCCCTG
AAACGGCTTG CCGCCGCCTC GGTGGAGGAG ATCACCGAGG TTCCTGGGAT CGGTAAGCGG
ACGGCCGAGG CGATCCTCGC CGCGCTCGCC GACCCGACGG GGCAGAGTGA ACCGCGTAGA
TAG
 
Protein sequence
MADPSSYRPA PGTIPESPGV YRFRDGTGRV IYVGKARNLR SRLNSYFADP VNLHQRTRQM 
VFTAESVDWI SVATEVEALQ QEFTEIKQYD PRFNVRYRDD KSYPYLAVTV DEEFPRLQVM
RGAKRRGVRY FGPYSHAWAI RETLDLLLRV FPARTCSSGV FKRAGQVGRP CLLGYIGKCS
APCVGSVSAE EHRDIVNGFC DFMAGRTDAM VRRLEREMAE ASAELEFERA ARLRDDLAAL
RRAMEKQTVV FGDGTDADVV AFADDPLEAA VQVFHVRDGR IRGQRGWVVE KTEDLTAGDL
VHHFCTQVYG GEHGEAHVPR ELLVPELPAD VEALADWLSE HRGSRVTLRV PQRGDKRALL
ETVARNATDA LARHKLKRAG DLTTRSKALD EIADTLGMRT APLRIECFDI SQIQGTDVVA
SMVVFEDGLP RKSEYRRFII RGATDDLSAM SEVLRRRFAR YLDARAETGE AGVESAGDPD
APAGPDAPDE PRVGTLVDPT TGRPRKFAYP PQLVVVDGGA PQVAAAAQAL AELGVDDVAL
CGLAKRLEEV WLPDDDFPAI LPRTSEGLYL LQRVRDEAHR FAITFHRQRR SRRMTESALD
RVSGLGEVRR KALLRHFGSL KRLAAASVEE ITEVPGIGKR TAEAILAALA DPTGQSEPRR
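
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch, assuming the sequence text has been copied into a Python string; the helper names are hypothetical. Pasting the gene sequence into `gc_fraction` would allow a comparison with the GC content field, and `len(clean_sequence(...))` with the length fields.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same block layout (not the full gene sequence).
example = "GTGGCTGATC CCTCGTCCTA\nTAG"
print(len(clean_sequence(example)))  # 23
print(gc_fraction("GGCC AATT"))      # 0.5
```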