Gene Dgeo_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_1124 
SymboluvrC 
ID4058293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp1194925 
End bp1196775 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content68% 
IMG OID641230140 
Productexcinuclease ABC subunit C 
Protein accessionYP_604591 
Protein GI94985227 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.957907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0034731 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGCATTTCG ATGACCTGCC TGTGCTGCCT GCTTCGCCCG GCGTGTACAT CTTTCGCCGG 
GGAGGAACGC CGATCTATAT CGGGAAGGCG GTCAACCTGC GCTCGCGGGT CGCGCAGCAC
TTCAAGGCGG GCGGCAAAAG CGGCAAGTTC ACGGCGTTGG CCGACAGTCT GGACTTCATC
ACCACCCGCA ACGAGGTCGA GGCGCTGATC CTCGAAGCGA ACCTGATCAA GCAGCACCGC
CCGCACTACA ACGTGCTGCT CAAGGACGAC AAGCATTACC CTTTCCTCAA GCTGACGAAC
GAGGCATTTC CGATGCTGGT CGTCACCCGG CGGGTCCTGA AGGACGGCGC GAGCTACTAC
GGGCCCTACC CGGACGCCTC GGCGGTGCGG CGGGTCAAGC ACCTCATCGA CACGATGTTC
CCGCTGCGGA AAAACTCGGG CCTGCCCATG CAGAAAAAGC CGCGCCCTTG TCTGAACTAC
CATATGGGCC GCTGCCTCGG GCCGTGCGTC GACGCGGCAG ACCCGCAGGC GTACGCACAG
GTGGTCGAGG ACGTGAAGGC GCTGCTCGAG GGCCGCGCGG CCCCGGTGAT CGCCCGGCTG
AAGGCGGACA TGCAGGCCGC GGCGCGGGCG CAGGATTTTG AGCAGGCCGC GCGGCTGCGC
GACCGCGTGC AGGCGGTCGA GAAGCTCTTT GGGACCGAGC AGCACGCCTA CGTCAGCGAG
GAGACCGACC TGGACTTCTT GGGAGTGGCG CAGGCGGGCG AGTACGCGAT GGTGCAGCTC
TTCCGCCTGC GCGGCGGGCG GGTGGTGGGC CGTGACAAGC GCTTCCTGGT GGGCGCGGAG
GGGGGCGCCG ACGTGGGCGA GGTGCTGGGG GCCTTTGTGC AGGACTACTA CACCCAGGCC
ACGCACGTCC CGCCGCTCAT CCTGCTGCCC GCCGAGTTCG AGGACGCGCC GGTGTGGAGC
GCCTTTCTCT CGGAGCGAGC CGGGCGGCGG GTGGAGATGC GCACGCCCAA GCGCGGCGAC
AAGGCCGAGT TGGTGGAGAT GGCGCAGCGC AACGCGGCAG CGGGGCTGGA ATCCGAACTG
GCCCTGCTGG AGCGCCGGGG TGACCATCCC GGGCTGGACG CGCTGCGGGA GGTGCTCGCC
CTCCCGGACC GGCCCTGGCG CATCGAGGGC TACGACAATT CCAATCTGTT TGGCAGCAAC
ATCGTCTCGG GGATGGTGGT TTTCGAGGGC GGACGCGCGC GGCGGAGCGA GCATCGCCGC
TTCAAGGTCA GGGGGCTGGA TCACCCCGAC GATTACGCGG CGATGCACCA GACGATCACC
CGGCGCTTGA CCGGGTCCCT GGCCGACAAG CTGCCGCTGC CCGACCTCAT CCTGATCGAT
GGGGGACGCG GACAGGTGCA CGCCGCCCTC GACGCGCTGC GAGCGGCGGA TGTGCGGGTG
CCGCTGGTGG GTCTTGCCAA GCGCGAAGAA CGGATCATCC TGCCGGGGCG CTTCGGGGCG
CAGTGGTGGC TGGAGACGGG AACCGAGGTC GGGGTCGGCG GCGAGCTGCT GCTCCCGCAC
ACGCATCCGG CGCTGCGGGT CCTCATCGGC GTGCGCGACG AGGTGCACCA CTACGCCGTG
AGCTACCACC GCACGCTGCG CGGCGAGCAG ATGCTGCGCA GCGTGTTTGA CGATCTGCCG
GGCATCGGCC AGAAGCGCAG GGACGCTCTG CTGGAGCATT TCACCAGCCT GGAAGACCTC
GCCGCCGCGC CGGTCGAGAG AATTGCGGCG GTACCCGGCA TGAACCTGCG AGCGGCGCAG
AGCGTCAAGA AGTTCCTGGC GGAGCGGACA GCGAACGGAA CGCCGACATA A
 
Protein sequence
MHFDDLPVLP ASPGVYIFRR GGTPIYIGKA VNLRSRVAQH FKAGGKSGKF TALADSLDFI 
TTRNEVEALI LEANLIKQHR PHYNVLLKDD KHYPFLKLTN EAFPMLVVTR RVLKDGASYY
GPYPDASAVR RVKHLIDTMF PLRKNSGLPM QKKPRPCLNY HMGRCLGPCV DAADPQAYAQ
VVEDVKALLE GRAAPVIARL KADMQAAARA QDFEQAARLR DRVQAVEKLF GTEQHAYVSE
ETDLDFLGVA QAGEYAMVQL FRLRGGRVVG RDKRFLVGAE GGADVGEVLG AFVQDYYTQA
THVPPLILLP AEFEDAPVWS AFLSERAGRR VEMRTPKRGD KAELVEMAQR NAAAGLESEL
ALLERRGDHP GLDALREVLA LPDRPWRIEG YDNSNLFGSN IVSGMVVFEG GRARRSEHRR
FKVRGLDHPD DYAAMHQTIT RRLTGSLADK LPLPDLILID GGRGQVHAAL DALRAADVRV
PLVGLAKREE RIILPGRFGA QWWLETGTEV GVGGELLLPH THPALRVLIG VRDEVHHYAV
SYHRTLRGEQ MLRSVFDDLP GIGQKRRDAL LEHFTSLEDL AAAPVERIAA VPGMNLRAAQ
SVKKFLAERT ANGTPT