Gene Daro_2038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2038 
SymboluvrC 
ID3566756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2188218 
End bp2190029 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content57% 
IMG OID637680509 
Productexcinuclease ABC subunit C 
Protein accessionYP_285253 
Protein GI71907666 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value0.331579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTG ATGCCAAGGC CTTTCTGGCC ACGCTGACTG AATTACCCGG CGTCTACCGC 
ATGCTGGATA TAGGCGGAAA TGTGCTTTAC GTCGGCAAAG CCAAGAATCT GAAGAAACGG
GTTGCCTCCT ACTTTCGGGA GAACCTCTCC AGTCCGCGTA TTGCGCACAT GGTCAGTCAG
ATCGCTTCGA TTGAGACCAC GGCAACCCGT ACCGAAGCCG AAGCGCTGCT ACTTGAAAAC
AACCTGATCA AGTCGCTGGC GCCGCGCTAC AACATCCTGT TTCGGGATGA CAAATCCTAT
CCGTATATCG TCCTGAGCAA GGGAAAGTTT CCTCGACTGG GTTTCTTTCG CGGTAATCCG
GACCGCAAGG CTGATTATTT CGGCCCCTAT CCATCCTCTT GGGCTGTACG TGACAGCATT
CATTTGATGC AAAAAATGTT CCGTCTGCGC ACGTGCGAAG ACACTGTCTT TTCCAATCGC
TCGCGCCCTT GTCTTCTTTA CCAGATCAAG CGCTGCAGCG GCCCCTGCGT GGGCTTCATA
TCGGCGGACG ACTACGCGGC CGACATCCAG TTAGCTGCAA TGTTTCTCCT TGGCAAGCAG
CAGGAAGTAA CCCGCCGCCT AACCAAGTCG ATGGAAGAGG CTTCTGCCAA GCTGGCCTTT
GAGCAGGCTG CCGTATTCCG TGATCAGATA CAGTCTTTGC ATCAGGTTCA AGAAAAGCAG
TTTGTCTCCA GCAGTAAAGG AGAGGATGTC GACGTCCTGG TCGCAATCAA GGAGGCGGGG
CAGCTGTGCG TCAATCTGGC CATGGTCCGC GGCGGCAGGC ACCTCGGCGA TCGACCATTT
TTCCCCACGA ATGCAGCTGA CTCCGAACCA TCGGATGCAT GTGCCGCCTT CATTCGCCAA
CATTACGCAG CCCATCCAGC ACCAGCACGG ATCCTTTCAT ATCCATTGCC TTCTGAAGAC
GAGGCGGGTG AAACCGAAGT GGCGCTTGCC GAATTGGCTG GCCGCCCAGT GCCCGTGCAG
GAAGGGCGGG GCGCCACCCA CAAGGCCTGG GTCGAAATGG CGATACAAAA CGCACGTCTG
GCTATCCTGG CGAAAAATCA GGCGACTGCC CAGCAAGAAC AACGACTTGC TGCCTTGCAG
GATGCCTTGC AACTTCCAGA GCCTATAGCA CGGATCGAAT GTTTCGACAT CAGCCACACG
ATGGGCGAAG CAACTGTTGC CTCCTGTGTC GTCTACGAAG GCAATCGCAT GAAAAAGAGC
GACTATCGCC GCTTCAATAT CCGCGACATT CAGGCGGGCG ATGACTACGC CGCCATGCGT
CAGGCGGTCA GTCGCCGTTA CGACAGCATC GCGGGCGGGG AGGGGACTGC GCCCGATCTG
ATTCTCATTG ATGGCGGCAA AGGCCAGGTC AGCTCGGCCT TCAGTGCGCT TGCCGACCTC
GGATTGACCC ATTTGCCGAT GATCGGTGTC GCCAAAGGTG AAGAACGTAA GCCGGGGCTT
GAAACCCTGA TTTTCCCGGA GGGGCGGGAG CCGTTACAAT TGCCGCCGCA ACATCCGGCA
CTACACTTGA TCCAGGAAAT CCGCGACGAA GCCCATCGTT TTGCCATTAC CGGCCATCGT
GCCCAGCGCG GCAAAGCGAG AAAAACCTCG AAGCTGGAAA GCCTGCCTGG CATAGGACCG
GCTCGTAGAA AGGCGCTCGT CGCGCGCTTT GGTGGCCTTC CTGGCGTACT TGCGGCAAGC
ATCGACCAGT TGGCCGAAGT TCCCGGAGTC AGTCGGGAAA TGGCCGAGAA GATACATTCC
GCATTACACT GA
 
Protein sequence
MSFDAKAFLA TLTELPGVYR MLDIGGNVLY VGKAKNLKKR VASYFRENLS SPRIAHMVSQ 
IASIETTATR TEAEALLLEN NLIKSLAPRY NILFRDDKSY PYIVLSKGKF PRLGFFRGNP
DRKADYFGPY PSSWAVRDSI HLMQKMFRLR TCEDTVFSNR SRPCLLYQIK RCSGPCVGFI
SADDYAADIQ LAAMFLLGKQ QEVTRRLTKS MEEASAKLAF EQAAVFRDQI QSLHQVQEKQ
FVSSSKGEDV DVLVAIKEAG QLCVNLAMVR GGRHLGDRPF FPTNAADSEP SDACAAFIRQ
HYAAHPAPAR ILSYPLPSED EAGETEVALA ELAGRPVPVQ EGRGATHKAW VEMAIQNARL
AILAKNQATA QQEQRLAALQ DALQLPEPIA RIECFDISHT MGEATVASCV VYEGNRMKKS
DYRRFNIRDI QAGDDYAAMR QAVSRRYDSI AGGEGTAPDL ILIDGGKGQV SSAFSALADL
GLTHLPMIGV AKGEERKPGL ETLIFPEGRE PLQLPPQHPA LHLIQEIRDE AHRFAITGHR
AQRGKARKTS KLESLPGIGP ARRKALVARF GGLPGVLAAS IDQLAEVPGV SREMAEKIHS
ALH