Gene Dshi_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0644 
SymboluvrC 
ID5711479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp642411 
End bp644306 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content67% 
IMG OID641266552 
Productexcinuclease ABC subunit C 
Protein accessionYP_001531991 
Protein GI159043197 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ATGGCGAACT GTCGGACGGA TTGGACCCCG AAGCGGAGGC GCTGACCGGT 
CATGCGCGCA TCCAGGCCTA TCTGAAAACC CTCGACGGCT CGCCCGGGGT GTACCGGATG
CTCGATGCGC AATCCCGGGT GCTCTACGTG GGCAAGGCGC GCAACCTCAA GGCGCGGGTG
TCGAACTACG CGCGCCCCTC GGGGCACTCG GCGCGGATCG CGCGGATGAT CTCCGAGACC
GCGTCGATGA TGTTCCTGAC CACGCGGACC GAGACCGAGG CGCTGCTGCT GGAGCAGAAC
CTCATCAAGC AGCTCAAGCC GCGCTACAAC GTGCTGTTGC GCGACGACAA GAGCTTTCCG
AACATCCTCG TGGCCAAGGA CCACGCCTAT CCGCAGATCC GCAAGCATCG CGGGCGCAAG
GCCACCAAGG GCGCCTATTA CGGCCCCTTC GCCAGTGCGG GGGCGGTGAA CCGGACGCTC
AACCAGCTGC AGAAGGCCTT CCTGCTGCGC AACTGCTCGG ACGCGATGTT CGAAAGCCGC
ACGCGCCCCT GCCTGCTCTA CCAGATCAAG CGCTGCTCGG GCCCGTGCGT CGGCTATATC
TCCGAAGAGG ATTACGACGC CTCCGTGCGC GATGCCGAAC GGTTCCTGTC GGGCAAGTCC
ACCGAGGTGC AGGAACGCCT CGCCGTCCAG ATGACCGAGG CTGCCGAGGC GATGGAGTTC
GAGCGCGCCG CCGCCCTGCG CGACCGCATC CGCGCCATGA CCCAGGTGCA GTCGGCCCAG
GGCATCAACC CGCGCGGCGT GTCCGAGGCG GATGTGATCG CGCTCCACCT CGAGAACGGG
CAGGCCTGCG TGCAGGTGTT CTTCATCCGA GCAAACCAGA ACTGGGGCAA CCGCGACTTC
TATCCGCGCA CCGGGGCGGG GGCCGAGGCG CCCGAAATCC TCGAAGCCTT CCTCGGGCAG
TTCTACGACC AGAAGGACCC GCCCCGGCAG ATCCTGCTGT CCCATCCGAT CGACAACCAT
GACCTCATGG CCGACGCGCT GGGCGAAAAG AGGGGCCGCA AGGTCGAGAT CCTCGTGCCC
AAGCGCGGCG AAAAGGCCGA GTTGATCGCC GGGGCAGAGC GCAACGCCCG CGAGAGCCTC
GCCCGCCGCA TGGCCGAAAG CGCCACGCAG ACCAAGCTGC TGCGCGGTCT GGCCGAGGCC
TTCGATCTGG ACGCGCCGCC AAAGCGGATC GAGGTCTACG ACAACTCCCA TATCCAGGGC
ACGGACGCGG TCGGCGCGAT GATCGTGTCG GGCCCGGACG GGTTCCTGAA ATCCCAGTAC
CGCAAGTTCA ACATTCGCGG CACCGACCTG ACCCCGGGCG ACGATTTCGG CATGATGAAA
GAGGTGTTGC AACGCCGCTT CCAGCGCCTG CTGAAAGAAG ACCCCGACCG CGAGAGCGAG
GCTTGGCCGG ACCTGTTGCT GATCGACGGC GGGGCGGGGC AGGTCTCGGC GGTGGCGGAC
ATCCTCGGCG ATCTCGGGGT GGAGGACGTG CCCTTCATCG GCGTGGCCAA GGGCGTGGAC
CGCGACGCGG GCAAGGAGGA ATTCCACCGC CCCGGTGCGC GCCCCACGGC TCTCCGCCAC
AACGACCCGG TGCTTTATTT CGTCCAGCGC CTGCGCGACG AGGCGCACCG TTTCGCCATC
GGCACGCACC GGGCGAAGCG GTCGAAATCG GTCTCCCGCA CGCCGCTCGA CGATGTGCCG
GGGGTGGGTG CCAAGCGCAA GCGCGCATTG CTGGCGCATT TCGGCTCCGC CAAGGCGGTC
AGCCGCGCCA ATCTGTCGGA CCTCAAGGCG GTCGAGGGCG TCTCGGCGAC CATGGCGCAG
ACCATCTACG ACTTCTTTCA CGACCAGAAG GGCTGA
 
Protein sequence
MSDNGELSDG LDPEAEALTG HARIQAYLKT LDGSPGVYRM LDAQSRVLYV GKARNLKARV 
SNYARPSGHS ARIARMISET ASMMFLTTRT ETEALLLEQN LIKQLKPRYN VLLRDDKSFP
NILVAKDHAY PQIRKHRGRK ATKGAYYGPF ASAGAVNRTL NQLQKAFLLR NCSDAMFESR
TRPCLLYQIK RCSGPCVGYI SEEDYDASVR DAERFLSGKS TEVQERLAVQ MTEAAEAMEF
ERAAALRDRI RAMTQVQSAQ GINPRGVSEA DVIALHLENG QACVQVFFIR ANQNWGNRDF
YPRTGAGAEA PEILEAFLGQ FYDQKDPPRQ ILLSHPIDNH DLMADALGEK RGRKVEILVP
KRGEKAELIA GAERNARESL ARRMAESATQ TKLLRGLAEA FDLDAPPKRI EVYDNSHIQG
TDAVGAMIVS GPDGFLKSQY RKFNIRGTDL TPGDDFGMMK EVLQRRFQRL LKEDPDRESE
AWPDLLLIDG GAGQVSAVAD ILGDLGVEDV PFIGVAKGVD RDAGKEEFHR PGARPTALRH
NDPVLYFVQR LRDEAHRFAI GTHRAKRSKS VSRTPLDDVP GVGAKRKRAL LAHFGSAKAV
SRANLSDLKA VEGVSATMAQ TIYDFFHDQK G