Gene Daro_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1920 
Symbol 
ID3569577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2063927 
End bp2065171 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content67% 
IMG OID637680391 
Productexonuclease subunit SbcD 
Protein accessionYP_285136 
Protein GI71907549 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID[TIGR00619] exonuclease SbcD 


Plasmid Coverage information

Num covering plasmid clones67 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATTC TCCACACCTC CGACTGGCAC CTTGGCCAGC ACTTCATGGG CAAGAGCCGG 
CAGGCCGAGC ATCAGGCGCT AATCGTCTGG CTGCTCGAAC AGGTGGAAAC GCAGGCAGTG
GATGCCGTGC TGATCGCCGG CGACATCTTC GACACCGGCA CGCCACCCAG CTACGCCCGC
GAGTTGTACA ACCAGCTGGT CGGGCAACTC TACAAGGCCG GCGTGCCGCT GCTGGTGCTC
GGCGGCAACC ACGATTCGCC GGCCACGCTG GGCGAAAGCC GCGAACTGCT CGCCCACCTG
GGCACCACCG TCATCGGCGC AACGCACACC GACCCGGCCA CGCAGGTCAT CGTCCTGCCG
CAACGCAACG GCGAACCCGG CTGCATCGTC TGCGCCATTC CCTTCGTCCG GCCGCGCGAC
GTGCTGCAAA GCCAAGCTGG GCAAAGCGCC GAAGACAAGC AGCTGTCGCT GCAAACCGCC
ATCCAGGAAC ATTACAGCGC CGTGTTCGCC GCCGCCGTTG AACGCCAGCA GGCACTGGCC
GCCCAACTCG GCCCGAATTT CGGCCGCCAG CTGCCGATCA TCGCCACCGG CCACCTGACC
ACCGTCGGCG CCAGCACCAG CGAATCGGTC CGCGAAATCT ACGTCGGCGC CCTCGAAGCC
TTTCCGACCA CCGCCTTCCC GCCGGCTGCC TACATCGCGC TCGGCCATAT CCACCGGCCA
CAGAAAGTCG GCGGACTGGA CCATATCCGC TACTGCGGCT CGCCGATCCC GCTCAGCTTC
GACGAAGCGA AACAAACCAA GGAAATGCTG CTCGTCGATC TCGACAGCGA TGGCCTCAAG
GCCGTCACCG TGCTGCCTGT GCCGCGCTTC CAGGCGCTGG TCGCCGTCAG CGGCAATCTC
GAATCACTGG CCGGCGCAAT CGGCGCCGCA GCCGCCGAAG GCACGCGCGA ATGCCCGGCC
TGGCTTGAAG TCACCGTCGC CGAAGACGAT TACCTGGCCG ACCTGCCGGC CCGCATCGAA
GCGTTGACCG AAGGCTGGCC GGTCGAGGTG CTGCGCATCC GCCGCCAGCG CGGCAATGCC
ACGGCCCGAC TGGCCGCCGA GGCCCGCGAA ACGCTGGACG AACTCAGCCC GCACGACGTT
TTTGCCCGCC GCTTGCAACA GGAAGAACTC GGCGAAGAAA TGCAGTTGGC CCTCAACGAA
CGCTACCGCG CCGTCGTTGC CGGACTGCAG GGAGAGGAAG CATGA
 
Protein sequence
MRILHTSDWH LGQHFMGKSR QAEHQALIVW LLEQVETQAV DAVLIAGDIF DTGTPPSYAR 
ELYNQLVGQL YKAGVPLLVL GGNHDSPATL GESRELLAHL GTTVIGATHT DPATQVIVLP
QRNGEPGCIV CAIPFVRPRD VLQSQAGQSA EDKQLSLQTA IQEHYSAVFA AAVERQQALA
AQLGPNFGRQ LPIIATGHLT TVGASTSESV REIYVGALEA FPTTAFPPAA YIALGHIHRP
QKVGGLDHIR YCGSPIPLSF DEAKQTKEML LVDLDSDGLK AVTVLPVPRF QALVAVSGNL
ESLAGAIGAA AAEGTRECPA WLEVTVAEDD YLADLPARIE ALTEGWPVEV LRIRRQRGNA
TARLAAEARE TLDELSPHDV FARRLQQEEL GEEMQLALNE RYRAVVAGLQ GEEA