Gene Daro_3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3894 
Symbol 
ID3567739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4184272 
End bp4185684 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content60% 
IMG OID637682368 
Productdeoxyribodipyrimidine photo-lyase type I 
Protein accessionYP_287092 
Protein GI71909505 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones68 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG AAAAAGCCTT GGTCTGGTTC CGCCGCGACT TGCGCGACCA TGACCACGCC 
GCCCTGAGCG CGGCCCTCGC CGAGGCGCAG CAGGTGTATT GCGCCTTTGT CTTCGATAGC
GAAATTCTCG ATCCGCTGCC AACACGCCAT GATCGCCGGG TGCACTTCAT CCGCGAATCG
CTGGTCGAAC TGGATGCCGC CTTGCGAGCC AGGGGTGGCG GGCTGATCAT TCGGCACGGC
CAGGCCGTCG ACGAGATTCC TGCTCTCGCC AGGCGGCTTG GCGTATCTGC CGTCTTCACC
AACCGGGATT ACGAACCCTC GGCGAAACGC CGCGATGCCC AGGTTGCCCG GCAACTTCGG
AACGACGACA TTGCCTTTCA CGGTGTCAAG GATCAGGCCA TTTTCGATGG CGACGAAGTA
TTAACCCAGG CGGGAAAAGC CTTTTCTGTT TTTACCCCCT ACAAGAATGC CTGGCTAAAA
CGCCTGACCA CCGCTGATTA CGCTGCCTGG CCCTGTGATG GGCGATTGGC CGGCCAGGAA
CTGGCAGGCA TTCCAACGCT GGAAGAGATT GGCTTTACTC CGACCGACTT GGCCGAACTC
GGCATCCAGC CGGGCATGTC AGGCGCCAAA GGGCTGTGGG ACGATTTCTC CCGGGACCGC
ATCGAGCGCT ATGGCAGCCT GCGCGACTTT CCTGCCGTCA AGGGCGTCTC CTACCTGTCC
GTCCATCTAC GCTTCGGGAC CATCTCGATC CGCCAACTGG TCAGGCAGGC ATTGGCACAT
CAGGCTGACA CCTGGCTCAG CGAGTTGATC TGGCGCGACT TCTATTTCAT GATCCTCGAC
CATTTTCCCC ACGTCGCCGG ACACGCCTTC AAGCCGGAAT ACGATGCAAT TCAATGGGCA
AGCCGTCCTG AAGCCTTTGC AGCCTGGTGC GAAGGTTGCA CCGGCTACCC GCTAGTCGAT
GCGGCCATGC GCCAACTCAA TTTCAGCGGC TGGATGCACA ATCGGCTCCG CATGGTCGTC
GCCTCCTTCC TGACCAAGGA TCTCGGCATC GACTGGCGGC TCGGCGAAAA ATACTTTGCC
GAGCAACTCA ACGACTTCGA TCTGTCTGCC AACAACGGCG GCTGGCAGTG GGCCTCATCG
AGCGGCTGCG ATGCCCAGCC CTATTTTCGG ATTTTCAACC CGGTCACGCA GTCGGAAAAG
TTCGATGCGG AGGGCAAATT CATCCGCCGT TATGTGCCGG AACTGGCCAA GGTACACGAT
AAATACATCC ATGCCCCGTG GAAAATGGGG CGCATCGAAC AGGAAGCACT CGGGGTGGTG
ATCGGACGCG ACTACCCGTC GCCGATCGTC GATCACGCAA CGGCCAGGGA TGAAACCCTG
GCCCGCTACG CAGTCGTCAA GAAGCAGGCC TAA
 
Protein sequence
MKKEKALVWF RRDLRDHDHA ALSAALAEAQ QVYCAFVFDS EILDPLPTRH DRRVHFIRES 
LVELDAALRA RGGGLIIRHG QAVDEIPALA RRLGVSAVFT NRDYEPSAKR RDAQVARQLR
NDDIAFHGVK DQAIFDGDEV LTQAGKAFSV FTPYKNAWLK RLTTADYAAW PCDGRLAGQE
LAGIPTLEEI GFTPTDLAEL GIQPGMSGAK GLWDDFSRDR IERYGSLRDF PAVKGVSYLS
VHLRFGTISI RQLVRQALAH QADTWLSELI WRDFYFMILD HFPHVAGHAF KPEYDAIQWA
SRPEAFAAWC EGCTGYPLVD AAMRQLNFSG WMHNRLRMVV ASFLTKDLGI DWRLGEKYFA
EQLNDFDLSA NNGGWQWASS SGCDAQPYFR IFNPVTQSEK FDAEGKFIRR YVPELAKVHD
KYIHAPWKMG RIEQEALGVV IGRDYPSPIV DHATARDETL ARYAVVKKQA