Gene Daro_0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0473 
Symbol 
ID3569138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp531202 
End bp534027 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content62% 
IMG OID637678915 
Productexcinuclease ABC subunit A 
Protein accessionYP_283700 
Protein GI71906113 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.78914 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACCA TCCGCATCCG CGGTGCGCGC ACCCACAATT TAAAGAACAT CAACCTCGAC 
CTGCCGCGTG ATCAGTTGAT CGTGATCACC GGACTATCCG GCTCCGGCAA GTCCTCCCTG
GCCTTCGACA CCTTGTACGC CGAGGGACAG CGCCGTTATG TCGAATCGCT GTCGGCCTAT
GCCCGGCAGT TCCTGCAGTT GATGGAGAAA CCGGATGTCG ATCTGATCGA AGGCCTGAGC
CCGGCCATTT CCATCGAACA GAAGGCGACC TCGCACAACC CACGGTCGAC GGTCGGCACC
GTGACTGAAA TTCACGACTA CCTGCGCCTG CTCTTCGCCC GCGCCGGCAC GCCGTACTGC
CCGGACCACA ACCAGCCGCT GGAAGCGCAA ACCGTGTCGC AGATGGTCGA CACCGTGCTC
GCCCTGCCGG CCGAAACCAA GCTGATGATC CTCGCCCCGG TCGTTGCCAA CCGCAAGGGC
GAACAAGTCG ACCTGTTCAC CGAATTGCGC GCCCAAGGCT TTGCCCGCGT CCGTGTCGAT
GGCACGGTCT ATGAGATCGA TGCCGTACCC AAGCTGGCCA AGACCCAGAA ACACACCGTC
GACGTCGTCG TCGACCGCCT GAAGGTGCGT GACGACATGC GCCAGCGCCT GGCCGAATCC
TTTGAAACAG CGCTGCGCCA TGCCGAGGGC CGAGCCATCG CGCTGGAAAT GGACAGCAAC
GTCGAGCATC TTTTTTCGGC CAAGTTCGCC TGCCCGGTCT GTTCCTACGC CTTGCAGGAA
CTGGAACCGC GCCTGTTCTC GTTCAACAAC CCGATGGGCG CCTGCCCGAA GTGCGACGGC
CTGGGCGTCA TCCAGTTCTT CGACCCGAAA CGGGTGGTCA CCCAACCAAC CGCTTCGCTG
GCCGGCGGCG CGATTCGCGG CTGGGACAAG AAGAACCAGT TCTACTTCCA GATCATCGAA
TCGCTGGCCG ACCATTACGG TTTTTCGGTC GATACGCCAT GGAACGACTT GCCGGAAAAG
GTCCAGCAAC TGGTCCTTTA CGGCTCAGGC AACGTGGCCA TCAACTTCCG CTACCTGAAC
GAGAAGGGCA CCCGCTTCGA CCGCAGCCAC AGCTTCGAAG GCATCATCCC GAACCTGGAA
CGGCGCTACC GCGGCAGCGA ATCGAACGCC GTGCGCGAGG AGTTGGCGAA GTACGTCAGC
AGTTCGGCCT GCCCGAGCTG CGCCGGCACC CGGCTGCGCA TTGAGGCACG GCATGTTCGG
GTCGGCGACA AGACCCTGTA TGAAATCAGC CGGATGCCGC TTGGCGAGGC GCGCAATTAT
TTCAACTGCC TGACGCTGAC CGGCGCCAAG GCACAGGTGG CCGACAAGAT CCTCAAGGAA
ATCACCGCCC GCCTGAGCTT CCTGATCAAC GTCGGCCTCG ATTACCTGTG CCTCGAGCGC
TCGGCCGAAA CGCTGTCCGG CGGCGAGGCG CAGCGCATCC GGCTGGCCTC GCAGATCGGC
TCGGGCCTGA CCGGCGTCAT GTATGTACTC GACGAACCCT CAATCGGTCT GCACCAGCGC
GACAATGATC GTCTGCTCGA AACATTGAAG AACCTGCGCG ACATGGGCAA CACGGTATTG
GTCGTCGAGC ATGACGAGGA TGCCATCCGC GCCGCCGACT ACGTCGTCGA CATCGGCCCT
GGCGCCGGGG TGCATGGCGG TTTCATCGTT GCCCAGGGGA CGCCGGCCGA GGTTCAGGCC
AATCCGCTAT CGATGACTGG CGATTACCTG TCCGGCCGCA AATCCATTTC TGCCCCGAAA
ACGCGCCGGG TGCCGGATCC GAACAAGCAG CTCAAGGTCG TCGGCGCCTA CGGCAACAAT
CTGAAGGACG TCACGCTGGA AATCCCCGGC GGCCTGCTTA CCTGCATCAC CGGTGTTTCC
GGCTCGGGCA AATCGACGCT GATCAACGAC ACGCTGTACG CGGCGGCCGC CAGACACCTC
TACGGCTCGA CCACGGAACC CGCACCACAC AAGGAAATCA TCGGTCTGGA GCTGTTCGAC
AAGGTAATCA ACGTCGACCA GGCGCCGATT GGCCGCACGC CGCGCTCCAA CCCGGCGACC
TACACCGGCC TGCTGACGCC GATCCGCGAG CTGTTCGCCC AGGTACCCGA ATCCCGCGTC
CGCGGCTACG GGCCGGGACG CTTCAGCTTC AACGTCAAGG GCGGACGCTG CGAGGCCTGC
CAGGGCGACG GCATGATCAA GGTGGAGATG CACTTCCTGC CCGATATCTA CGTCCCCTGC
GACGTCTGCC ACGGCAAGCG CTACAACCGC GAAACGCTGG AAGTCCAGTA CAAGGGCAAG
AACATTTACG ACATTCTCGG CATGACCGTC GAGCAGGCGC GAGAGTTCTT CGACCCCGTC
CCCAACATCG CGCGCAAGCT GCAAACACTG GTCGATGTCG GCCTCAGCTA CATCACCCTG
GGCCAGAGCG CGACCACGCT GTCCGGCGGC GAAGCCCAGC GCGTCAAGCT CGCCCTTGAA
CTGTCCAAGC GCGATACCGG CCGCACGCTG TATATCCTCG ACGAGCCGAC CACCGGCCTG
CATTTCCAGG ACATCGAAAT GCTGCTCAGC GTCCTGCAGC GTCTGGCCAA CAACGGCAAC
ACCATCGTCG TCATCGAACA CAACCTCGAC GTCATCAAGA CCGCCGACTG GATCGTCGAT
CTCGGGCCGG AGGGCGGCGA TGGCGGGGGC CGCATCCTGG TCAGCGGCAC GCCTGAAGAA
GTGGCCAAAT GCAAGGCCAG CCATACCGGG CGCTTCCTGA AACCTTTGCT TGAGAAAAAG
AAATGA
 
Protein sequence
METIRIRGAR THNLKNINLD LPRDQLIVIT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLQLMEK PDVDLIEGLS PAISIEQKAT SHNPRSTVGT VTEIHDYLRL LFARAGTPYC
PDHNQPLEAQ TVSQMVDTVL ALPAETKLMI LAPVVANRKG EQVDLFTELR AQGFARVRVD
GTVYEIDAVP KLAKTQKHTV DVVVDRLKVR DDMRQRLAES FETALRHAEG RAIALEMDSN
VEHLFSAKFA CPVCSYALQE LEPRLFSFNN PMGACPKCDG LGVIQFFDPK RVVTQPTASL
AGGAIRGWDK KNQFYFQIIE SLADHYGFSV DTPWNDLPEK VQQLVLYGSG NVAINFRYLN
EKGTRFDRSH SFEGIIPNLE RRYRGSESNA VREELAKYVS SSACPSCAGT RLRIEARHVR
VGDKTLYEIS RMPLGEARNY FNCLTLTGAK AQVADKILKE ITARLSFLIN VGLDYLCLER
SAETLSGGEA QRIRLASQIG SGLTGVMYVL DEPSIGLHQR DNDRLLETLK NLRDMGNTVL
VVEHDEDAIR AADYVVDIGP GAGVHGGFIV AQGTPAEVQA NPLSMTGDYL SGRKSISAPK
TRRVPDPNKQ LKVVGAYGNN LKDVTLEIPG GLLTCITGVS GSGKSTLIND TLYAAAARHL
YGSTTEPAPH KEIIGLELFD KVINVDQAPI GRTPRSNPAT YTGLLTPIRE LFAQVPESRV
RGYGPGRFSF NVKGGRCEAC QGDGMIKVEM HFLPDIYVPC DVCHGKRYNR ETLEVQYKGK
NIYDILGMTV EQAREFFDPV PNIARKLQTL VDVGLSYITL GQSATTLSGG EAQRVKLALE
LSKRDTGRTL YILDEPTTGL HFQDIEMLLS VLQRLANNGN TIVVIEHNLD VIKTADWIVD
LGPEGGDGGG RILVSGTPEE VAKCKASHTG RFLKPLLEKK K