Gene Gdia_0538 details


Gene Information

Locus tag: Gdia_0538
Symbol: uvrA
ID: 6973934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 592256
End bp: 595168
Gene Length: 2913 bp
Protein Length: 970 aa
Translation table: 11
GC content: 68%
IMG OID: 643390070
Product: excinuclease ABC subunit A
Protein accession: YP_002274947
Protein GI: 209542718
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
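
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(592256, 595168)
print(length, protein_length(length))  # 2913 970
```

Under these assumptions the computed values agree with the listed gene length (2913 bp) and protein length (970 aa).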


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.566614
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.542221
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAATC GATCCAGGCC GGCACCCGGC CTGCCAGCGG CGCAGTCGAT CCGGGTCCGC 
GGCGCGCGGG CGCACAACCT GAAGAACATC GACGTCGAAA TTCCCCGCGA CGCGCTGACG
GTCGTCACCG GCCTGTCGGG ATCGGGAAAG TCGTCCCTGG CGTTCGACAC GATCTATGCC
GAGGGCCAGC GCCGCTATGT CGAAAGCCTG TCGGCCTATG CGCGGCAGTT CCTGGAACTG
ATGGGCAAGC CGGACGTGGA CGCGATCGAG GGCCTGTCGC CCGCGATTTC CATCGAACAG
AAGACGACGT CGAAGAATCC GCGCTCCACC GTCGGCACGA TCACCGAGAT TCACGACTAC
ATGCGCCTGC TGTGGGCGCG GGCCGGCGTG CCCTATTCGC CCGCCACCGG CCTGCCGATC
GAGGCGCAGA CCATCAGCCA GATGGTCGAT CGCGTCATGG CGCTGCCGGA AGGCACGCGC
CTGATGCTGC TGGCGCCGGT CATTCGCGAC CGGAAGGGCG AATGCCGCAA GGAACTGGCG
GAACTGCAGC GCAAGGGCTT CGCCCGGGTG AAGGTGGACG GCACGCTGTA CGACATCGCC
GAGGTCCCGG ACCTGAACCG CAAGCTGCGT CATACCGTCG AGGCCGTGGT GGACCGGGTG
GTGGTGAAGC CCGGCCTGGA ATCGCGTCTG GCCGACAGTT TCGAGACCGC GCTGGGCCTG
TCCGACGGCT TGGTCTATGC CGAGGAACTG CCGCGCGGCG AGGCCGAGCC GCCGCCGCAC
ATCGTGTTCT CCTCGCGCTT CGCCTGCCCG GTCAGCGGCT TCACGCTGGA AGAGATCGAG
CCGCGCCTGT TTTCCTTCAA CGCGCCCCAG GGCGCATGCC CGGCCTGCGA CGGCATCGGG
CTGGAGACCT TCTTCGACCC GCACCTGATC GTGCCCGACG AATCCCTGTC GCTGGAACGC
GGGGCCATCG CGCCCTGGCG CAGCACGCAA AGCCCCTACT ACCAGCAGAC GCTGGACAGC
CTGGCCGCCC ATTACGGCGT GGGCATGGAC ACGCCCTGGC GGGACCTGCC GGCGGGCGTG
CGCGAGACCA TCCTGGACGG CGGCAAGGAC GAGATCCTGT TCCGCTATCG CGACGGCCGG
AAATCCTATG ACCTGACCAA GCGGTTCGAG GGCGTGGTGA CCAACCTGCG CCGCCGCATG
GCCGAAACCG ACAGCGTATG GGTGCGGGAG GAACTGTCGC GTTACCAGTC CGACAAGCCC
TGCCATGTCT GCCACGGCAC GCGCCTGCGG CCCGAGGCCC TGTCGGTGCG GGTAGCGGGT
TCGACGATCG CCGAGGCATC GGACTTGCCG ATCCGCCGGG CGCTGGACTG GTTCGGCACG
GTCGAGGCGA CGCTGACGCC GCAGCGCGCC GAAATCGCGC GGCGTATCCT GCGTGAAATC
CTGGACCGGC TGCATTTCCT GAACGATGTC GGGCTGGACT ACCTGACCCT GTCGCGCGGG
TCGGCCACCC TGTCGGGCGG GGAAAGCCAG CGGATTCGCC TGGCCAGCCA GATCGGATCG
GGCCTGACAG GAGTGCTGTA CGTGCTGGAC GAACCGTCTA TCGGCCTGCA CCAGCGCGAC
AACGAACGGC TGCTGGGCAC GCTGGACCGG TTGAAGCGGC TGGGGAACAC GCTGATCGTC
GTCGAGCATG ACGAAGACGC GATCCGGAGC GCCGACTGGC TGATCGACAT GGGGCCGGGG
GCCGGGGTCA ATGGCGGGCA CGTGGTGGCC ATCGGCACGC CCGAGGAGGT CGCGGCGAAT
CCGGCCAGCC TGACCGGCGA CTACCTGTCC GGCCGCAAGC GGATCGATGT GCCGACGGTG
CGCCGGCCGA TCGATCCCGC CCGCATGCTG GTGCTGGAGG GTGCCGGCGG CAACAACCTG
AAGGATGTCA CTGCCCATTT CCCGCTGGGC ACCTTCACCT GCGTGACCGG GGTGTCGGGG
GGCGGCAAAT CGACGCTGGT GATCGATACG CTGTACAAGG CGCTGTCGCG GCAGTTGATG
GGATCGGGGC AGAATCCCGC GCCCTATCGC GGGATCGCGG GGCTGGACCT GCTGGACAAG
ATCATCGACA TCGACCAGTC GCCGATCGGC CGCACGCCGC GATCCAATCC CGCGACCTAT
ACCGACCTGT TCGCGCCGAT CCGCGACTGG TTCGCCGAAC TGCCGGAAAG CAAGGCCAGG
GGCTACAAGG CGGGGCGGTT CTCCTTCAAC GTCAAGGGTG GCCGGTGCGA GGCCTGCCAG
GGCGACGGTG TGCTGAAGAT CGAAATGCAC TTCCTGCCCG ACGTGTTCGT GACCTGCGAC
ACCTGCAAGG GCGCGCGCTA CAACCGCGAG ACGCTGGAGG TGAAATTCCG CGGCAAGTCG
ATCGCCGACG TGCTGGCCAT GACGGTGGAC GAGGCGCTGC CGTATTTTTC TGCCGTGCCG
CGGATTCGCG ACCGGCTGGC CATCCTGCAG CAGGTGGGGC TGGGCTATGT CGCGCTGGGC
CAGCAGGCGA CGACCCTTTC GGGGGGCGAG GCGCAGCGCG TGAAGCTGTC CAAGGAACTG
GCCCGCCGCG CCACCGGACG CACGCTGTAC ATCTTGGACG AACCCACCAC CGGCCTGCAC
ACCGAGGACG TGCGCAAGCT GCTGGAGGTG CTGCATGCCC TGGTGGATCA GGGCAACACG
GTGGTGGTGA TCGAGCACAA TCTGGAAGTC ATCAAGACCG CGGACTGGGT GCTGGACATG
GGACCGGAAG GCGGCGATGG CGGCGGGCGC ATCGTGGCCG AGGGCACGCC CGAGGACATC
GCGGCGTGCC CCGAGAGCCA TACCGGGCGC TTCCTGCGCC CCCTGCTGCC GGCGGCGGCG
CCCAAGGCGC GCCGCCGTCG CTCCAAGGGC TGA
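
The gene sequence above is printed in blocks of ten bases, so it needs to be joined before any computation such as the GC content reported in the Gene Information table. The following is a rough sketch of how that could be done; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the rounding the database applies to its GC figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Usage: paste the full gene sequence between the quotes.
# Only the first three blocks are shown here for brevity.
fragment = clean_sequence("ATGGGTAATC GATCCAGGCC GGCACCCGGC")
print(len(fragment), round(100 * gc_fraction(fragment)))
```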
 
Protein sequence
MGNRSRPAPG LPAAQSIRVR GARAHNLKNI DVEIPRDALT VVTGLSGSGK SSLAFDTIYA 
EGQRRYVESL SAYARQFLEL MGKPDVDAIE GLSPAISIEQ KTTSKNPRST VGTITEIHDY
MRLLWARAGV PYSPATGLPI EAQTISQMVD RVMALPEGTR LMLLAPVIRD RKGECRKELA
ELQRKGFARV KVDGTLYDIA EVPDLNRKLR HTVEAVVDRV VVKPGLESRL ADSFETALGL
SDGLVYAEEL PRGEAEPPPH IVFSSRFACP VSGFTLEEIE PRLFSFNAPQ GACPACDGIG
LETFFDPHLI VPDESLSLER GAIAPWRSTQ SPYYQQTLDS LAAHYGVGMD TPWRDLPAGV
RETILDGGKD EILFRYRDGR KSYDLTKRFE GVVTNLRRRM AETDSVWVRE ELSRYQSDKP
CHVCHGTRLR PEALSVRVAG STIAEASDLP IRRALDWFGT VEATLTPQRA EIARRILREI
LDRLHFLNDV GLDYLTLSRG SATLSGGESQ RIRLASQIGS GLTGVLYVLD EPSIGLHQRD
NERLLGTLDR LKRLGNTLIV VEHDEDAIRS ADWLIDMGPG AGVNGGHVVA IGTPEEVAAN
PASLTGDYLS GRKRIDVPTV RRPIDPARML VLEGAGGNNL KDVTAHFPLG TFTCVTGVSG
GGKSTLVIDT LYKALSRQLM GSGQNPAPYR GIAGLDLLDK IIDIDQSPIG RTPRSNPATY
TDLFAPIRDW FAELPESKAR GYKAGRFSFN VKGGRCEACQ GDGVLKIEMH FLPDVFVTCD
TCKGARYNRE TLEVKFRGKS IADVLAMTVD EALPYFSAVP RIRDRLAILQ QVGLGYVALG
QQATTLSGGE AQRVKLSKEL ARRATGRTLY ILDEPTTGLH TEDVRKLLEV LHALVDQGNT
VVVIEHNLEV IKTADWVLDM GPEGGDGGGR IVAEGTPEDI AACPESHTGR FLRPLLPAAA
PKARRRRSKG
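
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a minimal sketch, the codon lookup below uses the amino-acid assignments of NCBI table 11, which are the same as those of the standard code (the two tables differ in their permitted start codons). The `translate` function is illustrative only and is not the tool the database used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence above.
print(translate("ATGGGTAATC GATCCAGG"))  # MGNRSR
```

The printed residues match the first six residues of the protein sequence listed above.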