Gene Gdia_0046 details


Gene Information

Locus tag: Gdia_0046
Symbol: uvrC
ID: 6973435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 52770
End bp: 54659
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 68%
IMG OID: 643389579
Product: excinuclease ABC subunit C
Protein accession: YP_002274463
Protein GI: 209542234
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
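
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(52770, 54659)
print(length)                  # 1890
print(protein_length(length))  # 629
```

Both values agree with the Gene Length and Protein Length fields of this record.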


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.714841
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.00962006
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATGCCG ATGTCGTGCC GGCCTCGTCG GTGAAGGGGG TCGAGGCGAT CCTGCTGGCC 
CTGCAGACCA TGCCGCTTTC CCCGGGCGTC TATCGGATGC TGGGCGAGAA GGGCGAGGTC
CTGTACGTGG GCAAGGCCCG GATCCTGAAG CGGCGGGTGA CGTCGTACAC GCATCTGAGC
AAGCTGCCCG AGCGCCTGCG CCGCATGGTG TCCGAGACGG TCACGATGGA GATCGTGACC
ACGCACACCG AGGCCGAGGC CCTGCTGCTG GAAGCCAACT ACATCAAGCG CATGAAGCCG
CGCTTCAATA TCCTGCTGCG CGACGACAAG AGCTATCCCT GGATCATGCT GACGGATGGC
GCGGAATTCC CCCAGGTGAC CAAGCATCGG GGCAAGCCGG TCAAGGGGGC GTCGTACTGG
GGACCGTTCG CCTCGGCCTG GGCGGTCAAC CAGACGCTGA ACCTGATCCA GCGGGTCTTC
CTGCTGCGCA CCTGTTCGGA TTCGGTGTTC GCGTCCCGGA CGCGGCCCTG CCTGCTGTTC
CAGATCAAGC GCTGTTCCGC CCCCTGCGTC GCGCGCATCG GTCAGGCGGA ATACGCCCAT
CTGGTCGAGC AGGCGCGGGC CTTCCTGTCC GGCCAGCGCG GCGGCATCCG CGAGGAACTG
GTGCGCGAGA TGGAGGCGGC GGCGGCCTCG CTGGAATTCG AGCGGGCGGC GACCATTCGC
GACCGTATCC GTGGCTTCGC CGCGATGCAG GATTCCTCGG TCATCAACCC GGCGTCGCTG
GATGACGCGG ATATCGTGGC GATCGCGCAG GCGGCGGGGC ATTCCTGCAT CCAGGTCTTC
TTCATCCGTG GCGGGCGCAA TAACGGCAAC CGCGCCTTCT TCCCCGCCCA TGCGCGCGAC
GAGGCGGCGC CGGACATCGT GGGCGCGTTC CTGGCGCAGT TCTATGACGA CAAGCCGCCG
CCGGCGCAGA TCCTGCTGAA CTGCGAGATC GCGGAACACG ACCTGATGGC CGATGCCCTG
GGGATCAAGC GCGGCCGCAA GGTCGAAATC CTGGTGCCGA AGCGGGGCGA GAAGCGGGCT
GTGGTCGAAC ACGCCGAAAC GAACGCCCGC GAGGCGCTGG AGCGCAAGCT GGCCGAAAGC
ACGGCCCAGG CGCGCCTGCT GGAGGGCATG GCCGACCTGT TCGGACTGGA TGCCCCCCCG
CGGCGGATCG AGATCTACGA CAACAGCCAT ATCATGGGGA CCAATGCCTA TGGCGTCATG
GTGGTGGCGG GACCCGAGGG CTTCGACAAG CGCAGCTACC GCAAGTTCTC GATCCGTGGC
CCGATCACGC CGGGCGACGA TTTCGCGATG ATGCGCGAGG TCCTGGAACG GCGCTTCAGC
CGTGCCCTGC GCGAGCGCGA CGACTCTGCC GGCGCGTCCA GTCCGGCCGA CTGGCCGGAT
ATCGTGCTGA TCGACGGCGG CGCCGGCCAG TATTCGGCAG TACGGGCCGT GCTGGACGAA
CTGGGGGTGA CGGACGTGAC CCTGGTCGCC ATCGCCAAGG GGCCGGACCG CGATGCGGGG
CGAGAGTGGT TCCATATGGC GGATCGGCCG CCCTTCCAGT TACCGCCGCG CGATCCGGTG
CTGTACTACC TGCAGCGGTT GCGGGACGAG GCCCACCGCT TCGCCATCAC CACCCACCGG
GCGGGCCGGT CCAAGGCCCT GGTGAAATCG GAACTGGACG AGATCCCGGG CGTGGGCGCG
GCGCGCAAGC GGGCGCTGCT GAACCAGTTC GGGTCCGCCC GCGGCGTGCG GCAGGCGGGG
CTGGCGGAAC TGGAGGCCAC GCAAGGGATC AATCGCGAAA CCGCGCGCGT TGTCTACGGG
CACTTCCATC CGGGCTGGAC CGGCGCCTGA
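
A GC content figure like the one reported for this gene can be recomputed from a sequence listing of this kind. The sketch below is one plausible way to do it, assuming the sequence is pasted as space-separated 10-base blocks as shown above; `gc_fraction` is a hypothetical helper name, and the database may round or compute its own value differently.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above.
first_block = "ATGGATGCCG"
print(gc_fraction(first_block))  # 0.6
```

Passing the full gene sequence as a single string gives the fraction for the whole gene.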
 
Protein sequence
MDADVVPASS VKGVEAILLA LQTMPLSPGV YRMLGEKGEV LYVGKARILK RRVTSYTHLS 
KLPERLRRMV SETVTMEIVT THTEAEALLL EANYIKRMKP RFNILLRDDK SYPWIMLTDG
AEFPQVTKHR GKPVKGASYW GPFASAWAVN QTLNLIQRVF LLRTCSDSVF ASRTRPCLLF
QIKRCSAPCV ARIGQAEYAH LVEQARAFLS GQRGGIREEL VREMEAAAAS LEFERAATIR
DRIRGFAAMQ DSSVINPASL DDADIVAIAQ AAGHSCIQVF FIRGGRNNGN RAFFPAHARD
EAAPDIVGAF LAQFYDDKPP PAQILLNCEI AEHDLMADAL GIKRGRKVEI LVPKRGEKRA
VVEHAETNAR EALERKLAES TAQARLLEGM ADLFGLDAPP RRIEIYDNSH IMGTNAYGVM
VVAGPEGFDK RSYRKFSIRG PITPGDDFAM MREVLERRFS RALRERDDSA GASSPADWPD
IVLIDGGAGQ YSAVRAVLDE LGVTDVTLVA IAKGPDRDAG REWFHMADRP PFQLPPRDPV
LYYLQRLRDE AHRFAITTHR AGRSKALVKS ELDEIPGVGA ARKRALLNQF GSARGVRQAG
LAELEATQGI NRETARVVYG HFHPGWTGA
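
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal Python sketch, assuming the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at a stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGGATGCCG ATGTCGTGCC GGCCTCGTCG GTGAAGGGGG TCGAGGCGAT CCTGCTGGCC"
print(translate(first_line))  # MDADVVPASSVKGVEAILLA
```

The output matches the first 20 residues of the protein sequence listed above.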