Gene Gdia_3194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3194 
Symbol 
ID6976634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3499390 
End bp3500676 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content72% 
IMG OID643392707 
ProductUracil-DNA glycosylase superfamily 
Protein accessionYP_002277539 
Protein GI209545310 
COG category[L] Replication, recombination and repair 
COG ID[COG1573] Uracil-DNA glycosylase 
TIGRFAM ID[TIGR00758] uracil-DNA glycosylase, family 4 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.4407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCACG CGCACGCCAG AGTGACGCTG GCCCATGAGG TCGATTTCGC GGGCTGGCGA 
ACCGCGACGC GCCGCCTGGT CATGGCCGGC CAGCCCGCCG GGCGGATTGC GTGGCGAATC
GACGCACGCA TGGCCGACCC GATCGAGGAC CAGGGCGCCG AAGCACCGGA GGGCGACCCT
TTTCACGTAT CCCGCGCCCT GCTGGACCTT GCCGAGACCG TCATCCAGGC GCGCGATCCC
GACCGTTTCG CGCTGCTCTA CCGGCTGGTC CAGCGCAACG CGGCCGGCGA ACGCGACCTG
CCGACCCGCA CCGACGACAC CGACATCGCC CGCGCGCTGG TCCTGGCCCA GGCGGTGCGG
GACGATACCG CCCGCCTGCG CGCGCATCTG GAGCTGCCCG CCGAGGACGT GGCCATCGGG
CAGTGGACTT CGGAGTCCCA CGTCCTCGTC CCCAATGCGC GATTCCTGAC CGTTCAGCGT
TCGCTGCGCC CCTGGGCGGT CTCGACGCCG GACGAAACCC TTCTGTGGAC GGGCCGCAGC
CTGCATCTGC TGCCGCCGGG CACCGCCCCG TCCGCCCTGC CCACGGACAG GGAGTCCTGG
GAGGGGACGG GCCTGACGCT GCGCCCCGCC GATCTGCCGC GTCCGGCCCT GCGCCTGACC
GAGGGACTGG ATATCGCACG GATCGACAGC CTGCCGGCCC TGATCGCGGC GGCGCGCGAC
TGCCTGATCT GCGCGATGGC ACGCCAGACG ACGCAGACGG TCTTCAGCGA TGGGCGCCCG
GGGGCCGCGC TGATGCTGGT GGGCGAACAG CCGGGCGACC AGGAGGACCG CGTCGGCCGG
CCTTTCGTCG GGCCTGCCGG GCAATTGCTG GACCGCGCCC TGCACGAGGC CAGAATTACC
CGCGATGCCG TCTACGTCAC CAACGCGGTC AAGCATTTCC GCTTCCAGAG GCGCGGCACC
CGGCGCCTGC ATGAAAAGCC GACCGTGGAA AACGTTACGG CCTGCGCGCC CTGGCTGGCG
GCGGAACGGC GCATCGTGGC GCCGCGCGTG CTGGTGATGC TGGGCGCCAC CGCCGCCGGT
GCCGTCCTGG GCCGCAGCGT CACGATCGGG CGCGAGCGGT CGCGCCCGAT CCCACTGGCC
GACGGGTCGA CCGGGCTGGT CACCGTTCAT CCGTCCTTCC TGCTGCGCCA GCCGGACGAG
GATGCACGCA CCCGGGAATA TGCCCGCTTC GTGGCCGATC TGCGCCTGGC GCGGGACCTT
CTGCCGGCGG CTACGCTCCC GTCCTGA
 
Protein sequence
MPHAHARVTL AHEVDFAGWR TATRRLVMAG QPAGRIAWRI DARMADPIED QGAEAPEGDP 
FHVSRALLDL AETVIQARDP DRFALLYRLV QRNAAGERDL PTRTDDTDIA RALVLAQAVR
DDTARLRAHL ELPAEDVAIG QWTSESHVLV PNARFLTVQR SLRPWAVSTP DETLLWTGRS
LHLLPPGTAP SALPTDRESW EGTGLTLRPA DLPRPALRLT EGLDIARIDS LPALIAAARD
CLICAMARQT TQTVFSDGRP GAALMLVGEQ PGDQEDRVGR PFVGPAGQLL DRALHEARIT
RDAVYVTNAV KHFRFQRRGT RRLHEKPTVE NVTACAPWLA AERRIVAPRV LVMLGATAAG
AVLGRSVTIG RERSRPIPLA DGSTGLVTVH PSFLLRQPDE DARTREYARF VADLRLARDL
LPAATLPS