Gene Gdia_2846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2846 
Symbol 
ID6976278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3110664 
End bp3113495 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content70% 
IMG OID643392354 
ProductDNA polymerase I 
Protein accessionYP_002277192 
Protein GI209544963 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.411951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGCG ACGCCCCGAT TCACCTGATC CTGGTCGACG GATCGGGGTT CATCTTCCGT 
GCCTTCCATG CCCTGCCGCC GATGACCGCG CCCGACGGCA CGCCCGTCAA CGCGGTGTTC
GGCTTCACGA ACATGCTGGC GCGCCTGCTG CGCGACCATG TGGGCACGCA TCTGGCGGTC
ATCTTCGATG CCGGGCGGGT GACCTTCCGC AACGGGATCT ATGACCAGTA CAAGGCCCAC
CGGCCCGAAC CGCCCGAGGA ACTGCGCCCG CAATTCGCCC TGGTGCGCGA CGCCACGGCG
GCCTTCGGCG TGCCGGGGAT CGAGGAAGCG GGGTGGGAGG CCGACGACCT GATCGCGGCC
TATGCCCGCC TTGTGACCGA CGCGGGCGGG CGCTGCACCG TCGTATCGTC CGACAAGGAC
CTGATGCAGC TGATCCGCCC GGGCGTGGAG ATGCTGGACC CGATCCGCCA GAAGCCGATC
GGCCCGAAGG AGGTCGAGGC GAAGTTCGGC GTGACCCCGG ACAAGGTGAT CGACGTGCAG
GCGCTGATCG GCGATTCGGT CGACAACGTG CCGGGCGTGC CGGGAATCGG GCCCAAGACC
GCATCGGCGC TGATCGCGGA ATATGGCTCG CTGGACGCGA TCCTGGACGG GGCGCCGGCG
ATGAAGCCGT CCAAGCGGCG TGACAGCCTG ATCGAACATG CCGAGCGCGC GCGCCTGTCG
CGGGTGCTGG TGACGCTGCG CGAGGACGCG CCCTGCCCCC TGGGGCTGGA AGACCTGATC
TGCCGCGATC CGGACGAGGT GAAGCTGGGC GAATGGCTGC AGTCCATGGG CTTCCGCTCC
CTGCTGCACC GGATGGGGAT GAAGGAGTCG GCCGCCGCCG TGGGGGTGGC CAGTTCCGCC
CCGGCGCCTG CCGCCGCTGT TGCCGCCGCG CCGACGCCTT CGCCCGACCG GGCGCCCTAT
GGCCCCTATG AGACCGTGAG GACGGCCGGG GCGCTGGAGA CGTGGGTGGC CGAGGCCCGC
ACGGCCGGGT TCTGCGCCAT CGACACCGAA ACCGACGGGC TGGACCCGCT GCGCGCCGGG
CTGGTCGGGA TTTCGCTGGC CGTGGCGCCG GGACGGGCCT GCTATATCCC GCTGGCGCAC
ACGGCCGAGC CCCCGCCGCC GCAATTGCTG CCCGACCTGC TGCTTGAGCC GGCGCCGCAG
GACGACGCTC CGGACCCGGT CGGGCCGCAA CTGGAAACCG GCCTGGCGCT GGCGATCCTG
GGGCCGCTGC TGGCCGACGC GTCGGTGCTG AAGATCTTCC AGAACGCGAA GTTCGACCTG
CTGGTGCTGA CGCGCGCGGG CGCGCCCCAG CCGGCGCCGG TCGACGACAC CATGCTGATC
TCCTACGCCC AGTTCGCGGG GCGGCATGGC CAGGGCATGG ACGAACTGTC GCGGCTGTAT
CTGGGTCATA CGCCGATCCC CTATGACGAG GTCACGGGGA CGGGACGCAA CCGCGTGCCG
TTCGCGCGGG TCGATGTGGC ACGCGCCACC GCCTATGCCG CCGAGGATGC CGACGTGACG
CTGCGGCTGT GGCTGTCGCT GCGCCCCACC CTGCGCATCC ATCATGCCCT GGCGCTGTAC
GAGGAACTGG AACGGCCGCT GGTCGCCGTG CTGGCGGACA TGGAGCGCGC GGGCATTGCC
ATCGACGTGG TGGAACTGCG CCGCATGTCG GCCGATTTCG CGACCCGCAT GGCGCAGATG
GAGGCCGAGA TCCAGGCCCT GGCCGGCCGG TCGTTCAATG TCGGCTCGCC CAAGCAACTG
GGCGAGATCC TGTTCGACGA GATGGGGCTG CCCGGCGGCA AGCGGATGAA ATCCGGCGCC
TGGGGGACCG ATTCCTCAGT CCTGCAGGAC CTGGCCGACC AGGGGCACGA CCTGCCGGGG
CGCATCCTGG CGTGGCGGCA ACTGGCCAAG CTGAAATCGA CCTATGCCGA CGCGCTGGTC
AAGCAGGCGG ACCCGGACAC CGCGCGGGTC CATACCTCGT TCCAGATGGC GATCACCTCG
ACCGGCCGCC TGTCGTCCAA CGAGCCGAAC CTGCAGAACA TCCCCATCCG CACCGAGGAG
GGCGGGCGCA TCCGCCGCGC CTTCGTCGCA GCACCCGGCC ATGTGCTGCT GTCGGCCGAT
TATTCGCAGA TCGAACTGCG GCTGCTGGCG CATGTCGCGG ACATTCCCGC CCTGCGCGAG
GCCTTCGCGC TGGGGCAGGA CATCCATGCC CGCACGGCGT CCGAGGTCTT CGGCATTCCG
ATCGAGGGCA TGGACCCGCT GACCCGCCGG CGGGCGAAGG CGATCAATTT CGGCATCATC
TACGGCATCA GCGCCTTCGG CCTGGGGCGG CAGCTCGGCA TTCCGCCGGG CGAGGCGCGG
GCCTATATCG ACGCCTATTT CGCCCGCTAT CCCGGCATCC GCGCCTATAT GGAAACGGTG
AAGCAGGAGG CGAAGGACCA GGGCTACGTC ACCACGCCGT TCGGCCGGCG CTGCTGGGTG
CCCGGGATCG CCGACCGCAG CGCCGTCCGC CGCGCCTATG CCGAGCGGCA GGCGATCAAC
GCGCCGCTGC AGGGCGGGGC GGCGGACATC ATCAAGCGCG CGATGGTGCG CCTGCCCCAC
GCGCTGGCGT CGGCCGGGCT TGACGGACGG CTGCTGCTTC AGGTCCATGA CGAACTTCTG
TTCGAGGTGC GGGCGGGCCA GCAGGACGCG CTGTCCGCCC TCGTGAAACA GGTCATGGAA
TCCGCGGCCA GCCTGTCGGT GCCGCTGGTG GTGGAAACCG GGACCGGAGC AAACTGGGCG
GACGCGCATT GA
 
Protein sequence
MSGDAPIHLI LVDGSGFIFR AFHALPPMTA PDGTPVNAVF GFTNMLARLL RDHVGTHLAV 
IFDAGRVTFR NGIYDQYKAH RPEPPEELRP QFALVRDATA AFGVPGIEEA GWEADDLIAA
YARLVTDAGG RCTVVSSDKD LMQLIRPGVE MLDPIRQKPI GPKEVEAKFG VTPDKVIDVQ
ALIGDSVDNV PGVPGIGPKT ASALIAEYGS LDAILDGAPA MKPSKRRDSL IEHAERARLS
RVLVTLREDA PCPLGLEDLI CRDPDEVKLG EWLQSMGFRS LLHRMGMKES AAAVGVASSA
PAPAAAVAAA PTPSPDRAPY GPYETVRTAG ALETWVAEAR TAGFCAIDTE TDGLDPLRAG
LVGISLAVAP GRACYIPLAH TAEPPPPQLL PDLLLEPAPQ DDAPDPVGPQ LETGLALAIL
GPLLADASVL KIFQNAKFDL LVLTRAGAPQ PAPVDDTMLI SYAQFAGRHG QGMDELSRLY
LGHTPIPYDE VTGTGRNRVP FARVDVARAT AYAAEDADVT LRLWLSLRPT LRIHHALALY
EELERPLVAV LADMERAGIA IDVVELRRMS ADFATRMAQM EAEIQALAGR SFNVGSPKQL
GEILFDEMGL PGGKRMKSGA WGTDSSVLQD LADQGHDLPG RILAWRQLAK LKSTYADALV
KQADPDTARV HTSFQMAITS TGRLSSNEPN LQNIPIRTEE GGRIRRAFVA APGHVLLSAD
YSQIELRLLA HVADIPALRE AFALGQDIHA RTASEVFGIP IEGMDPLTRR RAKAINFGII
YGISAFGLGR QLGIPPGEAR AYIDAYFARY PGIRAYMETV KQEAKDQGYV TTPFGRRCWV
PGIADRSAVR RAYAERQAIN APLQGGAADI IKRAMVRLPH ALASAGLDGR LLLQVHDELL
FEVRAGQQDA LSALVKQVME SAASLSVPLV VETGTGANWA DAH