Gene Information |
Locus tag | Gdia_2846 |
Symbol | |
ID | 6976278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3110664 |
End bp | 3113495 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643392354 |
Product | DNA polymerase I |
Protein accession | YP_002277192 |
Protein GI | 209544963 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
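The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the reported gene length only matches under that reading) and that the final codon of the CDS is a stop codon that contributes no residue. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table.
length = gene_length(3110664, 3113495)
print(length, protein_length(length))
```

With the values from this record, the two results agree with the reported Gene Length (2832 bp) and Protein Length (943 aa).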
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.411951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGCG ACGCCCCGAT TCACCTGATC CTGGTCGACG GATCGGGGTT CATCTTCCGT GCCTTCCATG CCCTGCCGCC GATGACCGCG CCCGACGGCA CGCCCGTCAA CGCGGTGTTC GGCTTCACGA ACATGCTGGC GCGCCTGCTG CGCGACCATG TGGGCACGCA TCTGGCGGTC ATCTTCGATG CCGGGCGGGT GACCTTCCGC AACGGGATCT ATGACCAGTA CAAGGCCCAC CGGCCCGAAC CGCCCGAGGA ACTGCGCCCG CAATTCGCCC TGGTGCGCGA CGCCACGGCG GCCTTCGGCG TGCCGGGGAT CGAGGAAGCG GGGTGGGAGG CCGACGACCT GATCGCGGCC TATGCCCGCC TTGTGACCGA CGCGGGCGGG CGCTGCACCG TCGTATCGTC CGACAAGGAC CTGATGCAGC TGATCCGCCC GGGCGTGGAG ATGCTGGACC CGATCCGCCA GAAGCCGATC GGCCCGAAGG AGGTCGAGGC GAAGTTCGGC GTGACCCCGG ACAAGGTGAT CGACGTGCAG GCGCTGATCG GCGATTCGGT CGACAACGTG CCGGGCGTGC CGGGAATCGG GCCCAAGACC GCATCGGCGC TGATCGCGGA ATATGGCTCG CTGGACGCGA TCCTGGACGG GGCGCCGGCG ATGAAGCCGT CCAAGCGGCG TGACAGCCTG ATCGAACATG CCGAGCGCGC GCGCCTGTCG CGGGTGCTGG TGACGCTGCG CGAGGACGCG CCCTGCCCCC TGGGGCTGGA AGACCTGATC TGCCGCGATC CGGACGAGGT GAAGCTGGGC GAATGGCTGC AGTCCATGGG CTTCCGCTCC CTGCTGCACC GGATGGGGAT GAAGGAGTCG GCCGCCGCCG TGGGGGTGGC CAGTTCCGCC CCGGCGCCTG CCGCCGCTGT TGCCGCCGCG CCGACGCCTT CGCCCGACCG GGCGCCCTAT GGCCCCTATG AGACCGTGAG GACGGCCGGG GCGCTGGAGA CGTGGGTGGC CGAGGCCCGC ACGGCCGGGT TCTGCGCCAT CGACACCGAA ACCGACGGGC TGGACCCGCT GCGCGCCGGG CTGGTCGGGA TTTCGCTGGC CGTGGCGCCG GGACGGGCCT GCTATATCCC GCTGGCGCAC ACGGCCGAGC CCCCGCCGCC GCAATTGCTG CCCGACCTGC TGCTTGAGCC GGCGCCGCAG GACGACGCTC CGGACCCGGT CGGGCCGCAA CTGGAAACCG GCCTGGCGCT GGCGATCCTG GGGCCGCTGC TGGCCGACGC GTCGGTGCTG AAGATCTTCC AGAACGCGAA GTTCGACCTG CTGGTGCTGA CGCGCGCGGG CGCGCCCCAG CCGGCGCCGG TCGACGACAC CATGCTGATC TCCTACGCCC AGTTCGCGGG GCGGCATGGC CAGGGCATGG ACGAACTGTC GCGGCTGTAT CTGGGTCATA CGCCGATCCC CTATGACGAG GTCACGGGGA CGGGACGCAA CCGCGTGCCG TTCGCGCGGG TCGATGTGGC ACGCGCCACC GCCTATGCCG CCGAGGATGC CGACGTGACG CTGCGGCTGT GGCTGTCGCT GCGCCCCACC CTGCGCATCC ATCATGCCCT GGCGCTGTAC GAGGAACTGG AACGGCCGCT GGTCGCCGTG CTGGCGGACA TGGAGCGCGC GGGCATTGCC ATCGACGTGG TGGAACTGCG CCGCATGTCG GCCGATTTCG CGACCCGCAT GGCGCAGATG GAGGCCGAGA TCCAGGCCCT GGCCGGCCGG TCGTTCAATG TCGGCTCGCC CAAGCAACTG 
GGCGAGATCC TGTTCGACGA GATGGGGCTG CCCGGCGGCA AGCGGATGAA ATCCGGCGCC TGGGGGACCG ATTCCTCAGT CCTGCAGGAC CTGGCCGACC AGGGGCACGA CCTGCCGGGG CGCATCCTGG CGTGGCGGCA ACTGGCCAAG CTGAAATCGA CCTATGCCGA CGCGCTGGTC AAGCAGGCGG ACCCGGACAC CGCGCGGGTC CATACCTCGT TCCAGATGGC GATCACCTCG ACCGGCCGCC TGTCGTCCAA CGAGCCGAAC CTGCAGAACA TCCCCATCCG CACCGAGGAG GGCGGGCGCA TCCGCCGCGC CTTCGTCGCA GCACCCGGCC ATGTGCTGCT GTCGGCCGAT TATTCGCAGA TCGAACTGCG GCTGCTGGCG CATGTCGCGG ACATTCCCGC CCTGCGCGAG GCCTTCGCGC TGGGGCAGGA CATCCATGCC CGCACGGCGT CCGAGGTCTT CGGCATTCCG ATCGAGGGCA TGGACCCGCT GACCCGCCGG CGGGCGAAGG CGATCAATTT CGGCATCATC TACGGCATCA GCGCCTTCGG CCTGGGGCGG CAGCTCGGCA TTCCGCCGGG CGAGGCGCGG GCCTATATCG ACGCCTATTT CGCCCGCTAT CCCGGCATCC GCGCCTATAT GGAAACGGTG AAGCAGGAGG CGAAGGACCA GGGCTACGTC ACCACGCCGT TCGGCCGGCG CTGCTGGGTG CCCGGGATCG CCGACCGCAG CGCCGTCCGC CGCGCCTATG CCGAGCGGCA GGCGATCAAC GCGCCGCTGC AGGGCGGGGC GGCGGACATC ATCAAGCGCG CGATGGTGCG CCTGCCCCAC GCGCTGGCGT CGGCCGGGCT TGACGGACGG CTGCTGCTTC AGGTCCATGA CGAACTTCTG TTCGAGGTGC GGGCGGGCCA GCAGGACGCG CTGTCCGCCC TCGTGAAACA GGTCATGGAA TCCGCGGCCA GCCTGTCGGT GCCGCTGGTG GTGGAAACCG GGACCGGAGC AAACTGGGCG GACGCGCATT GA
|
Protein sequence | MSGDAPIHLI LVDGSGFIFR AFHALPPMTA PDGTPVNAVF GFTNMLARLL RDHVGTHLAV IFDAGRVTFR NGIYDQYKAH RPEPPEELRP QFALVRDATA AFGVPGIEEA GWEADDLIAA YARLVTDAGG RCTVVSSDKD LMQLIRPGVE MLDPIRQKPI GPKEVEAKFG VTPDKVIDVQ ALIGDSVDNV PGVPGIGPKT ASALIAEYGS LDAILDGAPA MKPSKRRDSL IEHAERARLS RVLVTLREDA PCPLGLEDLI CRDPDEVKLG EWLQSMGFRS LLHRMGMKES AAAVGVASSA PAPAAAVAAA PTPSPDRAPY GPYETVRTAG ALETWVAEAR TAGFCAIDTE TDGLDPLRAG LVGISLAVAP GRACYIPLAH TAEPPPPQLL PDLLLEPAPQ DDAPDPVGPQ LETGLALAIL GPLLADASVL KIFQNAKFDL LVLTRAGAPQ PAPVDDTMLI SYAQFAGRHG QGMDELSRLY LGHTPIPYDE VTGTGRNRVP FARVDVARAT AYAAEDADVT LRLWLSLRPT LRIHHALALY EELERPLVAV LADMERAGIA IDVVELRRMS ADFATRMAQM EAEIQALAGR SFNVGSPKQL GEILFDEMGL PGGKRMKSGA WGTDSSVLQD LADQGHDLPG RILAWRQLAK LKSTYADALV KQADPDTARV HTSFQMAITS TGRLSSNEPN LQNIPIRTEE GGRIRRAFVA APGHVLLSAD YSQIELRLLA HVADIPALRE AFALGQDIHA RTASEVFGIP IEGMDPLTRR RAKAINFGII YGISAFGLGR QLGIPPGEAR AYIDAYFARY PGIRAYMETV KQEAKDQGYV TTPFGRRCWV PGIADRSAVR RAYAERQAIN APLQGGAADI IKRAMVRLPH ALASAGLDGR LLLQVHDELL FEVRAGQQDA LSALVKQVME SAASLSVPLV VETGTGANWA DAH
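The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the pipeline that produced this record: it uses the standard codon assignments, which translation table 11 shares for every codon (table 11 differs from the standard code in its permitted start codons, which does not matter here because this gene starts with ATG). The function name `translate` and the handling of the space-separated 10-base blocks are assumptions made for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks and the last 12 bases of the gene sequence above.
print(translate("ATGTCCGGCG ACGCCCCGAT TCACCTGATC"))
print(translate("GACGCGCATT GA"))
```

The two calls translate the first 30 bases and the last 12 bases of the gene sequence, which correspond to the first block (MSGDAPIHLI) and the final three residues (DAH) of the protein sequence.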
|
| |
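The GC content field in the Gene Information table can in principle be recomputed from the gene sequence. The sketch below assumes GC content is defined as the fraction of G and C bases over all bases in the sequence, with the spaces between 10-base blocks ignored; the record itself does not state how the value was computed or rounded, and `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First 10-base block of the gene sequence above.
print(gc_fraction("ATGTCCGGCG"))
```

Passing the full gene sequence instead of the single block gives the value to compare against the reported GC content.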