Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2909 |
Symbol | polA |
ID | 5710760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3067301 |
End bp | 3070099 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641268835 |
Product | DNA polymerase I |
Protein accession | YP_001534243 |
Protein GI | 159045449 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.033314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.843304 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTCG GCAAGGGACA TCACCTGCAT CTGATCGACG GATCGGCGTT CATTTTCCGC GCCTATCACG CGTTGCCGCC GCTGACACGG AAGTCCGACG GGCTGCCCGT CGGGGCGGTG TCGGGCTTCT GCAACATGCT GCAGCGCTAT GTCGAAGGGA ACGCCGGGGG CGACGTGACC CATGTGGCGG TGATCTTCGA CAAGGGCAGC CATACCTTTC GCAATGATCT TTACGACCAG TACAAGGCCA ACCGCGAGGC GATGCCCGAG GACCTGCGCC CGCAGATCCC CCTGACCCGG ACCGCGACCG AGGCGTTCAA CATCGCCTGC AAGGAGATGG AGGGGTTCGA GGCGGACGAT ATCATCGCGA CGCTGGCCTG CCAGGCGCGC GAGGCCGGGG GGCGCTGCAC GATCATCAGC TCGGACAAGG ACCTGATGCA GCTGGTCGGC GGCGGGGTCG AGATGCTGGA TGCGATGAAG AACAAGACCA TCGACGTGGA CGGGGTGTTC GAGAAATTCG GCGTGGGCCC CGACCGGGTG GTCGATGTGC AGGCGCTGGC CGGGGATTCG GTGGACAACG TGCCCGGCGC GCCGGGGATC GGGATCAAGA CCGCGGCGCT GCTGATCAAC GAGTATGGCT CGCTCGAGGA GCTTCTGGAC CGGGCCGGGG AGATCAAGCA GCCCAAGCGC CGCCAGACCC TGATCGAGCA TCGCGCCCAG ATCGAGCTGT CGAAGCGGCT GGTGCAGCTC GATTGCGACA TGGAGCTGGA TTTCACAATC GAGGATCTGG AGGTGCGCGA TCCGGAGCCT GAGACCCTGC TGGGGTTTTT GGCGGAGATG GAGTTCCGCA CGCTGACAAA GCGGATCTCG GGCGCGTTGG GGGTGGAGGC GCCCGCCGTG CCGGAGCCCG CAGCGACGAC GGCCTCGCCA GAGGCGGAGG AGCACCCCCC CCTGAGCGCC TCGGAGCATG TCACCATCCG CGACGCGGAA ACCCTGCAAA GCTGGATCGA CGCGATCCAT GCGCGCGGGG TGGTGGCGGT GGATACCGAG ACCACGGGCC TCAACGAGAT GCGCGCGGAT CTGGTGGGCG TGTCGCTCTG CGTGGATCCG GGGCGCGCGG CCTACCTGCC GCTGGCGCAC AAGGACGGTG GCGGGGCCGA CGACCTGTTC GGCGGGGAGG CGAAGCTGGC CGAGGGGCAG CTGGATTTTG AGACCGCGCT TGGGATGCTC AAGCCCGTGC TGGAGGACCC TGCGGTCCTG AAGATCGGGC AGAACATGAA ATACGATGCC AAGATCCTCG CCCGGAACGG GATCACGGTG GCGCCCATCG ACGACACGAT GCTGCTGAGC TACGCGCTCC ATGCCGGGTT GCACGGGCAC GGGATGGACG CGCTGTCGGA GCGGTATCTC GACCACACGC CGATCCCGAT CAAGACCCTG CTGGGCACCG GCAAATCCGC GATCACTTTC GATTTCGTGC CCATCGAGGA GGCGACGAAA TACGCCGCCG AGGATGCGGA AATCACGCTG CGCCTGTGGC AGCGCTTGAA GCCGCGCCTG CATCTGGCGC AGGTCACCCG CGTCTACGAG TGGATGGAGC GGCCCATGGT GCCGGTTCTG GCCGAGATGG AGATGCGCGG GATCAAGGTC GATCGCGACA CCCTGAGCCG GATGTCGAAC GCGTTCGCGC AGAAGATGGC CGGGCTGGAG GCGGAAATCC ACGAACTGGC GGGGGAGAGT TTCAACGTGG GCAGCCCCGC CCAGCTGGGC GAGATCCTGT TCGAGAAGAT GGGCTTCGAG GGGGGCAAGA CCGGCAAGTC GGGCAAGTAT TCGACCCCGG CGGATGTGCT GGAGGATCTC GCGACCGAGC ATGACCTGCC GCGCCGGGTG CTCGACTGGC GGCAGCTGTC GAAGTTGAAA TCGACCTACA CGGACGCCTT GCAGGATCAT ATCAACCCCG AGACCGGGCG GGTGCATACG TCCTACTCGA TTGCCGGGGC GAATACGGGG CGGTTGGCTT CGACCGATCC GAACCTGCAG AACATCCCCG TGCGCTCCGA GGAGGGGCGG CGCATTCGCG AGGCCTTCGT GGCGGAGCCG GGGCATGTGC TCGTGTCGCT CGACTATAGC CAGATCGAGC TGCGGATCCT CGCCCATATC GCCGGGATCG ACGCGCTGAA GACCGCGTTC CGGGACGGGC TGGACATTCA TGCGATGACC GCGTCGGAGA TGTTCGATGT GCCGCTGGAC GAGATGACGC CGGAGGTGCG GCGCCAGGCC AAGGCGATCA ATTTCGGGGT GATCTACGGG ATTTCGGGCT TTGGCCTCGC GCGGAACCTG CGGATCCCGC GGGCGGAGGC GCAGGCCTTC ATCGACACCT ATTTCGAGCG GTTCCCCGGC ATCCGGGCCT ATATGGACGA CACGGTCGCC TTCGCGAAGG AGCATAAATA CGTGCAGACC CTGTTCGGGC GGAAGATCCA CACGCCGGAG ATCGGCGCGA AGGGGCCCCA GGCGGGCTTC GCCAAGCGCG CGGCGATCAA CGCGCCGATC CAGGGAACGG CGGCGGATAT CATCCGGCGC GCGATGGTGC GGATGCCCGC GGCGATTGCC GACCTGCCCG CGCGGATGCT GCTCCAGGTC CATGACGAGC TGATCTTCGA GGTGGAGGAA GACGCGGTGG ACCGGGTGAT CCCGGCGGTG CGCCAGGTGA TGGAGGGGGC CGCCGCGCCG GTGGTGCATC TCGATGTGCC GCTGACGGTG GATGCGGGCC AGGGGCGAAG CTGGGCGGAG GCGCATTAG
|
Protein sequence | MAFGKGHHLH LIDGSAFIFR AYHALPPLTR KSDGLPVGAV SGFCNMLQRY VEGNAGGDVT HVAVIFDKGS HTFRNDLYDQ YKANREAMPE DLRPQIPLTR TATEAFNIAC KEMEGFEADD IIATLACQAR EAGGRCTIIS SDKDLMQLVG GGVEMLDAMK NKTIDVDGVF EKFGVGPDRV VDVQALAGDS VDNVPGAPGI GIKTAALLIN EYGSLEELLD RAGEIKQPKR RQTLIEHRAQ IELSKRLVQL DCDMELDFTI EDLEVRDPEP ETLLGFLAEM EFRTLTKRIS GALGVEAPAV PEPAATTASP EAEEHPPLSA SEHVTIRDAE TLQSWIDAIH ARGVVAVDTE TTGLNEMRAD LVGVSLCVDP GRAAYLPLAH KDGGGADDLF GGEAKLAEGQ LDFETALGML KPVLEDPAVL KIGQNMKYDA KILARNGITV APIDDTMLLS YALHAGLHGH GMDALSERYL DHTPIPIKTL LGTGKSAITF DFVPIEEATK YAAEDAEITL RLWQRLKPRL HLAQVTRVYE WMERPMVPVL AEMEMRGIKV DRDTLSRMSN AFAQKMAGLE AEIHELAGES FNVGSPAQLG EILFEKMGFE GGKTGKSGKY STPADVLEDL ATEHDLPRRV LDWRQLSKLK STYTDALQDH INPETGRVHT SYSIAGANTG RLASTDPNLQ NIPVRSEEGR RIREAFVAEP GHVLVSLDYS QIELRILAHI AGIDALKTAF RDGLDIHAMT ASEMFDVPLD EMTPEVRRQA KAINFGVIYG ISGFGLARNL RIPRAEAQAF IDTYFERFPG IRAYMDDTVA FAKEHKYVQT LFGRKIHTPE IGAKGPQAGF AKRAAINAPI QGTAADIIRR AMVRMPAAIA DLPARMLLQV HDELIFEVEE DAVDRVIPAV RQVMEGAAAP VVHLDVPLTV DAGQGRSWAE AH
|
| |