Gene Dshi_2909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2909 
SymbolpolA 
ID5710760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3067301 
End bp3070099 
Gene Length2799 bp 
Protein Length932 aa 
Translation table11 
GC content66% 
IMG OID641268835 
ProductDNA polymerase I 
Protein accessionYP_001534243 
Protein GI159045449 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.033314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.843304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTCG GCAAGGGACA TCACCTGCAT CTGATCGACG GATCGGCGTT CATTTTCCGC 
GCCTATCACG CGTTGCCGCC GCTGACACGG AAGTCCGACG GGCTGCCCGT CGGGGCGGTG
TCGGGCTTCT GCAACATGCT GCAGCGCTAT GTCGAAGGGA ACGCCGGGGG CGACGTGACC
CATGTGGCGG TGATCTTCGA CAAGGGCAGC CATACCTTTC GCAATGATCT TTACGACCAG
TACAAGGCCA ACCGCGAGGC GATGCCCGAG GACCTGCGCC CGCAGATCCC CCTGACCCGG
ACCGCGACCG AGGCGTTCAA CATCGCCTGC AAGGAGATGG AGGGGTTCGA GGCGGACGAT
ATCATCGCGA CGCTGGCCTG CCAGGCGCGC GAGGCCGGGG GGCGCTGCAC GATCATCAGC
TCGGACAAGG ACCTGATGCA GCTGGTCGGC GGCGGGGTCG AGATGCTGGA TGCGATGAAG
AACAAGACCA TCGACGTGGA CGGGGTGTTC GAGAAATTCG GCGTGGGCCC CGACCGGGTG
GTCGATGTGC AGGCGCTGGC CGGGGATTCG GTGGACAACG TGCCCGGCGC GCCGGGGATC
GGGATCAAGA CCGCGGCGCT GCTGATCAAC GAGTATGGCT CGCTCGAGGA GCTTCTGGAC
CGGGCCGGGG AGATCAAGCA GCCCAAGCGC CGCCAGACCC TGATCGAGCA TCGCGCCCAG
ATCGAGCTGT CGAAGCGGCT GGTGCAGCTC GATTGCGACA TGGAGCTGGA TTTCACAATC
GAGGATCTGG AGGTGCGCGA TCCGGAGCCT GAGACCCTGC TGGGGTTTTT GGCGGAGATG
GAGTTCCGCA CGCTGACAAA GCGGATCTCG GGCGCGTTGG GGGTGGAGGC GCCCGCCGTG
CCGGAGCCCG CAGCGACGAC GGCCTCGCCA GAGGCGGAGG AGCACCCCCC CCTGAGCGCC
TCGGAGCATG TCACCATCCG CGACGCGGAA ACCCTGCAAA GCTGGATCGA CGCGATCCAT
GCGCGCGGGG TGGTGGCGGT GGATACCGAG ACCACGGGCC TCAACGAGAT GCGCGCGGAT
CTGGTGGGCG TGTCGCTCTG CGTGGATCCG GGGCGCGCGG CCTACCTGCC GCTGGCGCAC
AAGGACGGTG GCGGGGCCGA CGACCTGTTC GGCGGGGAGG CGAAGCTGGC CGAGGGGCAG
CTGGATTTTG AGACCGCGCT TGGGATGCTC AAGCCCGTGC TGGAGGACCC TGCGGTCCTG
AAGATCGGGC AGAACATGAA ATACGATGCC AAGATCCTCG CCCGGAACGG GATCACGGTG
GCGCCCATCG ACGACACGAT GCTGCTGAGC TACGCGCTCC ATGCCGGGTT GCACGGGCAC
GGGATGGACG CGCTGTCGGA GCGGTATCTC GACCACACGC CGATCCCGAT CAAGACCCTG
CTGGGCACCG GCAAATCCGC GATCACTTTC GATTTCGTGC CCATCGAGGA GGCGACGAAA
TACGCCGCCG AGGATGCGGA AATCACGCTG CGCCTGTGGC AGCGCTTGAA GCCGCGCCTG
CATCTGGCGC AGGTCACCCG CGTCTACGAG TGGATGGAGC GGCCCATGGT GCCGGTTCTG
GCCGAGATGG AGATGCGCGG GATCAAGGTC GATCGCGACA CCCTGAGCCG GATGTCGAAC
GCGTTCGCGC AGAAGATGGC CGGGCTGGAG GCGGAAATCC ACGAACTGGC GGGGGAGAGT
TTCAACGTGG GCAGCCCCGC CCAGCTGGGC GAGATCCTGT TCGAGAAGAT GGGCTTCGAG
GGGGGCAAGA CCGGCAAGTC GGGCAAGTAT TCGACCCCGG CGGATGTGCT GGAGGATCTC
GCGACCGAGC ATGACCTGCC GCGCCGGGTG CTCGACTGGC GGCAGCTGTC GAAGTTGAAA
TCGACCTACA CGGACGCCTT GCAGGATCAT ATCAACCCCG AGACCGGGCG GGTGCATACG
TCCTACTCGA TTGCCGGGGC GAATACGGGG CGGTTGGCTT CGACCGATCC GAACCTGCAG
AACATCCCCG TGCGCTCCGA GGAGGGGCGG CGCATTCGCG AGGCCTTCGT GGCGGAGCCG
GGGCATGTGC TCGTGTCGCT CGACTATAGC CAGATCGAGC TGCGGATCCT CGCCCATATC
GCCGGGATCG ACGCGCTGAA GACCGCGTTC CGGGACGGGC TGGACATTCA TGCGATGACC
GCGTCGGAGA TGTTCGATGT GCCGCTGGAC GAGATGACGC CGGAGGTGCG GCGCCAGGCC
AAGGCGATCA ATTTCGGGGT GATCTACGGG ATTTCGGGCT TTGGCCTCGC GCGGAACCTG
CGGATCCCGC GGGCGGAGGC GCAGGCCTTC ATCGACACCT ATTTCGAGCG GTTCCCCGGC
ATCCGGGCCT ATATGGACGA CACGGTCGCC TTCGCGAAGG AGCATAAATA CGTGCAGACC
CTGTTCGGGC GGAAGATCCA CACGCCGGAG ATCGGCGCGA AGGGGCCCCA GGCGGGCTTC
GCCAAGCGCG CGGCGATCAA CGCGCCGATC CAGGGAACGG CGGCGGATAT CATCCGGCGC
GCGATGGTGC GGATGCCCGC GGCGATTGCC GACCTGCCCG CGCGGATGCT GCTCCAGGTC
CATGACGAGC TGATCTTCGA GGTGGAGGAA GACGCGGTGG ACCGGGTGAT CCCGGCGGTG
CGCCAGGTGA TGGAGGGGGC CGCCGCGCCG GTGGTGCATC TCGATGTGCC GCTGACGGTG
GATGCGGGCC AGGGGCGAAG CTGGGCGGAG GCGCATTAG
 
Protein sequence
MAFGKGHHLH LIDGSAFIFR AYHALPPLTR KSDGLPVGAV SGFCNMLQRY VEGNAGGDVT 
HVAVIFDKGS HTFRNDLYDQ YKANREAMPE DLRPQIPLTR TATEAFNIAC KEMEGFEADD
IIATLACQAR EAGGRCTIIS SDKDLMQLVG GGVEMLDAMK NKTIDVDGVF EKFGVGPDRV
VDVQALAGDS VDNVPGAPGI GIKTAALLIN EYGSLEELLD RAGEIKQPKR RQTLIEHRAQ
IELSKRLVQL DCDMELDFTI EDLEVRDPEP ETLLGFLAEM EFRTLTKRIS GALGVEAPAV
PEPAATTASP EAEEHPPLSA SEHVTIRDAE TLQSWIDAIH ARGVVAVDTE TTGLNEMRAD
LVGVSLCVDP GRAAYLPLAH KDGGGADDLF GGEAKLAEGQ LDFETALGML KPVLEDPAVL
KIGQNMKYDA KILARNGITV APIDDTMLLS YALHAGLHGH GMDALSERYL DHTPIPIKTL
LGTGKSAITF DFVPIEEATK YAAEDAEITL RLWQRLKPRL HLAQVTRVYE WMERPMVPVL
AEMEMRGIKV DRDTLSRMSN AFAQKMAGLE AEIHELAGES FNVGSPAQLG EILFEKMGFE
GGKTGKSGKY STPADVLEDL ATEHDLPRRV LDWRQLSKLK STYTDALQDH INPETGRVHT
SYSIAGANTG RLASTDPNLQ NIPVRSEEGR RIREAFVAEP GHVLVSLDYS QIELRILAHI
AGIDALKTAF RDGLDIHAMT ASEMFDVPLD EMTPEVRRQA KAINFGVIYG ISGFGLARNL
RIPRAEAQAF IDTYFERFPG IRAYMDDTVA FAKEHKYVQT LFGRKIHTPE IGAKGPQAGF
AKRAAINAPI QGTAADIIRR AMVRMPAAIA DLPARMLLQV HDELIFEVEE DAVDRVIPAV
RQVMEGAAAP VVHLDVPLTV DAGQGRSWAE AH