Gene Dfer_1096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_1096 
Symbol 
ID8224667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp1296792 
End bp1299683 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content55% 
IMG OID644928958 
ProductDNA polymerase I 
Protein accessionYP_003085510 
Protein GI255034889 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0889452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.949667 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAC CCATTCATAA GCTGTTCCTG CTGGACGCTA TGGCCTTGAT TTACAGGGCA 
CATTTCGCGT TCATCAAGGC GCCGCGCATC ACTTCAAAGG GACTGAATAC CAGCGCCGTT
TTCGGCTTTA CCAACACACT GCTGGAAGTA CTGCAAAAAG AGAAACCTAC CCATATCGGC
GTTGCATTCG ATACCGCCGC GCCGACATTC CGCCATGTGC AGTTTGAAGC CTACAAGGCG
CAGCGCGAGG CGCAGCCGGA GGATATTACC GTAGCGATCC CACTCGTGAA GCGGCTTTTG
CGCGGTATGT GCATTCCCAT TCTCGAACTC GACGGCTACG AGGCAGATGA CATCATCGGC
ACCATTGCCA AGGAGGCTTC GCGGGAAGGT TTCGAGGTTT ACATGATGAC GCCCGACAAG
GATTACGGGC AATTGGTCGA ACAATACATT CACATTTACA AACCCGCATT CCTCGGCAAA
GGCGCCGAAG TGCTGGGCGT GCAGCAAATA CTCGACCGCT GGCAGATCCA GCGCATCGAC
CAGGTGATCG ACATTCTCGG ATTGATGGGC GACGCGGTCG ACAACATTCC GGGCATTCCG
GGCGTTGGCG AAAAAACGGC ACAGAAACTC ATCCAGGAAT ACGACACGAT CGAAAACCTG
ATCACACATG CCGGCGAAAT CAAAGGGAAA CTCGGTGAAA AAATCCGTGA AAACTTCGAT
AAAGCGGTGC TTAGCAAGCA GCTTGCAACG ATCGATTGCA AAGTGCCGGT GCCATTTGAT
GCCGAAGATC TCACCATTTG CTCACCCAAT GCGGAACTGA TCGCAGAGCT TTTCGATGAA
CTCGAATTCA AAACACTCAA AACCCGCATC CTGGGCGGCC AGCCAGGCGG TTCCGCAGGA
GCACTAAGTC CGCGCCCAGC AAGCGCGCAG GCTCCCGCAC GCAAAAACGC CAAAGGCCAA
CTCGACATTT TCGGTAATCC AACGGAAGAA ATCGGCCAGC AACCGGCGGC TGTGAGCGGC
GACCTTGCAG ATGGTGAACT CACCGAAGAC ACGTCCGAAA TCCGCATTCC GACATCCAAA
AGGACGATCG ACAATACTTT CCACCGCTAC CACACGGTGG ACACCCCGGA ACTGATGACG
AGCCTGGCGC ATTACCTGAG CTTGCAGGAC GCATTCTGCT TCGACACCGA AACCACCTCG
CTCGATACGA CGGACGCCGA GCTTGTAGGG CTGTCGTTTT CGTACCTGGC CGGCGAGGCT
TTCTACATTC CCGTGCCCGC CGACCGTCAG CAAGCCGAGG AGATTGTAGG GTATTTCAGG
GCGGTTTTTG AGAATGAAAA CATTGAAAAA ATCGGGCAGA ACATCAAGTA CGACATGCTC
GTGCTGAAAA ACTACGGCAT CGAAGTGCAC GGCAAGCTGA GCGATACCAT GCTCGCCCAT
TACCTGCTCG AACCGGATAA GCGCCACGGC ATGGACATTC TTGCCGCTTC CTACCTGAAC
TACGAACCGG TGTCGATCAC GTCGCTGATC GGCAAAAAAG GCGGGAAACA GGGCAATATG
CGCGACGTCG CTATTCCCGA AATCACGCAG TATGCCGGAG AGGATGCCGA TATTACGTTC
CAGCTGCATT CGATTTTCAG TCGGGAATTA CCCAAAGTGA ATGCGGCGAA GCTGTTCAAC
GAAGTGGAAA TGCCGCTCAC GAAAGTGCTG GCTTCTATGG AAAATACCGG CGTGCGGCTG
GACATCAATG CATTGAAGGA AATGTCGGCC GTGCTCGAAT CCGACCTTCG GCAAACCGAG
TCGGAGATTT ACGAAGCGGC CGGTCAATCG TTCAACATCA GCTCGCCCAA GCAGCTCGGA
GAAGTTCTCT TCGAGAAAAT GAAGCTGATT GAAAAGCCTA AAAAAACCAA AACAGGGCAA
TACGCGACAG GCGAAGAAAT CCTCTCCGAT CTGGAAGCCA ACCACCTGAT CGCACGCAAA
ATACTCGACT ACCGCGAGTT ACAGAAACTC AAATCCACTT ACGTAGATGC ATTACCGACG
ATGGTAAGCA GCCGCACCGG CCGCATTCAC ACGTCTTACA ACCAGGCCGT TGCAGCCACG
GGGCGGCTCA GTTCCACCAA CCCAAACCTG CAAAACATCC CGATACGCAC GCCGCGCGGC
CGAGAGATCC GCAAGGCATT TGTGCCTGAT TCCGCAGATT TCCAGATCCT TTCCGCGGAT
TACTCGCAGA TCGAATTGCG CATTATGGCG GCATTCAGCG GCGACGCGAG TATGACGGAA
GCGTTCAACC AGGGCCGCGA CATCCACGCT ACCACGGCCA GCAAGGTGTT CCAGGTGCCA
TTGGAAGAAG TGACTTCGGA CATGCGCCGG AAATCTAAAA TGGTCAATTT CGGTATCATT
TACGGCATTT CGGCATTCGG CCTTGCACAG CGCCTGGGCA TCCCGCGCGG GGAAGCGAGC
GAGATCATCC GCGCCTATTT CGAAGAATTC CCGGCCGTGA AAGGTTACAT GGACAAAGTC
GTGAACGACG CCCGCGAGCG CGAATATGTA GAAACGATCC TGGGCCGCCG CCGCTATCTG
CCGGATATCA ACTCCCGCAA CCAGACCAAC CGCGGCTACG CCGAACGAAA CGCCATCAAC
GCCCCGATCC AGGGCTCGGC CGCCGACATG ATCAAAGTCG CGATGATCAA CATCCACGAC
TTCATGGCAA AAGAAAAGCT CAAATCGCGG ATGATCCTGC AAGTGCATGA CGAACTCGTC
TTCGACGCGC ATTACAGCGA AATCGACCTC CTGAAAGAAC GGGTGGACGA ACTGATGCGA
AACGCAATCC CGATGGCCGT TCGCATGGAA ACCGGCATCG GCGTCGGTGC GAACTGGTTA
GAGGCGCATT GA
 
Protein sequence
MEKPIHKLFL LDAMALIYRA HFAFIKAPRI TSKGLNTSAV FGFTNTLLEV LQKEKPTHIG 
VAFDTAAPTF RHVQFEAYKA QREAQPEDIT VAIPLVKRLL RGMCIPILEL DGYEADDIIG
TIAKEASREG FEVYMMTPDK DYGQLVEQYI HIYKPAFLGK GAEVLGVQQI LDRWQIQRID
QVIDILGLMG DAVDNIPGIP GVGEKTAQKL IQEYDTIENL ITHAGEIKGK LGEKIRENFD
KAVLSKQLAT IDCKVPVPFD AEDLTICSPN AELIAELFDE LEFKTLKTRI LGGQPGGSAG
ALSPRPASAQ APARKNAKGQ LDIFGNPTEE IGQQPAAVSG DLADGELTED TSEIRIPTSK
RTIDNTFHRY HTVDTPELMT SLAHYLSLQD AFCFDTETTS LDTTDAELVG LSFSYLAGEA
FYIPVPADRQ QAEEIVGYFR AVFENENIEK IGQNIKYDML VLKNYGIEVH GKLSDTMLAH
YLLEPDKRHG MDILAASYLN YEPVSITSLI GKKGGKQGNM RDVAIPEITQ YAGEDADITF
QLHSIFSREL PKVNAAKLFN EVEMPLTKVL ASMENTGVRL DINALKEMSA VLESDLRQTE
SEIYEAAGQS FNISSPKQLG EVLFEKMKLI EKPKKTKTGQ YATGEEILSD LEANHLIARK
ILDYRELQKL KSTYVDALPT MVSSRTGRIH TSYNQAVAAT GRLSSTNPNL QNIPIRTPRG
REIRKAFVPD SADFQILSAD YSQIELRIMA AFSGDASMTE AFNQGRDIHA TTASKVFQVP
LEEVTSDMRR KSKMVNFGII YGISAFGLAQ RLGIPRGEAS EIIRAYFEEF PAVKGYMDKV
VNDAREREYV ETILGRRRYL PDINSRNQTN RGYAERNAIN APIQGSAADM IKVAMINIHD
FMAKEKLKSR MILQVHDELV FDAHYSEIDL LKERVDELMR NAIPMAVRME TGIGVGANWL
EAH