Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2867 |
Symbol | |
ID | 8733311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 3059406 |
End bp | 3062060 |
Gene Length | 2655 bp |
Protein Length | 884 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646503480 |
Product | DNA polymerase I |
Protein accession | YP_003394661 |
Protein GI | 284044321 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.991449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACGGG GAACGGCTTC CACGCCGCTG TGGTCCCTAC TCTGTGTCGC CGTGGCCGCC GCCCCCGACA AGCCGGACGA GCTGTTCCTG ATCGATGGGA ACTCGCTCGC GTACCGCGCC TTCTTCGCGT TGCCGGAGTC GATCGCGACC TCGACCGGCT TCCCGACGAA CGCGATCTTC GGCTTCGCCT CGATGCTGGT GAAGATCCTC ACCGAGTACG GCCCGAGAGC GACGATCGTC GCGTGGGACC GCGGCCACTC CGGTCGCAGA GAGGTGTATC CCGAGTACAA GGCGCAGCGC TCCTCGCGCC CGGACCTGTT CAAGCAGCAG TGGCCGCACC TCGAGCCGCT GGTCGAGTCG TTCGGATATC AGAACGTCTC GCTCGACGGC TACGAGGCAG ACGACGTGAT CGCGACGCTC GCCGAGCGCG CGAAGGCCGC GGGCATCCCG GTGATGGTCG TGACCGGCGA CCGCGACTCG TTCCAGCTCG TCGACGAGGG CGTGCAGATC ATGGCGACCT CGCGCGGCAT CACCGAGACG AGAACGTACG ACCGCCAGGG CGTCATCGAC CGCTACGGGA TCCCGCCCGA GCTGGTCCCC GACTTCATCG GCCTCAAGGG CGACACGTCC GACAACATCC CCGGCGTCCC CGGGATCGGC GACAAGACCG CCGCGCAGCT GCTGAACGAC TTCGGCGACC TCGAGGGCGT GCTCGCGAAC GCGCACACGA TCAGAGCGAG AAAGCGGCGC GAGAACCTGA TCGAGCACGC CGAGGACGCA CGCGTCAGCA AGCAGCTCGC GACGATGCGG CGCGACCTCG AGGTCGCGAT CGACGTCGCC GCCGTCCACG GCGCCGAGCC CGACCGCTCG AGACTGCGCG AGACGTTCCG CGAGTTCGAG CTGCGCGACC CGCTGCGGCG GCTGGAGGAG GCGCTCGGCG ACGGCGACGA GGCTGCGCCG CGCCCGCAGG CCGAGCGGGC GATCGGCGCG AAGCTGCGGA CCGGCGCGGT GGCGGACCTT GCGTCGCTGG CTCCCGCCGG CGGCGAGATC GCGCTCGCCG CGCGCGAGCC CGAGAAGCCC GACGACGCGC TGTTCGGCGA GAGCGACGCG TGGCGCTTCG GCGCCTACGC CGGGCAGGAC GCGCTCGCGG GAGAGTGCGG GGGCGACGCC GGGCCGGAGG TGCTCGCCGC AGCGATCGGC GAGCGGCCAG CGCTCGCGCA CGACGCGAAG GCACTGCGCG AGGTCCCCGC GACGCTCGCG CACGACACCC TGATCGCCGC CTACCTGCTC GAACCCGCCC GCCGCAGCTA CCCGCTCGAC GAGCTGACCG AGGAGCGCGG CATCGGGACC GACGTCGAGG ACGCCGCGGC AGCCGACGCG ATCCTCGTCC ACGCGCTCAC CGCCGCGCAG CGCCCGCAGC TGGAGGAGCG CGAGCTGCTG CCGCTGTTCG ACGACGTCGA GCTGCCGCTC GTGCGCGTGC TGCGCGCGAT GGAGACGGCA GGGCTGAGGC TCGACACGGA GCTGCTGGCG ACGATCCGCA CGCGCGTGAT GGACGAGGCC GTCGCGCTCG AACGCGAGAT CTGGGAGCTG ACCGGCGAGG AGTTCATGAT CGGCTCCCCG CAGCAGCTCG GCCAGATCCT GTTCGAGAAG CTCGGCCTGT CGAGAAAGAG ACGCGGCAAG ACCGGCTACT CGACGGACGC CCGCGTGCTG CAGGCGATCC GCGGCGAGCA CCCGGTGATC GAGAAGATCG AGCGCTGGCG CGAGCTGACG AAGCTCGCCT CGACCTACCT CGACGCGCTG CCGCTGCTGA TCTCGCCCGA GGACCACCGG CTGCACACGA CGTTCAACCA GGTGACGGCC GCGACCGGCC GCCTCTCCTC GACGAACCCG AACCTGCAGA ACATCCCGAT CCGCACCCCG CTCGGTCGCG AGATCCGCGC CTGCTTCGTC GCCGAGCCCG GCAACGTCCT CATCTCCGCC GACTACTCCC AGGTCGAGCT GCGCGTGCTC GCGCACATCG CCGGCGAGGA GGTGCTGAAG GAGATCTTCC GCCGCGGCGA GGACGTCCAC ACCGCGACCG CCGCCGCGAT CCTCGGCATC GACCCTGAGC AGCTCGACGC CGGCTCGCGC TCGAAGGCGA AGATGGTCAA CTACGGCATC GTCTACGGCC TCTCGGCCTT CGGCCTCGCC GACCGCCTGC AGATCCCGCG CGAGGAGGCG CAGGAGTTCA TCGACCGCTA CCTCGACGGC TTCCCGGCCG TCCAGGCGTT CATCAGAACG ACGATCGAGC AGGCGACCGA CCAGGGTTAC GTGACGACGC TGATGGGGCG GCGCCGGCAG ATCCCCGAGC TGCGGGCGCG CAATTACCAG ATGCGCCAGC TCGGCGAGCG GCTCGCCGTC AACACCGTGA TCCAGGGCAC CGCCGCCGAC GTGATCAAGC TCGCGATGGT CAACGCCGAC CGCGCGCTGC ACGCCTCCGG CCTGCGCACG AGACTGATCC TCCAGATCCA CGACGAGCTG CTGTTCGAAG GGCCTGCGGA GGAGGCCGAG CAGGCACGCG ACCTCGTCGT CCCGCAGATG GTCGACGCGC TGGAGCTCGA CCCGCCGCTC GTCGTCGACG CGGGCATCGG CCCGAACTGG CTGGACGCGA AGTGA
|
Protein sequence | MIRGTASTPL WSLLCVAVAA APDKPDELFL IDGNSLAYRA FFALPESIAT STGFPTNAIF GFASMLVKIL TEYGPRATIV AWDRGHSGRR EVYPEYKAQR SSRPDLFKQQ WPHLEPLVES FGYQNVSLDG YEADDVIATL AERAKAAGIP VMVVTGDRDS FQLVDEGVQI MATSRGITET RTYDRQGVID RYGIPPELVP DFIGLKGDTS DNIPGVPGIG DKTAAQLLND FGDLEGVLAN AHTIRARKRR ENLIEHAEDA RVSKQLATMR RDLEVAIDVA AVHGAEPDRS RLRETFREFE LRDPLRRLEE ALGDGDEAAP RPQAERAIGA KLRTGAVADL ASLAPAGGEI ALAAREPEKP DDALFGESDA WRFGAYAGQD ALAGECGGDA GPEVLAAAIG ERPALAHDAK ALREVPATLA HDTLIAAYLL EPARRSYPLD ELTEERGIGT DVEDAAAADA ILVHALTAAQ RPQLEERELL PLFDDVELPL VRVLRAMETA GLRLDTELLA TIRTRVMDEA VALEREIWEL TGEEFMIGSP QQLGQILFEK LGLSRKRRGK TGYSTDARVL QAIRGEHPVI EKIERWRELT KLASTYLDAL PLLISPEDHR LHTTFNQVTA ATGRLSSTNP NLQNIPIRTP LGREIRACFV AEPGNVLISA DYSQVELRVL AHIAGEEVLK EIFRRGEDVH TATAAAILGI DPEQLDAGSR SKAKMVNYGI VYGLSAFGLA DRLQIPREEA QEFIDRYLDG FPAVQAFIRT TIEQATDQGY VTTLMGRRRQ IPELRARNYQ MRQLGERLAV NTVIQGTAAD VIKLAMVNAD RALHASGLRT RLILQIHDEL LFEGPAEEAE QARDLVVPQM VDALELDPPL VVDAGIGPNW LDAK
|
| |