Gene Cwoe_2867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2867 
Symbol 
ID8733311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3059406 
End bp3062060 
Gene Length2655 bp 
Protein Length884 aa 
Translation table11 
GC content71% 
IMG OID646503480 
ProductDNA polymerase I 
Protein accessionYP_003394661 
Protein GI284044321 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.991449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACGGG GAACGGCTTC CACGCCGCTG TGGTCCCTAC TCTGTGTCGC CGTGGCCGCC 
GCCCCCGACA AGCCGGACGA GCTGTTCCTG ATCGATGGGA ACTCGCTCGC GTACCGCGCC
TTCTTCGCGT TGCCGGAGTC GATCGCGACC TCGACCGGCT TCCCGACGAA CGCGATCTTC
GGCTTCGCCT CGATGCTGGT GAAGATCCTC ACCGAGTACG GCCCGAGAGC GACGATCGTC
GCGTGGGACC GCGGCCACTC CGGTCGCAGA GAGGTGTATC CCGAGTACAA GGCGCAGCGC
TCCTCGCGCC CGGACCTGTT CAAGCAGCAG TGGCCGCACC TCGAGCCGCT GGTCGAGTCG
TTCGGATATC AGAACGTCTC GCTCGACGGC TACGAGGCAG ACGACGTGAT CGCGACGCTC
GCCGAGCGCG CGAAGGCCGC GGGCATCCCG GTGATGGTCG TGACCGGCGA CCGCGACTCG
TTCCAGCTCG TCGACGAGGG CGTGCAGATC ATGGCGACCT CGCGCGGCAT CACCGAGACG
AGAACGTACG ACCGCCAGGG CGTCATCGAC CGCTACGGGA TCCCGCCCGA GCTGGTCCCC
GACTTCATCG GCCTCAAGGG CGACACGTCC GACAACATCC CCGGCGTCCC CGGGATCGGC
GACAAGACCG CCGCGCAGCT GCTGAACGAC TTCGGCGACC TCGAGGGCGT GCTCGCGAAC
GCGCACACGA TCAGAGCGAG AAAGCGGCGC GAGAACCTGA TCGAGCACGC CGAGGACGCA
CGCGTCAGCA AGCAGCTCGC GACGATGCGG CGCGACCTCG AGGTCGCGAT CGACGTCGCC
GCCGTCCACG GCGCCGAGCC CGACCGCTCG AGACTGCGCG AGACGTTCCG CGAGTTCGAG
CTGCGCGACC CGCTGCGGCG GCTGGAGGAG GCGCTCGGCG ACGGCGACGA GGCTGCGCCG
CGCCCGCAGG CCGAGCGGGC GATCGGCGCG AAGCTGCGGA CCGGCGCGGT GGCGGACCTT
GCGTCGCTGG CTCCCGCCGG CGGCGAGATC GCGCTCGCCG CGCGCGAGCC CGAGAAGCCC
GACGACGCGC TGTTCGGCGA GAGCGACGCG TGGCGCTTCG GCGCCTACGC CGGGCAGGAC
GCGCTCGCGG GAGAGTGCGG GGGCGACGCC GGGCCGGAGG TGCTCGCCGC AGCGATCGGC
GAGCGGCCAG CGCTCGCGCA CGACGCGAAG GCACTGCGCG AGGTCCCCGC GACGCTCGCG
CACGACACCC TGATCGCCGC CTACCTGCTC GAACCCGCCC GCCGCAGCTA CCCGCTCGAC
GAGCTGACCG AGGAGCGCGG CATCGGGACC GACGTCGAGG ACGCCGCGGC AGCCGACGCG
ATCCTCGTCC ACGCGCTCAC CGCCGCGCAG CGCCCGCAGC TGGAGGAGCG CGAGCTGCTG
CCGCTGTTCG ACGACGTCGA GCTGCCGCTC GTGCGCGTGC TGCGCGCGAT GGAGACGGCA
GGGCTGAGGC TCGACACGGA GCTGCTGGCG ACGATCCGCA CGCGCGTGAT GGACGAGGCC
GTCGCGCTCG AACGCGAGAT CTGGGAGCTG ACCGGCGAGG AGTTCATGAT CGGCTCCCCG
CAGCAGCTCG GCCAGATCCT GTTCGAGAAG CTCGGCCTGT CGAGAAAGAG ACGCGGCAAG
ACCGGCTACT CGACGGACGC CCGCGTGCTG CAGGCGATCC GCGGCGAGCA CCCGGTGATC
GAGAAGATCG AGCGCTGGCG CGAGCTGACG AAGCTCGCCT CGACCTACCT CGACGCGCTG
CCGCTGCTGA TCTCGCCCGA GGACCACCGG CTGCACACGA CGTTCAACCA GGTGACGGCC
GCGACCGGCC GCCTCTCCTC GACGAACCCG AACCTGCAGA ACATCCCGAT CCGCACCCCG
CTCGGTCGCG AGATCCGCGC CTGCTTCGTC GCCGAGCCCG GCAACGTCCT CATCTCCGCC
GACTACTCCC AGGTCGAGCT GCGCGTGCTC GCGCACATCG CCGGCGAGGA GGTGCTGAAG
GAGATCTTCC GCCGCGGCGA GGACGTCCAC ACCGCGACCG CCGCCGCGAT CCTCGGCATC
GACCCTGAGC AGCTCGACGC CGGCTCGCGC TCGAAGGCGA AGATGGTCAA CTACGGCATC
GTCTACGGCC TCTCGGCCTT CGGCCTCGCC GACCGCCTGC AGATCCCGCG CGAGGAGGCG
CAGGAGTTCA TCGACCGCTA CCTCGACGGC TTCCCGGCCG TCCAGGCGTT CATCAGAACG
ACGATCGAGC AGGCGACCGA CCAGGGTTAC GTGACGACGC TGATGGGGCG GCGCCGGCAG
ATCCCCGAGC TGCGGGCGCG CAATTACCAG ATGCGCCAGC TCGGCGAGCG GCTCGCCGTC
AACACCGTGA TCCAGGGCAC CGCCGCCGAC GTGATCAAGC TCGCGATGGT CAACGCCGAC
CGCGCGCTGC ACGCCTCCGG CCTGCGCACG AGACTGATCC TCCAGATCCA CGACGAGCTG
CTGTTCGAAG GGCCTGCGGA GGAGGCCGAG CAGGCACGCG ACCTCGTCGT CCCGCAGATG
GTCGACGCGC TGGAGCTCGA CCCGCCGCTC GTCGTCGACG CGGGCATCGG CCCGAACTGG
CTGGACGCGA AGTGA
 
Protein sequence
MIRGTASTPL WSLLCVAVAA APDKPDELFL IDGNSLAYRA FFALPESIAT STGFPTNAIF 
GFASMLVKIL TEYGPRATIV AWDRGHSGRR EVYPEYKAQR SSRPDLFKQQ WPHLEPLVES
FGYQNVSLDG YEADDVIATL AERAKAAGIP VMVVTGDRDS FQLVDEGVQI MATSRGITET
RTYDRQGVID RYGIPPELVP DFIGLKGDTS DNIPGVPGIG DKTAAQLLND FGDLEGVLAN
AHTIRARKRR ENLIEHAEDA RVSKQLATMR RDLEVAIDVA AVHGAEPDRS RLRETFREFE
LRDPLRRLEE ALGDGDEAAP RPQAERAIGA KLRTGAVADL ASLAPAGGEI ALAAREPEKP
DDALFGESDA WRFGAYAGQD ALAGECGGDA GPEVLAAAIG ERPALAHDAK ALREVPATLA
HDTLIAAYLL EPARRSYPLD ELTEERGIGT DVEDAAAADA ILVHALTAAQ RPQLEERELL
PLFDDVELPL VRVLRAMETA GLRLDTELLA TIRTRVMDEA VALEREIWEL TGEEFMIGSP
QQLGQILFEK LGLSRKRRGK TGYSTDARVL QAIRGEHPVI EKIERWRELT KLASTYLDAL
PLLISPEDHR LHTTFNQVTA ATGRLSSTNP NLQNIPIRTP LGREIRACFV AEPGNVLISA
DYSQVELRVL AHIAGEEVLK EIFRRGEDVH TATAAAILGI DPEQLDAGSR SKAKMVNYGI
VYGLSAFGLA DRLQIPREEA QEFIDRYLDG FPAVQAFIRT TIEQATDQGY VTTLMGRRRQ
IPELRARNYQ MRQLGERLAV NTVIQGTAAD VIKLAMVNAD RALHASGLRT RLILQIHDEL
LFEGPAEEAE QARDLVVPQM VDALELDPPL VVDAGIGPNW LDAK