Gene Information |
Locus tag | Gobs_3201 |
Symbol | |
ID | 8754881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 3356144 |
End bp | 3358882 |
Gene Length | 2739 bp |
Protein Length | 912 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | DNA polymerase I |
Protein accession | YP_003410177 |
Protein GI | 284991623 |
COG category | |
COG ID | |
TIGRFAM ID | |
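The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Minimal consistency check for the coordinate fields of this record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS ends with one stop codon that is not counted
# in the protein length.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate interval."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the Gene Information fields above.
start_bp, end_bp = 3356144, 3358882
length_bp = gene_length(start_bp, end_bp)
print(length_bp, "bp")                  # Gene Length field
print(protein_length(length_bp), "aa")  # Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (2739 bp) and Protein Length (912 aa) fields.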
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCTC CGGTTGCCAC CTCAGCGCCC ACCGCCCCCA CCGACCCGGG GGGCCGCCCG CGGCTGCTGC TGCTCGACGG GCACTCGCTG GCCTACCGCG CCTTCTTCGC CCTGCCGGTC GAGAACTTCT CGACGACGAC CGGGCAGCCG ACCAACGCGG TCTACGGCTT CACCTCGATG CTGATCAACG TGCTCCGCGA CGAGCAGCCG ACGCACGTGG CGGTCGCCTT CGACGTGGGC CGCAAGACCT TCCGCAACGA GATCTACGCG GAGTACAAGG CCAACCGCAG CGAGAGCCCG ACCGACTTCC GCGGTCAGGT CAGCCTCATC CAGGAGGTGC TGGCCGCACT GCACGTCCCG GTCATCACCG CCGAGGGCTA CGAGGCCGAC GACGTCATCG CCACCCTCAC CGTGGCGGCC GTCGAGCAGG GCATGGACGT GCTCATCTGC ACCGGCGACC GCGACGCCCT GCAGCTGGTC AACGAGCACG TCACCGTGCT CTACCCGCGC AAGGGCGTCT CCGACCTGAC CCGGTTCACG CCGGAGGAGG TCGAGGCCAA GTACGGCCTG TCGCCGGCGC AGTACCCCGA CTTCGCGGCA CTGCGGGGCG ACCCCAGCGA CAACCTGCCG AGCATCCCGA GCGTCGGGGA GAAGACGGCG GCCAAGTGGG TCCGGGAGTA CGGCTCGCTC GACGCCCTGG TCGACCAGGT CGACACCGTG AAGGGCAAGG TCGGCGAGAA GCTGCGCGAG CACCTGTCCT CGGTGCTGCA GAACCGGCGG CTGACCGAGC TCGACCGTGC CGTGCCGCTG GAGCTCGGCC CGGCGGACCT CGCCGTTCGC GCCTGGGACC GCAACGAGGT GCACACCCTC TTCGACAACC TGCAGTTCCG GGTGCTGCGC GACCGGCTCT TCGCCACGCT CACCAGCGCC GAGCCGGAGG CCGAGGGCGG CTTCGACGTC GCCGACGACG AGGTGCCCGC CGGCGGGCTG GGCGCCTGGC TGGACCAGCA CGCGCGCACC AGCCGCACCG GCATCATCTT CCGGGGCACG TGGGGCCGGG GCACCGGTGA GCTGACCGGC ATCGCGCTCG CTGGCGGCGA CGACCACGCC ACCTTCGTCG ACCTGGGCCC CGGCCTCGAC GCCGTCGACG AGCAGGCGCT GGCCGACTGG CTGGCCGACC CGGACGCACA AAAGGTCGTG CACGAGGTCA AGGGCCCGCT GCTGGCCGTC TGGGCACGCG GCTGGGACCT CGCCGGCGTC GTCAGCGACA CGGCGCTGGC TGCCTACCTG GCGCTGCCCG GGCAGCGCTC CTTCGACCTG GCCGACCTCG CCGTCCGGTA CCTGCGCCGC GAGCTGAAGG ACTCCGCCGC CGAGGAGACC CAGCTGACCC TCGACGGGAT GGGGCCGTCG GAGGAGGACC TCGCCCGCGA GGCCGCGCAC GCCGACGTCC TCAAGGCCGT CGCGGTCAAC GACCTCTCCG ACGCGCTGGA TTCCCTGCTC GGCCAGCGCG GCGGCGACGC CCTGCTCGGC GGGATCGAGC TGCCGCTCAC CTTCGTGCTC GCCCGCATGG AGCAGCGCGG CATCGCCGCC GACCTGGACT TCCTGCACGA GCTGCAGCGC GAGTTCGCCG ACGGCGTGGC CGCCGCGGCC GCCGAGGCCT ACGCGGTCAT CGGCCGCGAG GTGAACCTCG GCTCGCCCAA GCAGCTGCAG GCGGTGCTCT TCGACGAGCT GGGCCTGCCC AAGACCAAGA AGATCAAGTC CGGCTACACG ACCGACGCCG AGGCACTGAC CAACCTGCTG 
GCGCAGACCG GCCACCCGTT CCTCGAGCAC CTGCTCCGGC ACCGCGACGT CACCCGGCTG CGCACCGTCA TCGACGGCCT GATCCCCATG GTCGACGACG GCGGGCGGAT CCACACGACG TTCCAGCAGA CGATCGCCGC CACCGGCCGG CTGTCCTCGA TCGACCCGAA CCTGCAGAAC ATCCCGATCC GCACCGCGGA GGGCCGGCGG ATCCGGCAGG CGTTCGTCGT CGGCTCCGGC CACGAGTCGC TGATGACAGC GGACTACAGC CAGATCGAGA TGCGGATCAT GGCGCACCTC TCCGGCGACG CGGGGCTCAT CGAGGCCTTC ACGTCGGGGG AGGACCTGCA CTCCTTCGTC GCCTCCCGGG CGTACGACAT CCCGATCGAG GACGTCGACC CGGAGATGCG CCGCCGGATC AAGGCGATGA GCTACGGCCT GGCCTACGGG CTGTCGGCCT ACGGACTGGC CGGGCAGCTG CGCATCTCCG TCGAGGAGGC GCGCGAGCAG ATGCACGCCT ACTTCGAGCG CTTCGGCGGG ATCCGGGAGT ACCTCGACGG CGTGGTCGAC GACGCCCGGC AGACCGGCTA CACCGAGACG ACGCTGGGCC GCCGCCGCTA CCTGCCCGAC CTCACCAGCG ACAACGGGCA GCGGCGCCAG ATGGCCGAGC GGATGGCGCT CAACGCCCCC ATCCAGGGGT CGGCGGCCGA CGTCATCAAG GTGGCCATGC TGCGGGTGGA GCAGGCCATC GCCGACGAGG GGCTGCGCTC GCGGATGCTG CTGCAGGTGC ACGACGAGCT CGTGCTCGAG GTGGCGCCCG GGGAGCGGGA GGCGCTGGAG ACCCTGGTGC GCCGCGAGAT GGCCGGTGCC GCGCAGCTCT CGGTGCCGTT GGAGGTCTCG GTCGGCTTCG GCCGGACCTG GGACGAGGCC GCCCACTGA
|
Protein sequence | MSAPVATSAP TAPTDPGGRP RLLLLDGHSL AYRAFFALPV ENFSTTTGQP TNAVYGFTSM LINVLRDEQP THVAVAFDVG RKTFRNEIYA EYKANRSESP TDFRGQVSLI QEVLAALHVP VITAEGYEAD DVIATLTVAA VEQGMDVLIC TGDRDALQLV NEHVTVLYPR KGVSDLTRFT PEEVEAKYGL SPAQYPDFAA LRGDPSDNLP SIPSVGEKTA AKWVREYGSL DALVDQVDTV KGKVGEKLRE HLSSVLQNRR LTELDRAVPL ELGPADLAVR AWDRNEVHTL FDNLQFRVLR DRLFATLTSA EPEAEGGFDV ADDEVPAGGL GAWLDQHART SRTGIIFRGT WGRGTGELTG IALAGGDDHA TFVDLGPGLD AVDEQALADW LADPDAQKVV HEVKGPLLAV WARGWDLAGV VSDTALAAYL ALPGQRSFDL ADLAVRYLRR ELKDSAAEET QLTLDGMGPS EEDLAREAAH ADVLKAVAVN DLSDALDSLL GQRGGDALLG GIELPLTFVL ARMEQRGIAA DLDFLHELQR EFADGVAAAA AEAYAVIGRE VNLGSPKQLQ AVLFDELGLP KTKKIKSGYT TDAEALTNLL AQTGHPFLEH LLRHRDVTRL RTVIDGLIPM VDDGGRIHTT FQQTIAATGR LSSIDPNLQN IPIRTAEGRR IRQAFVVGSG HESLMTADYS QIEMRIMAHL SGDAGLIEAF TSGEDLHSFV ASRAYDIPIE DVDPEMRRRI KAMSYGLAYG LSAYGLAGQL RISVEEAREQ MHAYFERFGG IREYLDGVVD DARQTGYTET TLGRRRYLPD LTSDNGQRRQ MAERMALNAP IQGSAADVIK VAMLRVEQAI ADEGLRSRML LQVHDELVLE VAPGEREALE TLVRREMAGA AQLSVPLEVS VGFGRTWDEA AH
|
| |
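The relationship between the Gene sequence and Protein sequence fields can be sketched in a few lines of plain Python. This assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the Strand field is "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names (`clean`, `translate`, `gc_fraction`) are illustrative, not part of any database API.

```python
# Sketch: translate a space-grouped nucleotide string and compute GC content.
# The codon table is built from the NCBI layout for the standard code
# (bases ordered T, C, A, G); table 11 uses the same codon assignments.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence; stop codons are returned as '*'."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the Gene sequence field above.
first_groups = "ATGTCCGCTC CGGTTGCCAC CTCAGCGCCC"
print(translate(first_groups))  # compare with the start of the Protein sequence
```

Applied to the full Gene sequence field, `translate` should yield the listed protein followed by a trailing `*` for the stop codon, and `gc_fraction` can be compared against the GC content field.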