Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tbis_2507 |
Symbol | |
ID | 9169014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobispora bispora DSM 43833 |
Kingdom | Bacteria |
Replicon accession | NC_014165 |
Strand | - |
Start bp | 2934819 |
End bp | 2938025 |
Gene Length | 3207 bp |
Protein Length | 1068 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_003653103 |
Protein GI | 296270471 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.351896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.319546 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGACTG ACTTCGCCCC TCCGCTCCGC GACCTGTCCG AGGCGCGGTG GGAGGCCCTG GCGATGGGCA CCCTGGGCGA GCTCGGATGG CAGCCGCTGG AGGGCAAGGC GATCGCCCCG GGGTCCGGCG AGCGCGAGTC CTGGTCGGAG CTGATCCTGC CCGGCCGGCT GCGCGACGCG ATCGCCCGGA TCAACCCGCA GCTTCCGCCG TCCGCCGTGG ACGACGCGCT CATGGAGGTC ACCAGCGCCA GGTCGCGGGA TGCCCTGGCT GAGAACCGCC GGATCCACGA GTTCCTGACC AAGGGCATCC GGTCGGTGGT TTACACCGAC GAGCACGGCG CGGAGCACAA CCCGACGATC TGGCTGGTCG ACTTCCGCGA GCCGGAGGCC AACGACTTCC TCGCGGTCAA CCAGGTGGCG GTGGTCGAGG GCGAGCACCG GCGGCGTTTC GACGTGGTGC TGTACCTCAA CGGCCTGCCG GTCGGGTTGG TGGAGCTGAA GAAGGCCGGG GACGCGCACG CCGACCTGCA GGGGGCGTAC GCGCAGTTGC GCACGTACGT CGACGAGCTG CCGCTGGCGT TCCGCGCCAA CGTGGTCTGC GTGGTCTCCG ACGGGATCAC GGCCCGCTAC GGCACGGCGT TCACGCCGTT CGAGCACTTC GCGCCGTGGA ACGTGGACGA CGAGGGCCGG CCGGTGCCGC AGCCGCCGAC CCGCGACGAG GACCTGGCGC TGAACCTGGC GCTGCACGGC CTGTTCCAGC AGAGCCGTTT CCTGGAGATC CTGCGCGGCT ATGTCGCGTT CGCCGAGACG CCGGGCGGGA CGACCAAGCG GATCGCCAAG CCGCACCAGT ACTTCGCGGT CAGCAAGGCC GTCGGCAAGA CCATCGAGGC GACCCGGCGG GATGGGCGCG CCGGGGTGGT CTGGCACACC CAAGGCTCGG GCAAGTCGCT GGAGATGGAG CTTTACGCCC ACCAGGTCAT GACGCACCCG AGCCTCGGCA ACCCGACCAT CGTCGTCATC ACCGACCGCA CCGACCTGGA CGACCAGCTC TACTCGGCCT TCCTCGCCAG CGAGCTGCTG CCGGAGAAGC CGGTGCAGGC CGCGACCCGC GACGACCTGC GTACCGAACT CCTCAACCGC CGTACCGGCG GCATCATCTT CACCACGCTG CAGAAGTTCG GCCGCACCAA GGAGGAGCGG GAGGCGGGCC AGGCCCACCC GCTGCTGTCC GACCGGCGCA ACGTCATCGT CATCGTCGAC GAGGCGCACC GCAGCCACTA CGACAGCCTG GACGGGTACG CCCGGCACCT GCGCGACGCC CTGCCCAACG CCACGTTCGT CGCCTTCACC GGCACGCCGA TCTCCGAGGC CGACCGCAAC ACCCGCGACG TGTTCGGCGA CTACATCGAC ATCTACGACT TGACCCGGGC CGTGGACGAC GGCGCCACCG TGCGCGTCTA CCACGAGAGC CGGCTCATCC CGGTGAGCCT GCCCAAGGAT GTCGACCCCG AGGTGATCGA CGACCGGGCC GACCAGATCA CCGCGGGCCT GGACGACGCG GAACGGCAGC GCATCCAGCG CAGCGTGGCG GTGATGAACG CCGTCTACGG CGCCCCGGAC CGGCTGAAGA AGCTCGCCGC CGACCTGGTC TCCCACTGGG AGGCGCGCTC CGCGCAGATG CGCAAGTTCA TCGACGGCCC GGGCAAGGGG CTGATCGTCT GCGCCACCCG GGACATCTGC GCCCGGCTGT ACGAGGAGAT CATCGCGCTG CGCCCGGAGT GGCATGCCGA CGCCGACGAC AAGGGCAAGA TCAAGGTCGT CTACACCGGC GACCCCAAGG ACGAGCCCCA CATCAGGAAG CACGTGCGCC GGCCGTCGCA GCTCAAGGTG ATCCAGCGCC GGGCGAAGGA TCCGGACGAC GAGCTGGAGC TGGTGATCGT CCAGTCGATG TGGCTGACCG GGTTCGACTC CCCGCCGCTG CACACCCTCT ACTTGGACAA GCCGATGCGG GGGGCGGCGC TGATGCAGGC GCTCGCCCGG GTGAACCGCC CTTTCCGGGC CAAGCAGGAC GGCCTGCTCG TCGGCTACGC GCCGGTCACC CAGAGCCTGC ACGAAGCCCT GGCCGAGTAC ACCCAAGACG ACCAGGACAC CAGGCCGGTG GGCCGCGACA TCGACGAGGT CGTGGCCCAG GTCCGCGACC TGCACGACGT GATCTGCAAC GTGATCCTGC GCGGGTACGA CTGGCGCGGC AAGCTGGCGG CCAAGTCGGA CAAGGCCTAC CGGGAGGCCG TGCTCGGCAC CGTCAACTAC CTGCGTAATC CCGCGTTGCC CGAAAACCAG GTCGAGCCCG GAGAGGACAC CCTCGCCGAA AGGTTCCGGA AGGCGGCGGC GAGGCTGGAC CGGCTCTACG CGCTGTGCGC CAGCAGCGGC CAGCTCAACC CCTACCGCGA CGACATCGCC TTCTTCCAGG CCGTACGGGT TTGGATGGCC AAGTTCGACG TGGAAGACCG CCGGGCCCGC GGCCTGCCGA TCCCAGCCGA AATCGCGCTC TACCTCAAGC AGCTCACCGC CGGGATCATC GAAGCCGGAG GCGTCACCGA CATCTACCAA GCCGCCGGCA TCGACCGGCC CGACCTATCC CACCTGGACG AAGCGTACCT GGAGCGTCTG CGGGCCTCGA AGACGCCCCA CCTCGCGATC GAGGCGTTGC GGCGGGCCAT CGAGCAGACG ATGCGCCGGG TCACCCGGCA CAACGTGGTG CGGCAGAAGA CCTTCTCCGA CCGGTTGATC GAGCTGATGA ACCGGTATAC CAACCAGCAC CTCACTTCGG CCGAGATCAT CGCCGAGCTG GTGGCCATGG CCAAGGAGGT AGCCGCCGAC GCCGACCGCG GCAAGGCGTT CAATCCCCCG CTGAGCGAGG ACGAGCTGGC CTTCTACGAC GCCGTGGCGC AAAACGAGGC AGCGGTCAGG GAGATGGGCC CAGGGGTACT CGCCGACATC GCACGCGACC TTGTGCGGAC GGTACGCAAC TCCGTCACCG TCGACTGGGT CTCCCGCGAC GATGTGCGCG CCAAGCTCCG CACCATTATC AAGCGGCTGC TTGCCAAGCA CGGCTACCCG CCGGACGCCG CGCCAGCCGC CATCGACCTG GTCATTCGGC AGATGGAGAC CTTCGCCGAG GACTGGTCAC CCGAAGCCAG CCGGTAG
|
Protein sequence | MTTDFAPPLR DLSEARWEAL AMGTLGELGW QPLEGKAIAP GSGERESWSE LILPGRLRDA IARINPQLPP SAVDDALMEV TSARSRDALA ENRRIHEFLT KGIRSVVYTD EHGAEHNPTI WLVDFREPEA NDFLAVNQVA VVEGEHRRRF DVVLYLNGLP VGLVELKKAG DAHADLQGAY AQLRTYVDEL PLAFRANVVC VVSDGITARY GTAFTPFEHF APWNVDDEGR PVPQPPTRDE DLALNLALHG LFQQSRFLEI LRGYVAFAET PGGTTKRIAK PHQYFAVSKA VGKTIEATRR DGRAGVVWHT QGSGKSLEME LYAHQVMTHP SLGNPTIVVI TDRTDLDDQL YSAFLASELL PEKPVQAATR DDLRTELLNR RTGGIIFTTL QKFGRTKEER EAGQAHPLLS DRRNVIVIVD EAHRSHYDSL DGYARHLRDA LPNATFVAFT GTPISEADRN TRDVFGDYID IYDLTRAVDD GATVRVYHES RLIPVSLPKD VDPEVIDDRA DQITAGLDDA ERQRIQRSVA VMNAVYGAPD RLKKLAADLV SHWEARSAQM RKFIDGPGKG LIVCATRDIC ARLYEEIIAL RPEWHADADD KGKIKVVYTG DPKDEPHIRK HVRRPSQLKV IQRRAKDPDD ELELVIVQSM WLTGFDSPPL HTLYLDKPMR GAALMQALAR VNRPFRAKQD GLLVGYAPVT QSLHEALAEY TQDDQDTRPV GRDIDEVVAQ VRDLHDVICN VILRGYDWRG KLAAKSDKAY REAVLGTVNY LRNPALPENQ VEPGEDTLAE RFRKAAARLD RLYALCASSG QLNPYRDDIA FFQAVRVWMA KFDVEDRRAR GLPIPAEIAL YLKQLTAGII EAGGVTDIYQ AAGIDRPDLS HLDEAYLERL RASKTPHLAI EALRRAIEQT MRRVTRHNVV RQKTFSDRLI ELMNRYTNQH LTSAEIIAEL VAMAKEVAAD ADRGKAFNPP LSEDELAFYD AVAQNEAAVR EMGPGVLADI ARDLVRTVRN SVTVDWVSRD DVRAKLRTII KRLLAKHGYP PDAAPAAIDL VIRQMETFAE DWSPEASR
|
| |