Gene Tbis_2507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbis_2507 
Symbol 
ID9169014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobispora bispora DSM 43833 
KingdomBacteria 
Replicon accessionNC_014165 
Strand
Start bp2934819 
End bp2938025 
Gene Length3207 bp 
Protein Length1068 aa 
Translation table11 
GC content69% 
IMG OID 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_003653103 
Protein GI296270471 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.351896 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.319546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACTG ACTTCGCCCC TCCGCTCCGC GACCTGTCCG AGGCGCGGTG GGAGGCCCTG 
GCGATGGGCA CCCTGGGCGA GCTCGGATGG CAGCCGCTGG AGGGCAAGGC GATCGCCCCG
GGGTCCGGCG AGCGCGAGTC CTGGTCGGAG CTGATCCTGC CCGGCCGGCT GCGCGACGCG
ATCGCCCGGA TCAACCCGCA GCTTCCGCCG TCCGCCGTGG ACGACGCGCT CATGGAGGTC
ACCAGCGCCA GGTCGCGGGA TGCCCTGGCT GAGAACCGCC GGATCCACGA GTTCCTGACC
AAGGGCATCC GGTCGGTGGT TTACACCGAC GAGCACGGCG CGGAGCACAA CCCGACGATC
TGGCTGGTCG ACTTCCGCGA GCCGGAGGCC AACGACTTCC TCGCGGTCAA CCAGGTGGCG
GTGGTCGAGG GCGAGCACCG GCGGCGTTTC GACGTGGTGC TGTACCTCAA CGGCCTGCCG
GTCGGGTTGG TGGAGCTGAA GAAGGCCGGG GACGCGCACG CCGACCTGCA GGGGGCGTAC
GCGCAGTTGC GCACGTACGT CGACGAGCTG CCGCTGGCGT TCCGCGCCAA CGTGGTCTGC
GTGGTCTCCG ACGGGATCAC GGCCCGCTAC GGCACGGCGT TCACGCCGTT CGAGCACTTC
GCGCCGTGGA ACGTGGACGA CGAGGGCCGG CCGGTGCCGC AGCCGCCGAC CCGCGACGAG
GACCTGGCGC TGAACCTGGC GCTGCACGGC CTGTTCCAGC AGAGCCGTTT CCTGGAGATC
CTGCGCGGCT ATGTCGCGTT CGCCGAGACG CCGGGCGGGA CGACCAAGCG GATCGCCAAG
CCGCACCAGT ACTTCGCGGT CAGCAAGGCC GTCGGCAAGA CCATCGAGGC GACCCGGCGG
GATGGGCGCG CCGGGGTGGT CTGGCACACC CAAGGCTCGG GCAAGTCGCT GGAGATGGAG
CTTTACGCCC ACCAGGTCAT GACGCACCCG AGCCTCGGCA ACCCGACCAT CGTCGTCATC
ACCGACCGCA CCGACCTGGA CGACCAGCTC TACTCGGCCT TCCTCGCCAG CGAGCTGCTG
CCGGAGAAGC CGGTGCAGGC CGCGACCCGC GACGACCTGC GTACCGAACT CCTCAACCGC
CGTACCGGCG GCATCATCTT CACCACGCTG CAGAAGTTCG GCCGCACCAA GGAGGAGCGG
GAGGCGGGCC AGGCCCACCC GCTGCTGTCC GACCGGCGCA ACGTCATCGT CATCGTCGAC
GAGGCGCACC GCAGCCACTA CGACAGCCTG GACGGGTACG CCCGGCACCT GCGCGACGCC
CTGCCCAACG CCACGTTCGT CGCCTTCACC GGCACGCCGA TCTCCGAGGC CGACCGCAAC
ACCCGCGACG TGTTCGGCGA CTACATCGAC ATCTACGACT TGACCCGGGC CGTGGACGAC
GGCGCCACCG TGCGCGTCTA CCACGAGAGC CGGCTCATCC CGGTGAGCCT GCCCAAGGAT
GTCGACCCCG AGGTGATCGA CGACCGGGCC GACCAGATCA CCGCGGGCCT GGACGACGCG
GAACGGCAGC GCATCCAGCG CAGCGTGGCG GTGATGAACG CCGTCTACGG CGCCCCGGAC
CGGCTGAAGA AGCTCGCCGC CGACCTGGTC TCCCACTGGG AGGCGCGCTC CGCGCAGATG
CGCAAGTTCA TCGACGGCCC GGGCAAGGGG CTGATCGTCT GCGCCACCCG GGACATCTGC
GCCCGGCTGT ACGAGGAGAT CATCGCGCTG CGCCCGGAGT GGCATGCCGA CGCCGACGAC
AAGGGCAAGA TCAAGGTCGT CTACACCGGC GACCCCAAGG ACGAGCCCCA CATCAGGAAG
CACGTGCGCC GGCCGTCGCA GCTCAAGGTG ATCCAGCGCC GGGCGAAGGA TCCGGACGAC
GAGCTGGAGC TGGTGATCGT CCAGTCGATG TGGCTGACCG GGTTCGACTC CCCGCCGCTG
CACACCCTCT ACTTGGACAA GCCGATGCGG GGGGCGGCGC TGATGCAGGC GCTCGCCCGG
GTGAACCGCC CTTTCCGGGC CAAGCAGGAC GGCCTGCTCG TCGGCTACGC GCCGGTCACC
CAGAGCCTGC ACGAAGCCCT GGCCGAGTAC ACCCAAGACG ACCAGGACAC CAGGCCGGTG
GGCCGCGACA TCGACGAGGT CGTGGCCCAG GTCCGCGACC TGCACGACGT GATCTGCAAC
GTGATCCTGC GCGGGTACGA CTGGCGCGGC AAGCTGGCGG CCAAGTCGGA CAAGGCCTAC
CGGGAGGCCG TGCTCGGCAC CGTCAACTAC CTGCGTAATC CCGCGTTGCC CGAAAACCAG
GTCGAGCCCG GAGAGGACAC CCTCGCCGAA AGGTTCCGGA AGGCGGCGGC GAGGCTGGAC
CGGCTCTACG CGCTGTGCGC CAGCAGCGGC CAGCTCAACC CCTACCGCGA CGACATCGCC
TTCTTCCAGG CCGTACGGGT TTGGATGGCC AAGTTCGACG TGGAAGACCG CCGGGCCCGC
GGCCTGCCGA TCCCAGCCGA AATCGCGCTC TACCTCAAGC AGCTCACCGC CGGGATCATC
GAAGCCGGAG GCGTCACCGA CATCTACCAA GCCGCCGGCA TCGACCGGCC CGACCTATCC
CACCTGGACG AAGCGTACCT GGAGCGTCTG CGGGCCTCGA AGACGCCCCA CCTCGCGATC
GAGGCGTTGC GGCGGGCCAT CGAGCAGACG ATGCGCCGGG TCACCCGGCA CAACGTGGTG
CGGCAGAAGA CCTTCTCCGA CCGGTTGATC GAGCTGATGA ACCGGTATAC CAACCAGCAC
CTCACTTCGG CCGAGATCAT CGCCGAGCTG GTGGCCATGG CCAAGGAGGT AGCCGCCGAC
GCCGACCGCG GCAAGGCGTT CAATCCCCCG CTGAGCGAGG ACGAGCTGGC CTTCTACGAC
GCCGTGGCGC AAAACGAGGC AGCGGTCAGG GAGATGGGCC CAGGGGTACT CGCCGACATC
GCACGCGACC TTGTGCGGAC GGTACGCAAC TCCGTCACCG TCGACTGGGT CTCCCGCGAC
GATGTGCGCG CCAAGCTCCG CACCATTATC AAGCGGCTGC TTGCCAAGCA CGGCTACCCG
CCGGACGCCG CGCCAGCCGC CATCGACCTG GTCATTCGGC AGATGGAGAC CTTCGCCGAG
GACTGGTCAC CCGAAGCCAG CCGGTAG
 
Protein sequence
MTTDFAPPLR DLSEARWEAL AMGTLGELGW QPLEGKAIAP GSGERESWSE LILPGRLRDA 
IARINPQLPP SAVDDALMEV TSARSRDALA ENRRIHEFLT KGIRSVVYTD EHGAEHNPTI
WLVDFREPEA NDFLAVNQVA VVEGEHRRRF DVVLYLNGLP VGLVELKKAG DAHADLQGAY
AQLRTYVDEL PLAFRANVVC VVSDGITARY GTAFTPFEHF APWNVDDEGR PVPQPPTRDE
DLALNLALHG LFQQSRFLEI LRGYVAFAET PGGTTKRIAK PHQYFAVSKA VGKTIEATRR
DGRAGVVWHT QGSGKSLEME LYAHQVMTHP SLGNPTIVVI TDRTDLDDQL YSAFLASELL
PEKPVQAATR DDLRTELLNR RTGGIIFTTL QKFGRTKEER EAGQAHPLLS DRRNVIVIVD
EAHRSHYDSL DGYARHLRDA LPNATFVAFT GTPISEADRN TRDVFGDYID IYDLTRAVDD
GATVRVYHES RLIPVSLPKD VDPEVIDDRA DQITAGLDDA ERQRIQRSVA VMNAVYGAPD
RLKKLAADLV SHWEARSAQM RKFIDGPGKG LIVCATRDIC ARLYEEIIAL RPEWHADADD
KGKIKVVYTG DPKDEPHIRK HVRRPSQLKV IQRRAKDPDD ELELVIVQSM WLTGFDSPPL
HTLYLDKPMR GAALMQALAR VNRPFRAKQD GLLVGYAPVT QSLHEALAEY TQDDQDTRPV
GRDIDEVVAQ VRDLHDVICN VILRGYDWRG KLAAKSDKAY REAVLGTVNY LRNPALPENQ
VEPGEDTLAE RFRKAAARLD RLYALCASSG QLNPYRDDIA FFQAVRVWMA KFDVEDRRAR
GLPIPAEIAL YLKQLTAGII EAGGVTDIYQ AAGIDRPDLS HLDEAYLERL RASKTPHLAI
EALRRAIEQT MRRVTRHNVV RQKTFSDRLI ELMNRYTNQH LTSAEIIAEL VAMAKEVAAD
ADRGKAFNPP LSEDELAFYD AVAQNEAAVR EMGPGVLADI ARDLVRTVRN SVTVDWVSRD
DVRAKLRTII KRLLAKHGYP PDAAPAAIDL VIRQMETFAE DWSPEASR