Gene Tbis_3102 details


Gene Information

Locus tag: Tbis_3102
Symbol:
ID: 9169621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobispora bispora DSM 43833
Kingdom: Bacteria
Replicon accession: NC_014165
Strand:
Start bp: 3636788
End bp: 3639892
Gene Length: 3105 bp
Protein Length: 1034 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: winged helix family transcriptional regulator
Protein accession: YP_003653690
Protein GI: 296271058
COG category:
COG ID:
TIGRFAM ID:
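
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the gene length includes the terminal stop codon. The function names are hypothetical, chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3636788, 3639892)
print(length, protein_length_aa(length))  # prints "3105 1034"
```

Under those assumptions the computed values agree with the Gene Length (3105 bp) and Protein Length (1034 aa) fields.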


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.306498
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.034161
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATCG ACCTGCGCCT GCTCGGCGAC GCACGGTGGC AGGATCGGCC CCTGGTCGGT 
GCGCGGGTGC AGGCGCTGCT CGCCGCCCTC GTGCTCGCCG GCCGTACCGT GAGCGCCGAA
CGGCTGATCG CGGAGGTCTG GGGAGTCGAC GAGCCCGCGA ACCCGGTGAA GGCACTGCAG
GTGCTGGTCT CCCGGGCGAG GTCGATCGTG GGAGCCCAGA CCCTCACCAC CGAGGACGGC
GGGTACCGGC TCCGGGTGCC GCCCGACCGG ATCGACGCGG ACCGGCTGCA CGCCCTGGCG
GGACGCGCTG AGGCGCTGCT CGCCGTGGAC CCCGCCGCGG CGGCCGCGGT GGCCCAGGAG
GCGCTCGCGC TCGGCGCCGG GGCGCTGCCC CCGGAGGGCG AGGGGCCGCT CGCCGAGTTA
CGCCGGCGCG CCGCGGACGA CCTGGCGGCC GCACGGGTGG TGCTCGCCCG GGCGCGGAGC
CGTACCGGAG ACCACGCCGC CGCGCTGGCC GAGCTCGACG AGGCGGTGCG GGCGCGTCCG
GACGACGAAG GGCTCCTCGC CGACCTGCTC CGCAGCGAGG CGGCGGTCCG CGGAGTGGGC
GCCGCGCTGG ACCGGTTCGC GCGCTACCGC TCCGGCCTGC GCGACCGGGT CGGCGCCGAC
GTCGGACCGG AGCTGCGGCG GGTCCACCAC GAGCTGCTGG CGCTCGACCG CCCGGTGCGC
TCCGGCGTGC GGTTCGACCC CACCCCGCTG CTCGGGCGGG ACGACGACCT GCGGCGGGTG
CGGTCCCTGC TCGCCACGTC CCGGGTCGTC TCGATCACCG GGCCGGGCGG GATCGGCAAG
ACCCGGCTCG CCCACGCCGT CGCCCGCGAG GCCCCGCAGC CGGTGGTGCA CTTCGTGGAG
CTGGTCGGCG TGACCGCGGC CGAGGACCTG CTCGGCGAGA TCGGGTCGGC GCTCGGCGTC
CGCGACTCGG TGGCCGGGCG CCGGAGGCTC AGCCCGCGGC AGCTCGCCGA CGTGCGCGCC
CGGATCGCCC GGCAGCTCGA CCTCGCTCCG GCGCTGCTCG TCCTCGACAA CTGCGAGCAC
CTGGTGGAGG CGGTCGCGGA CCTGGTCGCC TACCTGATCG CGACCACCCG GGACCTGCGG
GTGCTCACCA CGAGCCGCGC GCCGCTGGCG ATCGCCGCCG AGCGCGTCTA CCCGCTGCCC
CAGCTCGGCA TGGCCGACGC CGTCGCGCTG TTCCGCGACC GCGCGACCGC GGCCCGGCCC
GGGGTGCGCC TCGACGAGGA CGACGTGCGC GCCGTCGTGG GCCGCCTGGA CGGCCTGCCC
CTCGCGATCG AGCTCGCCGC CGCCAAGGTG CGGGTGATGT CGCCGGCGGA GATCGCCCGC
CGGCTCGACG ACCGGTTCGC CCTGCTGCGC GGCGGCGACC GGAGCGCCCA CGCCCGCCAC
CAGACCCTGC TCGCCGTGAT CGACTGGTCG TGGAACCTGC TCGGCGAGCG GGAGCGCCGC
GCGCTGCGCC GCCTGGCGGC CTTCCCGGAC GGCTTCATCC TCGAGGCCGC GGAGCAGGTG
CTCGGCGGTG ACGCCCTCGG CGCGGTCGAG GAGCTCGTGG CACAGTCCCT GCTGACCGTG
GTGGAGACCG CCGACGGCGT CCGGTATCGC ATGCTGGAGA CCGTGCGCGA GTTCGGCCGC
ATGCGGCTCG ACGCGGCCGG CGAGACCGCG GAGGCGCGGG CCGCCGTGCG GGCCTGGGCC
GTCGCCTACG CCGACCGGCA CGGTGCGCGG CTGCACTCGC CGGAGCAGGC CGACGCGGTG
GACCGCCTGC GCGCGGAGGA GACCAACCTC AGCGACATCC TGCGGGAGGC GCTCGACGAG
CCCGACCCGG AGGCGGTGGT GGTGCTGTTC TCCACGCTCG CCGGCTACTG GGCCATCCGC
GGGGACTACC TGCGCGGCCA CACCCTCACC GCCGCCGTCT CCGCCGCGCT GGACGGGTGG
ACGCCGCCGC CCCGGCTCCT CGACCGGACC CGGGTGACCC TCGCCGTCGC CCTGTTCCAC
GCGGTCTTCT TCTCCCCGGA CCACGCCCGC GCCCTGGCGT CGATGCTGTC CGGGCTGGGC
GCCGACTCGG AGAACCCCAA CGTCCGGGCC ATGACGACCG TGGTGCTCGC CCTGATGTCG
GATCCCGACA GCGGCTTCGC CGCACTCGCC GGCGACCCGG ACCCGTTCGT CCGCCAGATC
GCCCTGCACC TGATGAGCCA CGGCCTGGAG AACTCCGGGG ACCCGGCCGG CGCGATCGAG
ACGGCCCAGG CCGCGCTCGA GCTCACCGGG GAGGAGGACG GTCCTTGGCA GCGGGCCATG
CTGCACACCC TGCTCGCCCA GCTCCACGCG CAGTTCGGGG ACCGCGCCGC CGTCTCCCGG
CACGCCGCCG CGGCGCTCCC GGTGCTCGAG CGGCTCGGGG CGGTGGACGA CGCCGTCCAG
CTCCGGGCCA CCGTGGCGAT GGCGGCGCTC GCCGAGGGGA ACCGGGACGA GGCCGCGCGG
ATCATCGCGG AGCTCACCGA ACCCGGCGCG GAGGACGGGT TCGGCCTCTC GGCCGGGATC
TGCCGGGCGC AGCTCGCCCT CGCCGACGGC GACATCGCGG AGGGCCTCCG GCGGCACCGC
GAGATCGCCG AGGCGCTCCG GAGCAGGCAC GCCCCCGGCG CCCCCGACGC GTTCGCGCCG
TGGGCCATGT ACGGCGAGGC CGTCGCGGTC GCGGCGTTCG CCCAGTACGG CGAGGGCGAC
GACGGGGCGG ACCTGTTCGA GTCGCTGCGG GCGAAGGTCG CGGACCTGTT CGCCGTGGAC
CGCACGTTCC TGGACTTCCC GGTGACCGGG ATGGTGCTCG CCGCGCTCGG CATCTGGGGC
CTGCTCAAAG GGGCGCTGCC GGCCGAGGAC GCGGTGCGGC TGCTCGTCCT CGCCGACCGG
TTCGCGTACA ACCGGATCAC CCCGGTCCTG CACTGGCCGG TGCTCACCGC CCACGCCGAG
CGGGTCGCCC CCGGCCTGCT CGCCCGGATC GAGGCGGAGT ACGGCGACCG GCGCGGTCCC
GGCCTGCTGG CCGAGGTCCG CGCCGTGGTG GCGCGGGTGG CGTGA
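
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be stripped before it can be analysed. The following is a minimal sketch of that clean-up plus a GC-fraction calculation; the helper names are hypothetical, and only the first row of the sequence is used here for brevity, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied as displayed (with spaces).
first_row = "ATGCCGATCG ACCTGCGCCT GCTCGGCGAC GCACGGTGGC AGGATCGGCC CCTGGTCGGT"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene sequence to `clean_sequence` and then `gc_fraction` would give the whole-gene value to compare against the GC content field.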
 
Protein sequence
MPIDLRLLGD ARWQDRPLVG ARVQALLAAL VLAGRTVSAE RLIAEVWGVD EPANPVKALQ 
VLVSRARSIV GAQTLTTEDG GYRLRVPPDR IDADRLHALA GRAEALLAVD PAAAAAVAQE
ALALGAGALP PEGEGPLAEL RRRAADDLAA ARVVLARARS RTGDHAAALA ELDEAVRARP
DDEGLLADLL RSEAAVRGVG AALDRFARYR SGLRDRVGAD VGPELRRVHH ELLALDRPVR
SGVRFDPTPL LGRDDDLRRV RSLLATSRVV SITGPGGIGK TRLAHAVARE APQPVVHFVE
LVGVTAAEDL LGEIGSALGV RDSVAGRRRL SPRQLADVRA RIARQLDLAP ALLVLDNCEH
LVEAVADLVA YLIATTRDLR VLTTSRAPLA IAAERVYPLP QLGMADAVAL FRDRATAARP
GVRLDEDDVR AVVGRLDGLP LAIELAAAKV RVMSPAEIAR RLDDRFALLR GGDRSAHARH
QTLLAVIDWS WNLLGERERR ALRRLAAFPD GFILEAAEQV LGGDALGAVE ELVAQSLLTV
VETADGVRYR MLETVREFGR MRLDAAGETA EARAAVRAWA VAYADRHGAR LHSPEQADAV
DRLRAEETNL SDILREALDE PDPEAVVVLF STLAGYWAIR GDYLRGHTLT AAVSAALDGW
TPPPRLLDRT RVTLAVALFH AVFFSPDHAR ALASMLSGLG ADSENPNVRA MTTVVLALMS
DPDSGFAALA GDPDPFVRQI ALHLMSHGLE NSGDPAGAIE TAQAALELTG EEDGPWQRAM
LHTLLAQLHA QFGDRAAVSR HAAAALPVLE RLGAVDDAVQ LRATVAMAAL AEGNRDEAAR
IIAELTEPGA EDGFGLSAGI CRAQLALADG DIAEGLRRHR EIAEALRSRH APGAPDAFAP
WAMYGEAVAV AAFAQYGEGD DGADLFESLR AKVADLFAVD RTFLDFPVTG MVLAALGIWG
LLKGALPAED AVRLLVLADR FAYNRITPVL HWPVLTAHAE RVAPGLLARI EAEYGDRRGP
GLLAEVRAVV ARVA