Gene Tbis_1850 details

Gene Information

Locus tag: Tbis_1850
Symbol:
ID: 9168344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobispora bispora DSM 43833
Kingdom: Bacteria
Replicon accession: NC_014165
Strand:
Start bp: 2134964
End bp: 2136718
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: extracellular solute-binding protein family 5
Protein accession: YP_003652455
Protein GI: 296269823
COG category:
COG ID:
TIGRFAM ID:
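
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 2134964
END_BP = 2136718
GENE_LENGTH_BP = 1755
PROTEIN_LENGTH_AA = 584


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```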


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGC GACTGGCCAG GCCCGCGGTC ACCGCCGCCC TAGCCGCCCT CACGCTGGGC 
GTCTCCGCCT GCGGCGGTGG CGGCGGGGGA CAGAGCACCG GCGGTGGATC CGCCTCGTCC
GGCGACGGGT TCAACGCGGC TCTGGAGAAG GTGGTCAACC CTTCGGACAA GAAGGGCGGC
ACCCTGAAGA TGGCGATCAC CGAGGCGTGG GACTCCATCG ACCCCGGTGA CACCTACTAC
GCCCTCTCGT GGAACCTGCT CCGGCTCTAC GGGCGGCCGC TCGTCACCTA CAAGCCCGCC
CCGGGCGCGG AGGGCCGCGA GCTCGTCCCC GACCTGGCCG AGAGCCTCGG CACCCCGTCC
GACGGCGGCA AGACCTGGAC CTACCGGCTC CGCGAGGGCC TCAAGTTCGA GGACGGCACC
CCGATCACGT CCAAGGACGT CAAGTACGCG GTGCTCCGCT CGCTCGACAA GAAGACCTTC
GTCAACGGCC CGACGTACTT CAACGACTGG CTCGACCTGC CGAAGGACTT CGTCAGCGTC
TACGAGACGC CGGACGTCAA CACCGACCAG GCGATCGAGA CGCCGGACGA CCGGACGATC
GTCTTCCACC TGAAGAAGCC GTACGCGGGC TTCGACAACT TCGCGGCGCT GCCGTCCACC
GTCCCGGTCC CCAAGGACAA GGACACCGGG GTCAAGTACC GGCAGCACCC GATCGCCTCC
GGGCCGTACA TGTTCGAGAA GGTCGAGGAG GGCAAGCAGT ACACCCTGGT GCGCAACCCG
CACTGGGACC CGAACACCGA CCCGATCCGC AAGGCCCTCC CGGACCGCAT CGAGATCTCC
CTCGGCGTCG AGGCGAACGA CCTGGACAAC CGCCTCATCT CGGGCGACGT CCACGTCGAC
CTCGCCGGCA CCGGCGTGCA GGCGGCCGCC CTCGGCACCG TCCTGGGCGA CCCGGCGCTC
AAGGCGCGGG CGGACAACCC GACCAGCGCC CGCACCTGGT TCATCTCGAT CCTCGACACC
CCGCCGCTCG ACAACGTCGA GTGCCGCAAG GCGATCATCT ACGCGGCCGA CCGGGTCGGG
CTGCAGAACG CCTACGGCGG CGAGCTCCAG GGCGAGATCG CCACGGGCCT GATGCCGCCG
TCGATCGACG GATGGCAGAA GCTCGACCTG TACCCCACGG GCACCGGTGA CCTGGAGAAG
GCGAAGGCCG CCCTCCAGGC CTGCGGCCAC CCCGACGGCT TCGAGACCGT CATGGTCTAC
CGCTCGGACC GGCCGAAGGA GCAGGCCGCG GCCGAGTCGC TCCAGCAGGC GCTCGCCCGG
GTCGGCATCA AGCTCACCCT GAAGGGCTAC CCGACCAGCG ACTACTTCGC CAGCTACGCC
GGCAAGCCCG AGTTCACCCG GAAGAACAAC GTCGGCCTCG CCGCCCACGG GTGGGCCGCG
GACTGGAACG ACGGCTTCGG CTTCCTGTCG CAGATCGTCG ACAGCCGGAC CATCCGCGAG
TCCGGCAACT ACAACCTCAG CGTGAAGAGC AAGGAGATCG ACGAGCTGAT CGACAAGGCC
ATGGCCGAGC CGGACAAGGC CAAGCGCGAC GCCATCTGGG GTGAGATCGA CCGGAAGGTC
ATGGAGGGCG CGTACGTCCT GCCGGCGGTG TGGGCCAAGG CGCTGCTCCT GCGGGGCAAG
GGCGTGACCA ACGTGTTCGT CACGGGCGCC TTCGACATGT ACGACTACCT GAACATGGGC
GTCGAGCAGC CGTAA
 
Protein sequence
MKKRLARPAV TAALAALTLG VSACGGGGGG QSTGGGSASS GDGFNAALEK VVNPSDKKGG 
TLKMAITEAW DSIDPGDTYY ALSWNLLRLY GRPLVTYKPA PGAEGRELVP DLAESLGTPS
DGGKTWTYRL REGLKFEDGT PITSKDVKYA VLRSLDKKTF VNGPTYFNDW LDLPKDFVSV
YETPDVNTDQ AIETPDDRTI VFHLKKPYAG FDNFAALPST VPVPKDKDTG VKYRQHPIAS
GPYMFEKVEE GKQYTLVRNP HWDPNTDPIR KALPDRIEIS LGVEANDLDN RLISGDVHVD
LAGTGVQAAA LGTVLGDPAL KARADNPTSA RTWFISILDT PPLDNVECRK AIIYAADRVG
LQNAYGGELQ GEIATGLMPP SIDGWQKLDL YPTGTGDLEK AKAALQACGH PDGFETVMVY
RSDRPKEQAA AESLQQALAR VGIKLTLKGY PTSDYFASYA GKPEFTRKNN VGLAAHGWAA
DWNDGFGFLS QIVDSRTIRE SGNYNLSVKS KEIDELIDKA MAEPDKAKRD AIWGEIDRKV
MEGAYVLPAV WAKALLLRGK GVTNVFVTGA FDMYDYLNMG VEQP
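
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the snippet below strips that spacing and translates the first 60 bases of the gene sequence. It relies on the fact that translation table 11 assigns amino acids to codons the same way as the standard code (the tables differ in permitted start codons); the helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Amino-acid assignments
# are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Simplification: alternative start codons (allowed by table 11) are not
    special-cased; this gene starts with ATG, so that does not matter here.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAGAAGC GACTGGCCAG GCCCGCGGTC ACCGCCGCCC TAGCCGCCCT CACGCTGGGC"
print(translate(first_line))  # MKKRLARPAVTAALAALTLG
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way should give the complete protein for comparison.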