Gene Tbis_0842 details


Gene Information

Locus tag: Tbis_0842
Symbol:
ID: 9167327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobispora bispora DSM 43833
Kingdom: Bacteria
Replicon accession: NC_014165
Strand:
Start bp: 956229
End bp: 957527
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003651459
Protein GI: 296268827
COG category:
COG ID:
TIGRFAM ID:
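
The coordinate and length fields in this record are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 956229
END_BP = 957527


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1299 432
```

Both results agree with the Gene Length (1299 bp) and Protein Length (432 aa) fields.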


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.516605
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCACTC GCCACGTGGG ACGCCGGATC GGCCGGCTCG CCGCCCCCAT GCTCGCCGCC 
ACCCTCATCG CCACCGCGGC GGCCTGCGGC GGCGACGGCT CCACCCAGAC CGGCTCCGGC
GGCCAGGAGA AGATCAAGCT GACGGTGGGC CTGTTCGGCG ACTTCGGCTT CGAGCCGCTG
TACGAGGAGT TCAAGAAGAC CCACCCGAAC ATCGAGATCG AGGAGCGCCA GGCGTCCTTC
GCCGACCACC ACACCAACCT CGCCGCCCAC CTCGCCACCG GTGCGGGCGC CGCGGACGTC
GAGGCCATCG AGGTCGGCTA CATCAGCCAG TTCACCGCCC AGCCGGACAA GTTCCACGAC
CTGCGCGAGT ACGGGCTCGA CAAGCGGCAG GGGGAGTACC TCCCCTGGAA GTGGCAGCAG
GGCGTCGCCC CGAACGGCGC GCTCATCGGC CTCGGCACCG ACGTCGGCGG CCTGGCCATG
TGCTACCGGA CCGACCTGTT CGAGAAGGCC GGCCTCCCCA CGGACCGGGA CGAGGTCTCC
GCGCTCTGGC CCACGTGGGA GGACTTCATC GAGACCGGCA AGAAGTTCAT GAAGTCCGCG
CCCAAGGGCA CGGCGTTCAT CGACAGCCCG GGCGAGATCC TCCGCGCGAT CATCGCGCAG
GCCCCGGTCG GCGTGTACGA CCAGAACGAC AACATCGTCG TGGCGACCAA CCCGGACGTG
AAGCGCGCCT GGGACCTGTC GGTCCAGATG ATCCAGGCCG GCCTCTCCGC GAAGATCGCC
GCGTTCACCC CCGAGTGGAA CACCGGCTTC AGCAAGGGCA CCTTCGCCAC CGTGGTCTGC
CCCGCCTGGA TGACCGCGTA CATCCAGGAC CAGGCCAAGA ACGCGGCGGG CAAGTGGGAC
ATCGCCGCGA TCCCCGGCGG CGCGGGCAAC TCCGGCGGCA GCCACCTGAC CGTGCCCAAG
CAGAGCAAGC ACCCCAAGGA GGCGGCCGAG CTGGTCGACT TCCTCACCTC CGCGGAGAGC
CAGGCCAAGG TCTTCAAGAC CACCGGCAAC TTCCCGTCCA TCCCGTCGCT CTACGACCAG
CCGGACATCC AGAACTTCAC CAAGGACTTC TTCAACGGCG CCCCGGTCGG GAAGATCTAC
TCCGAGGCCG CGAAGAAGCT CCAGCCGCAG CACCTCGGCC CGCGTGAGGG CGACGTGCGC
ACCGCGATCG GCAACGGCCT CGGCCGCGTC GAGCAGGGCA AGCAGACGCC CGAGGAGGCC
TGGGCCCAGG TCCTCAAGGA CGTCGAGAAG ATCAAGTAA
 
Protein sequence
MGTRHVGRRI GRLAAPMLAA TLIATAAACG GDGSTQTGSG GQEKIKLTVG LFGDFGFEPL 
YEEFKKTHPN IEIEERQASF ADHHTNLAAH LATGAGAADV EAIEVGYISQ FTAQPDKFHD
LREYGLDKRQ GEYLPWKWQQ GVAPNGALIG LGTDVGGLAM CYRTDLFEKA GLPTDRDEVS
ALWPTWEDFI ETGKKFMKSA PKGTAFIDSP GEILRAIIAQ APVGVYDQND NIVVATNPDV
KRAWDLSVQM IQAGLSAKIA AFTPEWNTGF SKGTFATVVC PAWMTAYIQD QAKNAAGKWD
IAAIPGGAGN SGGSHLTVPK QSKHPKEAAE LVDFLTSAES QAKVFKTTGN FPSIPSLYDQ
PDIQNFTKDF FNGAPVGKIY SEAAKKLQPQ HLGPREGDVR TAIGNGLGRV EQGKQTPEEA
WAQVLKDVEK IK
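
The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in its set of permitted start codons; this gene begins with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first 60-base line of the gene sequence is used as input.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = (
    "ATGGGCACTC GCCACGTGGG ACGCCGGATC GGCCGGCTCG CCGCCCCCAT GCTCGCCGCC"
)
print(translate(first_line))  # MGTRHVGRRIGRLAAPMLAA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.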