Gene Tbis_3512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbis_3512 
Symbol 
ID9170044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobispora bispora DSM 43833 
KingdomBacteria 
Replicon accessionNC_014165 
Strand
Start bp4099650 
End bp4101392 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content68% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003654093 
Protein GI296271461 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.290938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAGCC GAGTGCGGTT GCTGATACCG CTCCTCGCGG TCGGCCTCCT GACGGCGTGC 
ACGGGCGGGG CCGAGACGCC CGCGCGATCC CCGGCACCGG CGGCCGCCGC GGTGCCGGGC
GGAGAATCGC TCCCCCGGCA CGAGACCCTC TACACCAGCG GCACGCAGTG GGGGCCACCG
GCGAACTGGA ACCCGCTCCG GGAATGGGAC TTCGCCACCG GGACGAAGGG CCTCGTCTAC
GAGACCCTCT TCCTCTACGA CCCGAGCATC GACCGGCTCA TCCCGTGGCT CGCCGAGAGC
GGCTCCTGGA CCGGGGAGAA GGAGTACACC CTCAAGCTCC GGAAGGGCAT CACCTGGGCG
GACGGCGAGC CGTTCACCGC CGAGGACGTG GTCTTCACCT TCGAGCTCGG CAAGCTGGAG
ACCGTCCCCT ACCACCAGCT CTGGGAGTGG CTGGCGCGGG CCGAGGCGGT GGACCAGCAC
ACGGTCAGGT TCACCTTCAC TGAGGCCAAC CACCAGGAGT GGTCGACCCA CCTCTACAGC
CGGGCGATCG TGCCCAAGCA CCTGTGGGAG GTCCGGTCCG AGGAGGAGGT GATGAACGGC
GCCAACGAGA ACCCCATCGG CACCGGGCCG TACGCCTACC ACTCGCACGA CCAGGACCGC
ATGGTCTGGG TGCGCCGGGA CGGCTGGTGG GCGACCAAGG TGATCGGCAA GCGGGTCGCG
CCCAAGTACA TCGTGGACAT CGTCAACTCG AGCAACGAGG TGGCGATGGA CTGGCTGCTC
CAGAAGCACC TCGACCTGAG CAACAACTTC CTCCCGGGCG TCGCCAACCT GGTCACCGGT
GACTTCGGCC TCCAGACCTA CTACAACCGG CCGCCGTACA TGCTCGCCGC GAACACGGCC
TGGCTGGTGA TGAACACCAA GAAGAAGCCG ATGGACGACC CGGTGTTCCG GAGGGCGCTC
GCCCACGCCA TCGACACCAG GAAGATCGTC GAGGGCGTGT ACCAGAACCT GGTGCAGGCG
GCGAACCCGA CCGGGCTCCT CCCGCAGTGG AGCAAGTACA TCGACCAGGA CGTGGTGAAC
CGGCTCGGCT TCTTCTACAG CCCGGCCAAG GCGAAGGAGC TGCTCATCGA CGCCGGCTAC
CGGGACCGGG ACGGGGACGG CTTCATGGAG TCGCCCAGCG GGGCGAAGAT CGCGCTCAAG
ATCGCCGTGC CGGCCGGGTG GACCGACTGG ATGGAGGCCG CCCGGGTGAT CAGCGAGGGC
GCCAAGGGGG CCGGGATCAA CCTCGAGCCG GAGTTCCCCG ACTACAACGC GCTCGTCGAC
GCCCGCAACT CCGGCAAGTT CGACATGGTC CTCAACAACG ACCGCCAGCT CGCCAGCACC
CCGTGGCGGT ACTACGACTT CATCTTCCGC CTGCCGGTGC GCAAGCAGCA GACCACGGCG
AACTTCGGCC GGTACGAGAA CAAGCAGGCC TGGCGGCTGG TCCGGGAGCT CGACGGCGTC
CGGACCGACG ACGTCGAGGG GATGAAGCGG ATCATCTCCC GGCTCCAGGA GATCCACCTC
CGGGAGATGC CGATCATCCC GCTCTGGTAC AACGGGCTGT GGGCGCAGAT GACCAGCGCG
GTCTGGACGA ACTGGCCGTC CGAGGCGATG GGAGCCCCCA AGCACGCTCC GAGCATGTGG
CGGGACTGGA TGGAGATGGG CGGCCTCCTC ATGCTGACCG AGCTCCGGCC GGCGGCGGGC
TGA
 
Protein sequence
MRSRVRLLIP LLAVGLLTAC TGGAETPARS PAPAAAAVPG GESLPRHETL YTSGTQWGPP 
ANWNPLREWD FATGTKGLVY ETLFLYDPSI DRLIPWLAES GSWTGEKEYT LKLRKGITWA
DGEPFTAEDV VFTFELGKLE TVPYHQLWEW LARAEAVDQH TVRFTFTEAN HQEWSTHLYS
RAIVPKHLWE VRSEEEVMNG ANENPIGTGP YAYHSHDQDR MVWVRRDGWW ATKVIGKRVA
PKYIVDIVNS SNEVAMDWLL QKHLDLSNNF LPGVANLVTG DFGLQTYYNR PPYMLAANTA
WLVMNTKKKP MDDPVFRRAL AHAIDTRKIV EGVYQNLVQA ANPTGLLPQW SKYIDQDVVN
RLGFFYSPAK AKELLIDAGY RDRDGDGFME SPSGAKIALK IAVPAGWTDW MEAARVISEG
AKGAGINLEP EFPDYNALVD ARNSGKFDMV LNNDRQLAST PWRYYDFIFR LPVRKQQTTA
NFGRYENKQA WRLVRELDGV RTDDVEGMKR IISRLQEIHL REMPIIPLWY NGLWAQMTSA
VWTNWPSEAM GAPKHAPSMW RDWMEMGGLL MLTELRPAAG