Gene Tbis_1010 details


Gene Information

Locus tag: Tbis_1010
Symbol:
ID: 9167497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobispora bispora DSM 43833
Kingdom: Bacteria
Replicon accession: NC_014165
Strand:
Start bp: 1138102
End bp: 1139361
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003651626
Protein GI: 296268994
COG category:
COG ID:
TIGRFAM ID:
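
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1138102
END_BP = 1139361


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1260
print(protein_length(length))  # 419
```

Both results agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields.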


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.227353
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGG ACTTCGTCAA CGACCCAGCG TTCCTCCGCG GCATGACCAC CCGCCGGATC 
GGCCGCCGGG ACGCGTTCCG GCTGGCCGGG CTCTCCGCCG CCGGCCTCGC CCTCGCCGCC
TGTGGCGTGC AGGGCAAGGG CTCGCCGCGG CCGACCACCT CCGCGCAGGT CCAGTCGGAG
GTGGAGAAGT ACTGGTCGGG CAAGGTCAAG AACGGCCACG TCAACTTCGC GAACTGGCCG
CTCTACATGG ACCCCAAGCG GCCCGAGCTG AAGAAGTTCA CCGAGCGGAC CGGCATCACG
GTGACCTACA AGGAGGTCAT CCAGGACAAC CCGAGCTGGT TCGCCAAGAT CCAGCCGCTG
CTCGCCGCCG GGCAGTCGAT CGACTACGAC CTGATGGTCG TCACCAACGG GGTCCACTTC
ACCCAGCTCG TGCGGCTCGG CTACCTGGTC CCGCTCGACC ACTCCAAGCT CCCGAACTTC
GCGGCGAACG CGGCGGAGCG GTACAAGAAC GAGTCCTTCG ACCCGGGGAA CGTCTACAGC
ATCCCGTGGG CGTCCGGCAT GACCGGCATC GCCTACAACC CGAAGTACGT CGACACCCCG
CCGACGAAGA TCGCCGACCT GTGGAACCCC AAGTACAAGG GGAAGGTCGG CATGATGGCC
GACGCCCAGG AGATCGCCAA CTTCGGCCTG CTGCTGCTCG GCATCAAGCC CGAGACGTCG
ACCCCGGACG ACTGGGAGAA GGCGGCGGAG AAGCTCCGGG AGCAGCGGGA CTCCGGCATC
GTCCGGAAGT ACTACGACCA GTCGTACATC GACCCGCTCG CCAAGGGCGA CATCTGGCTC
ACCATGGCGT GGTCGGGCGA CGTCTTCCAG AAGAACATCT CCGACGGCAC GGACCTGCGG
TTCGTCATCC CCGAGGAGGG GGCGACGATC TGGACCGACA ACATGGTGAT CCCGAAGACC
GCGGAGAACC CGGTCGACGC CATCATGTTG ATGGACTTCT TCTACGAGGT GGAGATCGCG
GCCAGCCTCG CGGAGTACAT CAACTACGTC ACCCCGGTGC CCGCCGCCCA GGAGGTCGTC
CGGAAGCACG CCGCCGAGGC GACCGGTGAG GACAAGCGGC TCCTCGAGCA GCTGGCCGAG
AGCCCGCTGG TGTTCCCGTC CGAGGAGGAC TACGCGAAGC TGCACGACTA CCGCAACTTC
ACCAGCACCG AGGAGCAGCA GAAGTTCGAG CACATCTTCC AGGCGATCAC CACATCATGA
 
Protein sequence
MNKDFVNDPA FLRGMTTRRI GRRDAFRLAG LSAAGLALAA CGVQGKGSPR PTTSAQVQSE 
VEKYWSGKVK NGHVNFANWP LYMDPKRPEL KKFTERTGIT VTYKEVIQDN PSWFAKIQPL
LAAGQSIDYD LMVVTNGVHF TQLVRLGYLV PLDHSKLPNF AANAAERYKN ESFDPGNVYS
IPWASGMTGI AYNPKYVDTP PTKIADLWNP KYKGKVGMMA DAQEIANFGL LLLGIKPETS
TPDDWEKAAE KLREQRDSGI VRKYYDQSYI DPLAKGDIWL TMAWSGDVFQ KNISDGTDLR
FVIPEEGATI WTDNMVIPKT AENPVDAIML MDFFYEVEIA ASLAEYINYV TPVPAAQEVV
RKHAAEATGE DKRLLEQLAE SPLVFPSEED YAKLHDYRNF TSTEEQQKFE HIFQAITTS
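
The relationship between the two sequence blocks can be illustrated with a short translation sketch in Python. It assumes the gene sequence is given 5' to 3' on the coding strand and uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG). The helper names are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence block above.
first_line = "ATGAACAAGG ACTTCGTCAA CGACCCAGCG TTCCTCCGCG GCATGACCAC CCGCCGGATC"
print(translate(first_line))  # MNKDFVNDPAFLRGMTTRRI
```

The output matches the first 20 residues of the protein sequence. Passing the full 1260-base gene sequence to `translate` would be expected to stop at the terminal TGA codon.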