Gene Tbis_2402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbis_2402 
Symbol 
ID9168906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobispora bispora DSM 43833 
KingdomBacteria 
Replicon accessionNC_014165 
Strand
Start bp2800341 
End bp2802041 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content68% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003652999 
Protein GI296270367 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.951994 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAGCAC GATGGCCGGT CGCCCTGACC GTACTGGTCT TTCTGTGGGC CGTCGGCGCA 
TGCGGCGCCG CGCCGCCCTC GGGCGGACGT GACGGCACGC CGCTGCCCTC TCCCGTCAAA
GCTCTCGACA TCAACCAGGT CGCCCGGGAC AAGGTGAAGA ACGGCGGCAC GCTGCGCTGG
GGGCTGAGCG ACTTCCCCAC GCAGTGGAAC TACAACCACG CCGACGGCTC CCTGGCGAAC
GTCAAGGTCG TCATCTCCGC GCTGCTGCCG CGGGTCTTCC GGTCCGACGA GCGGGGACGC
CTCTCCCTCG ACACCGACTA CGTCACCAAC GCGCGGATCA CGGCGACCTC GCCGAACCAG
GTGATCACCT ACACCATCAA CCCGAAGGCC CGGTGGTCCG ACGGCAAGCC GATCACCTGG
GAGGACTTCG CCGCCCAGTG GAAGGCCATG AGCGGCCGGG ACGGCGGCTA CCGGGCCGAC
TCGTCCATCG CGTACGAGAA CATCAAGAGC GTGGCGCGCG GGTCGAGCGA CCGCGAGGTG
GTCGTCACCC TCGCCGAGCC GTTCAACGAG TGGCAGTCGC TCTTCACCCC GCTCTACCCG
CGCTCGACCA ACGCGTCGCC GGACGAGTTC AACTCCGGCT GGATCAACCG GATCCCGGTC
ACCGCCGGGC CGTTCCAGGT GGAGAAGTTC GACGCCAAGG GCAAGACGAT CACGCTCGCC
CGGTCGCCGC AGTGGTGGGG CAACCCGGCG AAGCTCGACC GGATCGAGTT CCGGCACGTC
CAGCCGACCA CCATGCTGCG GGCCTTCACC AAGGGCGAGA TCGACGTGTT CGACATCGGC
CCGTCCCCGG AGAACTACGC CGCGGTGCGG GAGGTCTGGG ACGCGGTGGT CCGGCAGGCC
GCGGGCCCCG AGTACCGCCA GCTCACCTTC AACGGGGAGA GCGAGGTGCT CTCCGATCTC
CGCGTGCGGC AGGCGATCGC GCTCGCCATC GACCGCAAGG CGATCATGGA GATCGACCTC
AAGGGGCTCG GCTGGCCGAT CGTCACCCTC GACCACCACT TCCTCATGAA CAGCCAGTAC
GGGTACCGGA GCAACGCCGG CGCCCACGGC GCCTACGACC CCAAGCGCGC CGCCCGGCTG
CTCGACGAGG CCGGCTGGAA GCTGTCCGGG AAGGTGAGGT CGAAGAACGG CAAGCCGCTC
CGGCTGCGGT TCGTGGTTCC CGCCGGGGTG CGGGTGACCG AGACCCAGGC CCAGGTGGTG
CGCCTCATGC TCCAGAAGAT CGGCGTGCAG GTGGACGTGG CGCGGGTCCG CTTCCAGGAC
TTCTTCACCA AGCACCTGCT GCCCGGCAAG TTCGACATCA CCGCCTTCTC CTACCCGAGC
TCGCCGTTCC CGATCTCCAG CGCCTACGAC ATCTACGCCA ACGGGGAGCC CGGCCGGGGC
GACGAGGTGA AGTGGTACTC CAACCTGGGG CGAAGCGGGA GCAGCGAGAT CGACCAGGCG
ATGTACGACG CGGGGAGCAC GCTCGACCAG AAGGAGGTCA TCGAGCTGAT CCACGCCGCC
GACGCCCTGA TCTGGGAGAA GGTCAACGTG CTGCCCCTCT ACCAGGTCCC GCAGAACGTC
GCGGTCCGGT CCACGCTCGC CAACGTGGGC GCCAACGGCT TCTACGACCT GCGGTACGAG
GACATCGGGT ACGTGTCGTG A
 
Protein sequence
MRARWPVALT VLVFLWAVGA CGAAPPSGGR DGTPLPSPVK ALDINQVARD KVKNGGTLRW 
GLSDFPTQWN YNHADGSLAN VKVVISALLP RVFRSDERGR LSLDTDYVTN ARITATSPNQ
VITYTINPKA RWSDGKPITW EDFAAQWKAM SGRDGGYRAD SSIAYENIKS VARGSSDREV
VVTLAEPFNE WQSLFTPLYP RSTNASPDEF NSGWINRIPV TAGPFQVEKF DAKGKTITLA
RSPQWWGNPA KLDRIEFRHV QPTTMLRAFT KGEIDVFDIG PSPENYAAVR EVWDAVVRQA
AGPEYRQLTF NGESEVLSDL RVRQAIALAI DRKAIMEIDL KGLGWPIVTL DHHFLMNSQY
GYRSNAGAHG AYDPKRAARL LDEAGWKLSG KVRSKNGKPL RLRFVVPAGV RVTETQAQVV
RLMLQKIGVQ VDVARVRFQD FFTKHLLPGK FDITAFSYPS SPFPISSAYD IYANGEPGRG
DEVKWYSNLG RSGSSEIDQA MYDAGSTLDQ KEVIELIHAA DALIWEKVNV LPLYQVPQNV
AVRSTLANVG ANGFYDLRYE DIGYVS