Gene Strop_2444 details


Gene Information

Locus tag: Strop_2444
Symbol:
ID: 5058907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 2746567
End bp: 2750382
Gene Length: 3816 bp
Protein Length: 1271 aa
Translation table: 11
GC content: 69%
IMG OID: 640474703
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001159269
Protein GI: 145594972
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
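
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 2746567
end_bp = 2750382
gene_length_bp = 3816
protein_length_aa = 1271

# Assumption: coordinates are 1-based and inclusive, so the span
# is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span from coordinates:", span_bp)
print("codons excluding stop:", coding_codons)
print("consistent:", span_bp == gene_length_bp and coding_codons == protein_length_aa)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.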


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.592108
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.597577
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCATCCCG AACCCGAACG TCGGCGGGGC GGCTTCCGTA CCCGGACCGC CGCCCTGCTC 
GCCCTCACCC TTACCGCATC CCTTCTGGTC CTGCCCGGGG CCGCGCAGGC CCTGCCGGAC
CCGGGCACGG CCACCGACAC CGCCGTCGTG CCGCCGTCGG CCGGTGCCGT GCAGACCGTC
GAGGACTACG AGGACGGCGT CCCACCCGAG GTTCTGCTCT TCGCCTCCAG CGAACCCGAG
CGGCCCGAGG TGGGCACCGT CGCAAGCGAC GACCGACCCG GCGCGGGCGG AGACAACGAT
GCCTTCTCCG TCCGGTACGA CATCGACGGC TGGGGTGGCT TCACCCACAA CTTCATCGCC
CCGGACGGCC ACCAGGACTG GCGGGCGTAC GACGGTTTCT CGTTCTGGGT GAAGGGGGAC
GGCAGCGGCC GGCAGGTCCA GTTCGAGATC AAGGACGGCG GCGAGCACGG TGAGGCGTCG
GAGCTGTGGG AGTCGTTCTT CATCGACGAC TCCACGGGCT GGAAACAGAT CCGGACGGTC
TTCCCCGACT TCGTCTGGCG CACCAGCTAC CAGCCGGCGG ACGGTCCCAA GGACAAGGAG
CTACAGCTCG ACCGGATGTG GGGGTTCGCG GTCAACCTGC CGCAGGGCAC CGGTGAACTG
CACTTCGACC AGGTGGAGCT TTTCACCAAC GTGGCGACGG TCGCCGACTT CGAGGCCCCG
CAGCCCCAGC TCAACCCGCC GGCCGGGCAG CCCGGCGTCA TCACCTTCAG CGGCGACGAC
TCCCGCATCC CGGAGCTGAG CTACGTCGAC GCCTCCCGCG ACGGCACGCC CGCCGACAAC
CAGGCCCTGG CCGTGGCCGT CGACACCACC ACCAGCTGGG CCGGCTTCGC GCACAACCTG
AGCTTCGACA CCGAGCCGCA GGACTGGAGC AGCTTCGGCG GGTTCCGGTT CTGGTACTTC
AGCTCGCTGA CCGTGCCACC GGCCGCGCCC GGCGCCGGCC GCCGAATCGA CGTGGAGATC
AAGGACGGCG GGACGGACGT CGACCACAGC GAACTCTGGA TCACCAGCTT CACCGAGGAC
TGGGTCGGCT GGCACTTGGT CGAGCTTCCG TTCTCGCAGT TCACGTACCG CACCGACTAT
CAACCCATCG GCGGCATCAA CCAGGAACTG GACCTCGACC AGATGTGGGG CTACGCGATG
CAGCCCCGCT CCGGATACGC CGACACCTTC CGCATCGACG ACGTTGAGGT CTACGGCGTT
CCGCAGGTGG GGCCGACGGT ACGGGTGGAC GCCGACCCGG CGGTCAGCCT GCTCGACGAG
GGCGAGTCGA CCACCGTCAC GGTTCGGCTC ACCAACACCG ATGACGCACC GCTGGACAAC
GAGGTGACCC TGCGATACGC CACCGGCGCC GGCTCGGCCA CCGCCGGCGA GGACTACGAG
CCGGTCGAGG GAGAGTTCGT CTTCCCCGCC GGCACCGCCT CCGGAACCAC TCGGCAGATC
ACCGTTCAGA CCCTCTCCGA CGAGCAGGCG GAAACAGCGG AGACCATCCC GCTGACGCTG
ACCGGCAGCG GCCTCGCCGT CCCCGAGGAC CCACCGAACA TCGTGCTCAA CGCGCACGGC
CTGCCGTACC TGGACGCCTC CCGGCCGGTT GACGAGCGGG TCGCCGACCT GCTCGGGCGG
ATGTCGGTCG AGGAGAAGGT CGGCCAGATG ACCCAGGCCG AGCGGAACGC CCTCGACTCG
CCGAACGACC TGGCCACCTG GCGACTCGGC TCGCTGCTCT CCGGCGGTGG CTCGACGCCC
ACCCCGAACA CCCCGGAGTC CTGGGCGGAC ATGGTGGACG GCTACCAGAC ACGCACGTTG
CAGACCCGAC TACAGATCCC GCTGCTCTAC GGCGTCGACG CGGTGCACGG CCACAGCAAC
GTCCAGGGCA CGACGATCTT CCCGCACAAC ATCGGGCTCG GCGCGGCTCG TGACCCGGAG
CTGATCGAAC GCGTCGGACA CATCACCGCC GAGGAGACCC GAGCGACCGG ACCACAGTGG
TCGTTCGCAC CCTGCGCCTG CGTGGCCCGT GACGACCGGT GGGGGCGCAC CTACGAGGCG
TACGGGGAGG ACCCGGCGCT GGTGATCGCC AACGAGACGG TGATCGACGG GCTCCAGGGC
CGCGAGCTGG CCAACCGCAA GGACGCCGAC CGCGTGCTCG CCTCGGTGAA GCACTACGCC
GGCGACGGCG GGACCGAGTA CCAGCCGGGC AACGGAGGGT ACCCGATCGA CCAGGGCGTC
GTCGTCATGA GCCGGGAAGA GTTCGACCGG ATCCACCTGG AGCCGTACAT CCCGTCGGTG
CGCGAGCACA ACGCGGGAAC GATCATGCCG TCGTACTCCA GCGTCGACTT CACCGACGAC
GGGGTCGGCA ACCCGGTCAA GATGCACGCC CACAAGGAGC TGCTCACCGA TGTCCTGAAG
CAGGAGATCG GCTTCGACGG CTTCCTGATC AGCGACTACG CCGCGATCGA CCAGATCCCC
GGAGACTACG ACAGCGACGT AGGCATCTCG ATCAACGCCG GGCTAGACAT GATCATGGTG
CCGAACGAGT ACCAGCGCTT CGAGGAGACG CTGCTCGGCG AGATCGAGGC CGGGAACATT
CCGATGTCCC GCATCGATGA CGCGGTCAGC CGAATCCTGA CCCAGAAGTT CCACCTCGGA
CTCTTCGAGC AACCGTTCAC CGACCGCACC CACCTGGCCG ACGTGGGCTC GCCCGAGCAC
CGCGCGGTGG CCCGCGAGGC TGCCGCCAAG TCCCAGGTGC TGCTCCGCAA CACCCACCAG
GTGCTGCCAC TGGCCACCAC CGGCAAGCTC TACGTCGCCG GCGGCAACGC CGACGACATC
GGCGCGCAGT CCGGCGGCTG GACCATCACC TGGCAGGGCG GTAACGGCGA CATCACCCCC
GGCACCAGCA TCCTCGACGG CATCCAGCAG GTCGCACCGG ACGCCGAGGT GACCTACAGC
GCCGACGCCT CCGCCCCGCT GGACGGGCAT GACCGGGCCG TCGTCGTGGT CGGCGAGCAG
CCGTACGCGG AGGGCATGGG CGACGTCGGC AACAACGGCT TCACCATGAC GTTGAGCGAC
GCCGAGAAGG ACACGGTGGC CCGGGTCTGC TCGGCGGTGG ACAACTGCGT GGTGCTGGTG
GTCTCCGGTC GTCCGCTCGT GCTCGACGAT GCACTCGCCC CCGCCGACGC CGTGGTCGCC
TCCTGGCTGC CCGGCACTGA AGGGGCCGGC GTGGCCGACG TGCTCTTCGG CGAGCGGCCG
TTCACCGGTC AGCTGCCGGT GTCCTGGCCG CGTTCGTTGG ACCAGGAGCC GATCAACGTC
GGTGACGCCG ACTACGACCC GCTCTACCCG TACGGCTGGG GTCTGCGCAC CGACCCGACC
CGGGACCGGC TGCACGAGCT GCGGGCCGAA CTGGCCGAGA TCGAGCAGGA CGGCTGGACC
CGGGCCGCGG TGAAGCTGCT GGACCGGTCG CTGCGCGACG GTAGCTCCTG GCATGAGGAT
GGCTCGGTCC GCGACGAGCG ACGAGTGATC ACGAAACTGA CGGTGATCTC CACCCTGCTG
GCCCTCAGCA GCCGGGACAA CGCGGCGCAG CAGGAACTGC TGGTGTCGAC GCTCCGAGAT
GTCGCGCAGG CAGCGATCGT CCGGGAGGGC GTCACCGCCC CCTCGGCGAC GCGAACCTCC
ACGCTGACCG CGGACGCGGA GCACGCGCTC CTGACCGGCA AGCCCATCGC GGCGACGTGG
AAGCTCGCCG CGGCCTGGCG GATCGCGACA GGCTAA
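
The GC content reported in the gene information can be recomputed from a nucleotide sequence formatted like the one above. This is a minimal sketch; `gc_percent` is a hypothetical helper name, and the sample string is only the first two 10-base blocks of the gene sequence, so its result is not expected to equal the whole-gene figure.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks used to group the
    sequence into 10-base blocks) is ignored.
    """
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, used only as a small sample.
sample = "GTGCATCCCG AACCCGAACG"
print(f"GC content of sample: {gc_percent(sample):.1f}%")
```

Passing the full gene sequence to the same function gives a value that can be compared with the 69% listed in the record.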
 
Protein sequence
MHPEPERRRG GFRTRTAALL ALTLTASLLV LPGAAQALPD PGTATDTAVV PPSAGAVQTV 
EDYEDGVPPE VLLFASSEPE RPEVGTVASD DRPGAGGDND AFSVRYDIDG WGGFTHNFIA
PDGHQDWRAY DGFSFWVKGD GSGRQVQFEI KDGGEHGEAS ELWESFFIDD STGWKQIRTV
FPDFVWRTSY QPADGPKDKE LQLDRMWGFA VNLPQGTGEL HFDQVELFTN VATVADFEAP
QPQLNPPAGQ PGVITFSGDD SRIPELSYVD ASRDGTPADN QALAVAVDTT TSWAGFAHNL
SFDTEPQDWS SFGGFRFWYF SSLTVPPAAP GAGRRIDVEI KDGGTDVDHS ELWITSFTED
WVGWHLVELP FSQFTYRTDY QPIGGINQEL DLDQMWGYAM QPRSGYADTF RIDDVEVYGV
PQVGPTVRVD ADPAVSLLDE GESTTVTVRL TNTDDAPLDN EVTLRYATGA GSATAGEDYE
PVEGEFVFPA GTASGTTRQI TVQTLSDEQA ETAETIPLTL TGSGLAVPED PPNIVLNAHG
LPYLDASRPV DERVADLLGR MSVEEKVGQM TQAERNALDS PNDLATWRLG SLLSGGGSTP
TPNTPESWAD MVDGYQTRTL QTRLQIPLLY GVDAVHGHSN VQGTTIFPHN IGLGAARDPE
LIERVGHITA EETRATGPQW SFAPCACVAR DDRWGRTYEA YGEDPALVIA NETVIDGLQG
RELANRKDAD RVLASVKHYA GDGGTEYQPG NGGYPIDQGV VVMSREEFDR IHLEPYIPSV
REHNAGTIMP SYSSVDFTDD GVGNPVKMHA HKELLTDVLK QEIGFDGFLI SDYAAIDQIP
GDYDSDVGIS INAGLDMIMV PNEYQRFEET LLGEIEAGNI PMSRIDDAVS RILTQKFHLG
LFEQPFTDRT HLADVGSPEH RAVAREAAAK SQVLLRNTHQ VLPLATTGKL YVAGGNADDI
GAQSGGWTIT WQGGNGDITP GTSILDGIQQ VAPDAEVTYS ADASAPLDGH DRAVVVVGEQ
PYAEGMGDVG NNGFTMTLSD AEKDTVARVC SAVDNCVVLV VSGRPLVLDD ALAPADAVVA
SWLPGTEGAG VADVLFGERP FTGQLPVSWP RSLDQEPINV GDADYDPLYP YGWGLRTDPT
RDRLHELRAE LAEIEQDGWT RAAVKLLDRS LRDGSSWHED GSVRDERRVI TKLTVISTLL
ALSSRDNAAQ QELLVSTLRD VAQAAIVREG VTAPSATRTS TLTADAEHAL LTGKPIAATW
KLAAAWRIAT G
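
The relationship between the two sequences above can be illustrated by translating the gene with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is an illustration, not the database's own method: it builds the standard codon table, which table 11 shares for amino acid assignments, and treats the table 11 alternative start codons (including the GTG that opens this gene) as methionine when they occur in the first position. The names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; it differs from the
# standard code in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored. An alternative start codon in the first
    position is translated as methionine (M).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGCATCCCG AACCCGAACG TCGGCGGGGC"))  # MHPEPERRRG
```

The printed peptide matches the first ten residues of the protein sequence. The same function can be applied to the full gene sequence to compare against the complete protein.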