Gene Strop_3061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3061 
Symbol 
ID5059525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3510166 
End bp3513006 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content70% 
IMG OID640475311 
Productalpha-1,6-glucosidase, pullulanase-type 
Protein accessionYP_001159876 
Protein GI145595579 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases 
TIGRFAM ID[TIGR02103] alpha-1,6-glucosidases, pullulanase-type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.57883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.116697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCCC CGCCGATCCC GCGCAAGGCG CTGCTCACTC TCGCCTCGCT GGCGACCCTC 
ACCCTCGCCG GCACCAACAC GTTCGTCCAC ATCGACCCAT CCGCCGGCAC CGACGTGCTC
GCCATGGCCG GTACCGAGCA GTGGGCTGTC GACCACTCCG CCACGCTGCT CACCGCCGGC
GAAGACCAAT CGGAGCCCGG CACCACCACG GACCTCGACC TGACCAGGCA GAAGGCACAC
TGGATCGATC GCGGCACCCT CGCCTGGCCG ACCGGACCAC CGGGCGGCAC GACGTACGCG
CTGGTCGCCG CCGCTTCCGG TGGCGTGACC ATTGTCGACG GTGAACTCGC CGGGACGTAC
CGGACAGTTC CGCTACGTGT CCGGCACCAC GGGCTGACCA AGGCACAACG CGAGCGGTTT
CCGCACCTGG CGGCGTATCA GGCTCTCGTC CTGGACCGGA AGCACCTGGC CGGGGTGCCG
GCGGCGCTGC GCGGGCAGCT TGTGGTGACC GAGCGGGACA CCGACGGCAC CCTGCGCGTC
GCCACCGGTG TGCAGGTCCC GGGTGTGCTC GATGACATCT ACGCCCCGGC GACTCGGGCG
CGACTCGGTC CAACCTTCGC CGGTGCGGTG CCGACGCTCG CGGTCTGGGC TCCCACCGCC
CGTACGGTGA CGCTGGAGTT GTTCGACACT CCGACGGCGC GACCGCGGCC GGTCACCATG
AGTCGGGATG ACCGCACCGG TGTGTGGTCG GTGCGGGGAA CCCCCTCCTG GACCGGGAAG
TACTACCGCT ACCGGGTCGA GGCGTGGCAG CCGGCGGAGC AGCAGATGGT AACTGCGGCG
GTGACCGACC CGTACTCGCT GGCGCTCGCA CCCGACTCCA CCCACAGCCA GATCGTCGAT
CTCGCCGACC CGCGGCTCGC ACCACCGGGC TGGACGAACC TGCGTAAGCC GGCGGCGGTA
CCGGCGACGA AGGCGCAGAT CTCGGAGCTG TCGGTGCGGG ACTTCTCGAT CGCTGACAGT
ACGGTCCCGG CCGAGCGACG CGGCACGTTC CGTGCCTTTA CCGACCCGGT CACCGCGGGA
ATGCGACACC TGAAGGCGCT CGGCGACGCC GGGACGACCC ACCTGCACCT GCTCCCGGCG
TTTGACTTCG CCACGATCCC CGAACGCCGC GCCGACCAGC AGCAACCCCC CTGCGACCTG
GCAGCGCTCC CGCCGGACTC GGACGAACAA CAACGCTGCG TCGCCGCGGT GGCGGACACC
GACGGCTACA ACTGGGGGTA CGACCCGCTG CACTACACCG TGCCGGAGGG CGGCTACGCG
GTCAACCCGG CCGGGGCGGC GCGGACCACC GAGTTCCGAC GGATGGTCGC CGGGGTGAAC
GGCGCCGGGC TGCGGGTGGT GCTCGACGTC GTCTACAACC ACACCTCGGC GACGGGCACC
GACCCGAAGT CGGTCCTGGA CCAGGTGGTT CCCGGCTACT ACCACCGGCT GCTGGCGGAT
GGCTCGGTCG CCACCTCGAC CTGCTGCGCC AACACCGCTC CCGAGCACGC CATGATGGGC
AAGCTCGTGG TCGACTCGGT GGTCACCTGG GCGAAAGAAT ACAAGGTGGA TGGCTTCCGG
TTCGACCTGA TGGGTCACCA CCCGAAGGCG AACATGCTGG CCGTCCGCGC GGCTCTGGAC
GAGCTGACCG TCGCCCGCGA CGGGGTGGAC GGTCGGGGCA TCCTGCTGTA CGGCGAGGGC
TGGAACTTCG GCGAGGTGGC CGACGGCGCG CGGTTCGTTC AGGCAACCCA GGCCAACATG
GCCGGCACGG GTATCGGCAC CTTCAACGAT CGGCTCCGGG ACGCGGTGCG TGGGGGCGGT
CCCTTCGACG CCAATCCGCG GCAGCAGGGC TTCGCCTCCG GGCTGTTCAC CGACCCCAAC
AACGACCCGG TCAACGGTTC GACGGCGCAG CAGCGCGCCC GACTGCTGCA CCAGCATGAC
CTGATCAAGG TGGGGCTCAG CGGCAATCTG CGCGGCTACC GGTTCACCAA CACCGCGGGT
GAGCAGGTCA CCGGGGCGCA GGTGGACTAC AACGGATCCC CGGCCGGCTA CACGGCCGCG
CCGGGTGAGT CGGTCACCTA CGTGGACGCG CACGACAACG AGATCCTGTA CGACGCGCTG
GCGTACAAGC TGCCGGCGGA CACCACGGCG GTGGACCGGG CCCGGATGCA GGTGCTCGCG
CTCGGCACCG CCGTGCTGGG GCAGGGCACC GGCTTCGTCA CCGCCGGCAC CGAGCGGCTG
CGGTCGAAGT CGCTGGACCG CAACTCGTAC AACTCGGGTG ACTGGTTCAA CCAGATCCGC
TGGGAATGCG CACAGGGCAA CGGGTTCGGC GTCGGGCTTC CGCCCGAGTC GGACAACAAG
GACAAGTGGC CGTACGCCCG GCCCCTGCTG GCCGACCCGG GGCTGGTGCC CGGCTGCGCG
ACGATGGACC TGGCCGAAGC TCGTTTTGCC GAACTGCTCC GGATTCGCTC CTCGTCGCCG
GTGTTCGGGT TGCGGACAGC AGAGCAGGTG CAGCGGCGGG TGGCCTTTCC GTTGTCCGGC
CGTACGGAGC AGCCCGGTGT GCTGACGATG ACGCTGGACA GCCGTGGGCT GGGTGGTCCG
TGGAAGTCAG TGACGGTGAT CTTCAACGCC ACCCCGCAGC CGGCCACCCA ACAGCTCATT
GACCTACGCG GGGCGGACGT GACACTGCAC CCGGTGCTGC GCACCTCCGC CGACGAGCTG
CTGCGGACGT CCTCGTTCGC GGCCGACAGC GGCACCTTCA CGGTTCCGGC TCGCAGCCTG
ACGGTCTTCG TGCAGCGGTA G
 
Protein sequence
MKPPPIPRKA LLTLASLATL TLAGTNTFVH IDPSAGTDVL AMAGTEQWAV DHSATLLTAG 
EDQSEPGTTT DLDLTRQKAH WIDRGTLAWP TGPPGGTTYA LVAAASGGVT IVDGELAGTY
RTVPLRVRHH GLTKAQRERF PHLAAYQALV LDRKHLAGVP AALRGQLVVT ERDTDGTLRV
ATGVQVPGVL DDIYAPATRA RLGPTFAGAV PTLAVWAPTA RTVTLELFDT PTARPRPVTM
SRDDRTGVWS VRGTPSWTGK YYRYRVEAWQ PAEQQMVTAA VTDPYSLALA PDSTHSQIVD
LADPRLAPPG WTNLRKPAAV PATKAQISEL SVRDFSIADS TVPAERRGTF RAFTDPVTAG
MRHLKALGDA GTTHLHLLPA FDFATIPERR ADQQQPPCDL AALPPDSDEQ QRCVAAVADT
DGYNWGYDPL HYTVPEGGYA VNPAGAARTT EFRRMVAGVN GAGLRVVLDV VYNHTSATGT
DPKSVLDQVV PGYYHRLLAD GSVATSTCCA NTAPEHAMMG KLVVDSVVTW AKEYKVDGFR
FDLMGHHPKA NMLAVRAALD ELTVARDGVD GRGILLYGEG WNFGEVADGA RFVQATQANM
AGTGIGTFND RLRDAVRGGG PFDANPRQQG FASGLFTDPN NDPVNGSTAQ QRARLLHQHD
LIKVGLSGNL RGYRFTNTAG EQVTGAQVDY NGSPAGYTAA PGESVTYVDA HDNEILYDAL
AYKLPADTTA VDRARMQVLA LGTAVLGQGT GFVTAGTERL RSKSLDRNSY NSGDWFNQIR
WECAQGNGFG VGLPPESDNK DKWPYARPLL ADPGLVPGCA TMDLAEARFA ELLRIRSSSP
VFGLRTAEQV QRRVAFPLSG RTEQPGVLTM TLDSRGLGGP WKSVTVIFNA TPQPATQQLI
DLRGADVTLH PVLRTSADEL LRTSSFAADS GTFTVPARSL TVFVQR