Gene Strop_2957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2957 
Symbol 
ID5059421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3379649 
End bp3382549 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content69% 
IMG OID640475208 
Productglycoside hydrolase family protein 
Protein accessionYP_001159773 
Protein GI145595476 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACCCA TGAACCGACG CCGCCGCTGG GCGACAGTCA CCGCGGCGGT GCTGGCGGTA 
GCCGGCGGGG CGACCTTACC CGCGCAGGCC GCGTCGGCCG CGCCGGCCTG CGACGTGGTC
TACACGACCA ACGACTGGGG CAACGGCTTC ACTGCGAACA TCACCCTCAC CAATCTCGGT
GACCCGATCA CGGGCTGGAC CCTGCGGTTT GCCTTCGCCG GCAACCAGAC CATCACCCAC
GGCTGGTCGG CGAGCTGGAG CCAGAGCGGC AACGACGTCA CCGCCACCAA CGAGTCGTGG
AACGGCAACC TCGGCACCGG CGGCACCGCG CAGATCGGCT TCAACGCCGC GTACAGCGGT
ACCAACGCCG ACCCGACCTC CTTCTCCGTC AACGGCGTCC CCTGCGGTGG TGTCCAGCAG
CCGCCGACGG TCTCGCTGAA CGTGCCGGCC GGGCCCTTCG AGGCGCCGGC CGACGTGCCG
CTGACGGCCA CCGCCAGCGA CCCGGACGGG AGTATCAGCA GGGTCGACTT CTATCGCAAC
GGCCTGCTGG TCAACACCGA CACCACCGCT CCGTACGCGT ACACTCTGCC GGGCCTACCG
GCCGGCTCCT ACATCGTGCA GGCCAAGGCG TACGACGACA CCGGGCTCAG CGCCATCGCG
GAGCAGTTCT TCACCGTCGC GCCCGCCTCC GGCCCGCAGC TGGTCGCCAC CCCCGCCACG
GTGGGCGTAC CGGAGGGTGG CAGCGCCACC GTGACGCTGA CGTTGAGTGC CGCACCGGCT
GCGGACGTCC TGGTGAGCCT CGGCCGGACC GGTGACCCCG ACCTCACCGT CGCCCCCACC
TCGCTGACGC TGACCACCGG GAACTGGAAC ACCGGTGTCG ACGTGACCGT GTCGGCGGCC
GAGGATGCCG ACACCGCCGG GGGCAGCGCG ACGATCACCG CCTCCGCCGC CGGTCTCGCC
GCCTTGACGA TCACCGCGAC GGAGATTGAC AACGACACTC CCGGTGGCGA CAACGAGTAC
GTCGCGCGGT TCCTCACCCA GTACGGAAAG ATCAAGAATT CCGGGTACTT CAGCTCCGAG
GGGGTGCCGT ACCACTCCAT CGAGACCCTG ATCGTCGAGG CGCCGGACCA CGGCCACGAG
ACCACGAGTG AGGCGTTCAG CTTCTGGCTC TGGCTGGAGG CGCAGTACGG CCGGGTGACC
GAGGACTGGG CACCGTTCAC CAACGCCTGG ACGGTGCTGG AGAACTACAT CATCCCCTCC
TCCGCCGACC AACCCACCGC GGGTGCTTCC GGCACCGCGC AGTACGCCGC CGAGTACGAC
CTGCCCAGCC AGTACCCGGC GCAACTGCAA CCGAGCGTCC CGGTCGGCCA GGACCCGTTG
CGGGGTGAAC TCCAGTCCAC CTACGGCACC GGTGACATCT ACGGCATGCA CTGGCTGCTC
GACGTGGACA ACACCTACGG CTTCGGTCGG TGCGGCGACG GCACCACCCG GCCGGCGTAC
ATCAACACCT TCCAGCGCGG GCAGCAGGAG TCGGTCTGGG AGACCGTCCC GCAGCCGTCC
TGCGAGACCT TCACCCACGG CGGGCAGTAC GGCTTCCTGG ACATCTCCGT CCAGGAGCAG
AACGCGCCGG CTCAGCAGTG GAAGTACACC AACGCGCCGG ACGCCGACGC CCGGGCGGTG
CAGGCCGCGT ACTGGGCGCT GACCTGGGCC AAACAGCAGG GAAGGGCCGC GGAGGTGGCG
GCCACCGTGG CCAAGGCCGC CAAGTTGGGC GACTACCTGC GGTACGCGAT GTTCGACAAG
TACTTCAAGC AGATCGGCAA CTGTGTCGGG GCGTCCACCT GCCCTGCCGG CAGTGGCCGG
GAGTCCGCGC ACTACCTGCT GTCCTGGTAC TACGCCTGGG GCGGCGCGTA CGAGTCGGGT
CAGAACTGGT CGTGGCGGAT CGGCTCCAGC CACAACCACT TCGGCTACCA GAACCCCTTC
GCGGCCTGGG CGTTGACCAC CGTGCCGGAA CTCGAGCCGC GGTCGCCGAG CGCGACCACC
GACTGGGCCC GGAGTCTGGA ACGGCAGCTG GAGTTGTATA CCTGGCTTCA GTCCGCCGAG
GGCGCGATCG CCGGTGGCGC GACCAACAGC TGGGGCGGCC GGTACGCGCA ACCGCCCGCG
GGCACGCCGA CCTTCTACGG CATGTTCTAC GACGAGAAGC CCGTCTACCA CGACCCGCCG
TCGAACCAGT GGTTCGGCAT GCAGGTCTGG TCGATGCACC GGATCGCCGA GTTGTACCTC
GAGACCGGTG ACGCCCGGGC CGAGGCGCTG CTGGACAGGT GGGTGCCGTG GGCGATCGCC
AACACCAGGC TGGGCGCCGA CTGGTCGATA CCGGCCGAAC TCACCTGGAC GGGCCAGCCG
AACACGTGGA ACCCGACCAA CCCGGAGCCG AACACCGACC TGCACGTCGA GGTGACCGAG
ACCGGCCAGG ACGTCGGCGC CGCCGCGGCC TATGCCCGGA CCTTGATCGC ATACGCGGCG
AGGTCGGGGA ACGTGACCGC GAAGACCACC GCCAAGGGGC TGTTGGACGC GTTGCACGCC
GCCAGCGATG CCCTGGGTGT GTCGACGGTG GAGAAGCGGG GCGACTACGA GCGCTTCGAC
GATGTCTACG ACGCCAGCAC CGGGCAGGGC CTCTACCTGC CGCCGGGCTG GACGGGCACG
ATGCCCAACG GCGACGTGAT CGAGGCGGGC CGGAGCTTCG TCGAGATCCG GTCGTTCTAC
CTCAACGATC CGGACTGGCC GAAGGTCCAG GCGTATCTGG ATGGCGGCGC CGAGCCGACG
TTCCGCTACC ACCGGTTCTG GGCTCAGGCT GATGTCGCGA TGGCGTACGC GGACTTCGGA
CGGCTCTTCC CGAACGATTG A
 
Protein sequence
MRPMNRRRRW ATVTAAVLAV AGGATLPAQA ASAAPACDVV YTTNDWGNGF TANITLTNLG 
DPITGWTLRF AFAGNQTITH GWSASWSQSG NDVTATNESW NGNLGTGGTA QIGFNAAYSG
TNADPTSFSV NGVPCGGVQQ PPTVSLNVPA GPFEAPADVP LTATASDPDG SISRVDFYRN
GLLVNTDTTA PYAYTLPGLP AGSYIVQAKA YDDTGLSAIA EQFFTVAPAS GPQLVATPAT
VGVPEGGSAT VTLTLSAAPA ADVLVSLGRT GDPDLTVAPT SLTLTTGNWN TGVDVTVSAA
EDADTAGGSA TITASAAGLA ALTITATEID NDTPGGDNEY VARFLTQYGK IKNSGYFSSE
GVPYHSIETL IVEAPDHGHE TTSEAFSFWL WLEAQYGRVT EDWAPFTNAW TVLENYIIPS
SADQPTAGAS GTAQYAAEYD LPSQYPAQLQ PSVPVGQDPL RGELQSTYGT GDIYGMHWLL
DVDNTYGFGR CGDGTTRPAY INTFQRGQQE SVWETVPQPS CETFTHGGQY GFLDISVQEQ
NAPAQQWKYT NAPDADARAV QAAYWALTWA KQQGRAAEVA ATVAKAAKLG DYLRYAMFDK
YFKQIGNCVG ASTCPAGSGR ESAHYLLSWY YAWGGAYESG QNWSWRIGSS HNHFGYQNPF
AAWALTTVPE LEPRSPSATT DWARSLERQL ELYTWLQSAE GAIAGGATNS WGGRYAQPPA
GTPTFYGMFY DEKPVYHDPP SNQWFGMQVW SMHRIAELYL ETGDARAEAL LDRWVPWAIA
NTRLGADWSI PAELTWTGQP NTWNPTNPEP NTDLHVEVTE TGQDVGAAAA YARTLIAYAA
RSGNVTAKTT AKGLLDALHA ASDALGVSTV EKRGDYERFD DVYDASTGQG LYLPPGWTGT
MPNGDVIEAG RSFVEIRSFY LNDPDWPKVQ AYLDGGAEPT FRYHRFWAQA DVAMAYADFG
RLFPND