Gene Information |
Locus tag | Strop_2957 |
Symbol | |
ID | 5059421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 3379649 |
End bp | 3382549 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640475208 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001159773 |
Protein GI | 145595476 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACCCA TGAACCGACG CCGCCGCTGG GCGACAGTCA CCGCGGCGGT GCTGGCGGTA GCCGGCGGGG CGACCTTACC CGCGCAGGCC GCGTCGGCCG CGCCGGCCTG CGACGTGGTC TACACGACCA ACGACTGGGG CAACGGCTTC ACTGCGAACA TCACCCTCAC CAATCTCGGT GACCCGATCA CGGGCTGGAC CCTGCGGTTT GCCTTCGCCG GCAACCAGAC CATCACCCAC GGCTGGTCGG CGAGCTGGAG CCAGAGCGGC AACGACGTCA CCGCCACCAA CGAGTCGTGG AACGGCAACC TCGGCACCGG CGGCACCGCG CAGATCGGCT TCAACGCCGC GTACAGCGGT ACCAACGCCG ACCCGACCTC CTTCTCCGTC AACGGCGTCC CCTGCGGTGG TGTCCAGCAG CCGCCGACGG TCTCGCTGAA CGTGCCGGCC GGGCCCTTCG AGGCGCCGGC CGACGTGCCG CTGACGGCCA CCGCCAGCGA CCCGGACGGG AGTATCAGCA GGGTCGACTT CTATCGCAAC GGCCTGCTGG TCAACACCGA CACCACCGCT CCGTACGCGT ACACTCTGCC GGGCCTACCG GCCGGCTCCT ACATCGTGCA GGCCAAGGCG TACGACGACA CCGGGCTCAG CGCCATCGCG GAGCAGTTCT TCACCGTCGC GCCCGCCTCC GGCCCGCAGC TGGTCGCCAC CCCCGCCACG GTGGGCGTAC CGGAGGGTGG CAGCGCCACC GTGACGCTGA CGTTGAGTGC CGCACCGGCT GCGGACGTCC TGGTGAGCCT CGGCCGGACC GGTGACCCCG ACCTCACCGT CGCCCCCACC TCGCTGACGC TGACCACCGG GAACTGGAAC ACCGGTGTCG ACGTGACCGT GTCGGCGGCC GAGGATGCCG ACACCGCCGG GGGCAGCGCG ACGATCACCG CCTCCGCCGC CGGTCTCGCC GCCTTGACGA TCACCGCGAC GGAGATTGAC AACGACACTC CCGGTGGCGA CAACGAGTAC GTCGCGCGGT TCCTCACCCA GTACGGAAAG ATCAAGAATT CCGGGTACTT CAGCTCCGAG GGGGTGCCGT ACCACTCCAT CGAGACCCTG ATCGTCGAGG CGCCGGACCA CGGCCACGAG ACCACGAGTG AGGCGTTCAG CTTCTGGCTC TGGCTGGAGG CGCAGTACGG CCGGGTGACC GAGGACTGGG CACCGTTCAC CAACGCCTGG ACGGTGCTGG AGAACTACAT CATCCCCTCC TCCGCCGACC AACCCACCGC GGGTGCTTCC GGCACCGCGC AGTACGCCGC CGAGTACGAC CTGCCCAGCC AGTACCCGGC GCAACTGCAA CCGAGCGTCC CGGTCGGCCA GGACCCGTTG CGGGGTGAAC TCCAGTCCAC CTACGGCACC GGTGACATCT ACGGCATGCA CTGGCTGCTC GACGTGGACA ACACCTACGG CTTCGGTCGG TGCGGCGACG GCACCACCCG GCCGGCGTAC ATCAACACCT TCCAGCGCGG GCAGCAGGAG TCGGTCTGGG AGACCGTCCC GCAGCCGTCC TGCGAGACCT TCACCCACGG CGGGCAGTAC GGCTTCCTGG ACATCTCCGT CCAGGAGCAG AACGCGCCGG CTCAGCAGTG GAAGTACACC AACGCGCCGG ACGCCGACGC CCGGGCGGTG CAGGCCGCGT ACTGGGCGCT GACCTGGGCC AAACAGCAGG GAAGGGCCGC GGAGGTGGCG GCCACCGTGG CCAAGGCCGC CAAGTTGGGC GACTACCTGC GGTACGCGAT GTTCGACAAG 
TACTTCAAGC AGATCGGCAA CTGTGTCGGG GCGTCCACCT GCCCTGCCGG CAGTGGCCGG GAGTCCGCGC ACTACCTGCT GTCCTGGTAC TACGCCTGGG GCGGCGCGTA CGAGTCGGGT CAGAACTGGT CGTGGCGGAT CGGCTCCAGC CACAACCACT TCGGCTACCA GAACCCCTTC GCGGCCTGGG CGTTGACCAC CGTGCCGGAA CTCGAGCCGC GGTCGCCGAG CGCGACCACC GACTGGGCCC GGAGTCTGGA ACGGCAGCTG GAGTTGTATA CCTGGCTTCA GTCCGCCGAG GGCGCGATCG CCGGTGGCGC GACCAACAGC TGGGGCGGCC GGTACGCGCA ACCGCCCGCG GGCACGCCGA CCTTCTACGG CATGTTCTAC GACGAGAAGC CCGTCTACCA CGACCCGCCG TCGAACCAGT GGTTCGGCAT GCAGGTCTGG TCGATGCACC GGATCGCCGA GTTGTACCTC GAGACCGGTG ACGCCCGGGC CGAGGCGCTG CTGGACAGGT GGGTGCCGTG GGCGATCGCC AACACCAGGC TGGGCGCCGA CTGGTCGATA CCGGCCGAAC TCACCTGGAC GGGCCAGCCG AACACGTGGA ACCCGACCAA CCCGGAGCCG AACACCGACC TGCACGTCGA GGTGACCGAG ACCGGCCAGG ACGTCGGCGC CGCCGCGGCC TATGCCCGGA CCTTGATCGC ATACGCGGCG AGGTCGGGGA ACGTGACCGC GAAGACCACC GCCAAGGGGC TGTTGGACGC GTTGCACGCC GCCAGCGATG CCCTGGGTGT GTCGACGGTG GAGAAGCGGG GCGACTACGA GCGCTTCGAC GATGTCTACG ACGCCAGCAC CGGGCAGGGC CTCTACCTGC CGCCGGGCTG GACGGGCACG ATGCCCAACG GCGACGTGAT CGAGGCGGGC CGGAGCTTCG TCGAGATCCG GTCGTTCTAC CTCAACGATC CGGACTGGCC GAAGGTCCAG GCGTATCTGG ATGGCGGCGC CGAGCCGACG TTCCGCTACC ACCGGTTCTG GGCTCAGGCT GATGTCGCGA TGGCGTACGC GGACTTCGGA CGGCTCTTCC CGAACGATTG A
|
Protein sequence | MRPMNRRRRW ATVTAAVLAV AGGATLPAQA ASAAPACDVV YTTNDWGNGF TANITLTNLG DPITGWTLRF AFAGNQTITH GWSASWSQSG NDVTATNESW NGNLGTGGTA QIGFNAAYSG TNADPTSFSV NGVPCGGVQQ PPTVSLNVPA GPFEAPADVP LTATASDPDG SISRVDFYRN GLLVNTDTTA PYAYTLPGLP AGSYIVQAKA YDDTGLSAIA EQFFTVAPAS GPQLVATPAT VGVPEGGSAT VTLTLSAAPA ADVLVSLGRT GDPDLTVAPT SLTLTTGNWN TGVDVTVSAA EDADTAGGSA TITASAAGLA ALTITATEID NDTPGGDNEY VARFLTQYGK IKNSGYFSSE GVPYHSIETL IVEAPDHGHE TTSEAFSFWL WLEAQYGRVT EDWAPFTNAW TVLENYIIPS SADQPTAGAS GTAQYAAEYD LPSQYPAQLQ PSVPVGQDPL RGELQSTYGT GDIYGMHWLL DVDNTYGFGR CGDGTTRPAY INTFQRGQQE SVWETVPQPS CETFTHGGQY GFLDISVQEQ NAPAQQWKYT NAPDADARAV QAAYWALTWA KQQGRAAEVA ATVAKAAKLG DYLRYAMFDK YFKQIGNCVG ASTCPAGSGR ESAHYLLSWY YAWGGAYESG QNWSWRIGSS HNHFGYQNPF AAWALTTVPE LEPRSPSATT DWARSLERQL ELYTWLQSAE GAIAGGATNS WGGRYAQPPA GTPTFYGMFY DEKPVYHDPP SNQWFGMQVW SMHRIAELYL ETGDARAEAL LDRWVPWAIA NTRLGADWSI PAELTWTGQP NTWNPTNPEP NTDLHVEVTE TGQDVGAAAA YARTLIAYAA RSGNVTAKTT AKGLLDALHA ASDALGVSTV EKRGDYERFD DVYDASTGQG LYLPPGWTGT MPNGDVIEAG RSFVEIRSFY LNDPDWPKVQ AYLDGGAEPT FRYHRFWAQA DVAMAYADFG RLFPND
|
| |
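The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names and the partial codon table are illustrative and not part of the record. The codon table covers only the codons that occur in the first 30 nucleotides of the gene sequence, all of which are standard assignments under translation table 11.

```python
# Values copied from the Gene Information table in this record.
START_BP = 3379649
END_BP = 3382549
GENE_LENGTH_BP = 2901
PROTEIN_LENGTH_AA = 966

# First three 10-nt blocks of the gene sequence and first 10 residues of
# the protein sequence, copied from the Sequence table in this record.
FIRST_30_NT = "ATGAGACCCA TGAACCGACG CCGCCGCTGG".replace(" ", "")
FIRST_10_AA = "MRPMNRRRRW"

# Hypothetical helper table: only the codons needed for FIRST_30_NT.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "AGA": "R", "CCC": "P", "AAC": "N",
    "CGA": "R", "CGC": "R", "TGG": "W",
}


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end]."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def translate_prefix(nt: str) -> str:
    """Translate complete codons of nt using the partial table above."""
    usable = len(nt) - len(nt) % 3
    return "".join(PARTIAL_CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    print(translate_prefix(FIRST_30_NT) == FIRST_10_AA)
```

Because the gene lies on the minus strand, the gene sequence shown here is already in coding orientation (it begins with ATG), so no reverse complement is needed before translating.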