Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_3602 |
Symbol | |
ID | 5060077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4118974 |
End bp | 4120836 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640475857 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001160411 |
Protein GI | 145596114 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.438732 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCAT CGACCGGGTC ACTCGGCCGG GGCCGTCCGC GGATCGGACT CTCGCCCAAG GAGATCTGTG GCATGAACGT GTGGAGAAGA CTCTCCGTCC CGCGCCCGGC CCTCGCGCTG ACCGGCGCGG GCGTCCTGGT CGTAGGCGGG TTGGTGACCC TGCCGGTCAC CATGGCCCAC GCTGCGACCC AGTGTGAGGT GTCGTACACC ACCAATGACT GGCCCGGCGG TTTCACCGCT TCCCTCAGTA TCAAGAACAC CGGGGAGGCG CTGGATGGCT GGACGCTTCG CTTCACCTTC CCGAACAGTA GCCAGCAGGT GGTGCACGGC TGGTCGGCTC GGTACAGCCA GTCCGGGCAG AACGTTGCCG TGCAGAACGA GTCGTACAAC GGTTTGGTGC CCAGTGGCGC CACCATCGAG ATCGGCTTCA ACGGTTTGTG GAGTGGCAGC AACCCCAAAC CGACGTCGTT CACGCTCAAC GAGGTGGCCT GCAACGGCGG TGGTGGCCCC ACTACGGCAC CGCCGACAAC TACGCCGCCC ACCACGGCAC CGCCGACGAC CCCACCTCCC ACCACTCCAC CTCCCACCAC CGCGCCTCCC GGAGGCCGAG TGGACAACCC GTACCTGAAC GCGGTGGGCT ACGTGAATCC GGAGTGGAAG GCCAAGGCCG AGTCAGTGTC CGGTGGCAAC CGGGTGTCGA ACACGTCGAC GGCCGTCTGG ATCGACCGGA TTGCGGCCAT TGAGGGTACC GATGACAGCC AGTCCAATGG CCCGATGGGC ATACGAGATC ACCTGGATGA GGCGCTGAGT CAGGGGGCGG ACTACATCCA GTTCGTGATC TACAACCTAC CCGGTCGGGA CTGCGCTGCG CTCGCCTCGA ACGGTGAACT GGCACCGGAC GAGTTGCCCC GCTACAAGGC CGAGTTCATC GACCCGATAG CGGCGATCCA GAGTGACGCG GCATACCAGG ATCTCCGGAT CATCAACATC ATCGAGATCG ACTCGTTGCC GAACCTGCAC GCCAACACCG GCAGCAATCC GGGCGCTACC CCGACCTGTG AGCTTGTCAA GCAGAACGGC GCCTATGTCA ACGGCATCGG CTACGCGCTG GCCACGTTGG GCGCGATCAG CAACGTCTAC AACTACGTGG ACGCTGCCCA CCACGGCTGG ATCGGCTGGG ACACCAACTT CAGCCCGGTC GCCCTGCTCC TGAAGGATGC TGCCACAGCG TCCGGCAGCA CGCTCGACGA CGTGCATGGC TTCATCGTCA ACACCGCCAA CTACTCGGCA TTGCGCGAGC CCTACTTCCA GATCACCGAC ACGGTCAACG GCCAGACGAT CCGTCAGTCC ACGTGGGTGG ACTGGAACCA GTACGTCGAC GAGTTGTCGT TTGCGCAGGA CTTCCGCGAC GAACTAGTTG CCAAGGGATT CGACTCGGGT GTGGGAATGT TGATCGATAC TTCCCGCAAC GGTTGGGGTG GCAGTGCTCG ACCAACCGCC CCTGGGCCGA CGACCGATGT GGACAGCTAT GTCGACGGTG GTCGGGTCGA CCGACGAATC CACGCCGGGA ACTGGTGCAA CCAGTCTGGT GCGGGCCTGG GTGAGCGGCC GCAGGCCGCG CCGGAGCCCG GTATCGACGC CTATGTCTGG GTAAAGCCGC CGGGCGAGTC GGACGGCTCC AGCGAGGAGA TCCCGAACAA CGACGGCAAG GGCTTTGACC GGATGTGCGA CCCGACGTAC GAGGGCAACG CCCGTAACGG CTTCAACCCC AGCGGTGCCC TGCCTGACGC GCCGATCTCC GGTGCCTGGT TCCCGGCGCA GTTCCAGCAG CTCATGCAGA ACGCCTACCC GCCGCTGTCC TGA
|
Protein sequence | MSASTGSLGR GRPRIGLSPK EICGMNVWRR LSVPRPALAL TGAGVLVVGG LVTLPVTMAH AATQCEVSYT TNDWPGGFTA SLSIKNTGEA LDGWTLRFTF PNSSQQVVHG WSARYSQSGQ NVAVQNESYN GLVPSGATIE IGFNGLWSGS NPKPTSFTLN EVACNGGGGP TTAPPTTTPP TTAPPTTPPP TTPPPTTAPP GGRVDNPYLN AVGYVNPEWK AKAESVSGGN RVSNTSTAVW IDRIAAIEGT DDSQSNGPMG IRDHLDEALS QGADYIQFVI YNLPGRDCAA LASNGELAPD ELPRYKAEFI DPIAAIQSDA AYQDLRIINI IEIDSLPNLH ANTGSNPGAT PTCELVKQNG AYVNGIGYAL ATLGAISNVY NYVDAAHHGW IGWDTNFSPV ALLLKDAATA SGSTLDDVHG FIVNTANYSA LREPYFQITD TVNGQTIRQS TWVDWNQYVD ELSFAQDFRD ELVAKGFDSG VGMLIDTSRN GWGGSARPTA PGPTTDVDSY VDGGRVDRRI HAGNWCNQSG AGLGERPQAA PEPGIDAYVW VKPPGESDGS SEEIPNNDGK GFDRMCDPTY EGNARNGFNP SGALPDAPIS GAWFPAQFQQ LMQNAYPPLS
|
| |