Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3983 |
Symbol | |
ID | 5706658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4524324 |
End bp | 4526093 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641273408 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001538764 |
Protein GI | 159039511 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000432599 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0119912 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTTCGC CCAAGGAGAT CTGTGGCATG AACGTGTGGA GAAGGCTCTC CGGCCCACGT CCGGCCCTCG CGCTGACCGG TGTGAGTGCC CTTGTCATAG GTGGGTTGGT GACCCTGCCG GGCACCATGG CCCACGCCGC CACCCAGTGC GAGGTGTCGT ACACCACGAA CGACTGGCCC GGCGGGTTCA CCGCCGCCAT CAGCATCAAG AACACTGGAG ACGTGCTCGA CGGTTGGACG CTCCGCTTCG CCTTTCCGGA CAGCAGCCAG CAGGTGGTGC ACGGCTGGTC GGCCCGGTAT GGCCAGTCGG GTCAGGACGT CACCGCGCAG AATGAGTCGT ACAACGGTTC GGTGGCCAGC GGCGCTACCG TCGTCATCGG CTTCAACGGC TCGTGGGACG GCAGTAACCC CAGGCCGACG TCGTTCACTC TCAACGGGGT GGCTTGCAAC GGCGGCCCCA CCACGCCGCC GCCCACCACC GCACCGCCGC CCACCACCCC GCCACCCGGT GCCCGGGTCG ACAATCCGTA CCTGAACGCA GTGGGCTACG TGAACCCGGA GTGGAAGGCC AAGGCCGAGT CGGTTCCCGG CGGCGACCGG GTGTCGAACA CGTCGACCGC CGTCTGGATC GACCGGATCG CAGCCATCGA GGGGACGGAT GACAGCCAGT CCAACGGCCC GATGGGTGTG CGTGATCACC TAAACGAGGC GCTGCGTCAG GGTGCCGACT ACATCCAGTT CGTCATCTAC AACCTGCCCG GCCGGGACTG CGCTGCGCTC GCCTCGAACG GTGAGCTCGA GCCGGACGAG CTGCCCCGCT ACAAGGCCGA GTTCATCGAC CCGATCGCGG CTATCCAGAG TGACGCGATG TACCAGGACC TGCGGATCAT AAACATCATC GAAATCGACT CGCTGCCGAA CCTGCACGCC AACACCGGTA GCAACCCAGG TGCCACTCCG ACCTGCGACC TTGTCAAGCA GAACGGCGCC TACGTCAACG GCATCGGCTA TGCGCTAGCC ACGCTGGGTG CGATCAGCAA CGTCTACAAC TATGTGGACG CCGCGCACCA TGGTTGGATC GGCTGGGACA GCAACTTCAG CCCGGTCGCC TCACTCCTGA AGGAGGCCGC CACGGCATCC GGCAGTACGG TCGACAACGC GCACGGCTTC ATTGTCAACA CCGCCAACTA CTCGGCCTTG CACGAGCCCC ATTTCCAGAT CACCGACATG GTCAACGGCC AGTCGATCCG CCAGTCCACG TGGGTGGACT GGAACCAGTA CGTGGACGAG CTGTCCTTCG CCCAGGCGTT CCGCGACGAG TTGGTCACCA AAGGCTTCGA CTCCGGAGTC GGGATGTTGA TTGATACTTC CCGAAACGGC TGGGGTGGCA GCGCCCGGCC AACCGGTCCC GGTCCGATGA CTGACGTCGA CAGTTATGTC GACGGTGGTC GCGTCGACCG ACGAATCCAC GCCGGTAACT GGTGCAATCA GTCTGGCGCG GGCCTGGGTG AGCGGCCCAG GGCCGCGCCA GAGCCGGGCA TCGACGCCTA CGTCTGGGTG AAGCCGCCGG GCGAGTCCGA CGGTTCCAGC GAGGAGATTC CGAACAACGA CGGCAAGGGC TTCGACCGGA TGTGCGACCC AACGTACGAC GGCAATGCCC GTAACGGCTA CAACCCCAGT GGAGCCCTGC CCGACGCACC GATCTCCGGC GCCTGGTTCC CCGCCCAGTT CCAGCAGCTC ATGCAGAACG CCTACCCGCC GTTGCCCTGA
|
Protein sequence | MVSPKEICGM NVWRRLSGPR PALALTGVSA LVIGGLVTLP GTMAHAATQC EVSYTTNDWP GGFTAAISIK NTGDVLDGWT LRFAFPDSSQ QVVHGWSARY GQSGQDVTAQ NESYNGSVAS GATVVIGFNG SWDGSNPRPT SFTLNGVACN GGPTTPPPTT APPPTTPPPG ARVDNPYLNA VGYVNPEWKA KAESVPGGDR VSNTSTAVWI DRIAAIEGTD DSQSNGPMGV RDHLNEALRQ GADYIQFVIY NLPGRDCAAL ASNGELEPDE LPRYKAEFID PIAAIQSDAM YQDLRIINII EIDSLPNLHA NTGSNPGATP TCDLVKQNGA YVNGIGYALA TLGAISNVYN YVDAAHHGWI GWDSNFSPVA SLLKEAATAS GSTVDNAHGF IVNTANYSAL HEPHFQITDM VNGQSIRQST WVDWNQYVDE LSFAQAFRDE LVTKGFDSGV GMLIDTSRNG WGGSARPTGP GPMTDVDSYV DGGRVDRRIH AGNWCNQSGA GLGERPRAAP EPGIDAYVWV KPPGESDGSS EEIPNNDGKG FDRMCDPTYD GNARNGYNPS GALPDAPISG AWFPAQFQQL MQNAYPPLP
|
| |