Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3180 |
Symbol | |
ID | 5705793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3665389 |
End bp | 3668280 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272611 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001537978 |
Protein GI | 159038725 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000076749 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCCCGAC GTCGCCGCCG CGCGATGATC ATCACGGCGA TGGTGGTGGC GGCCGGCGGA GCGACCGTGC CCGCCAGGGC CGCCCAAGCC ACGCCGGCCT GCCACGTGGT CTACACGACC AACGACTGGG GCAGCGGCTT CACCGCGAAC ATCGCCCTCA CCAATCTCGG CGACCCGATC CAGGACTGGA CCCTGCGGTT CGCGTTCGCC GGCAACCAGA CCATCACCCA CGGCTGGTCG GCGACCTGGA GTCAGGACGG TAGCGACGTC ACCGCCACCA ACGAGTCGTG GAACGGTGAC CTCGGCACCG GTGATGCCGC GCACATCGGA TTCAACGCCA CGTACAGCGG CACCAACGCC GAGCCGACGT CCTTCTCCGT CAACGGCGTG AGCTGCGGCG GCGTACAGCA GCCGCCCACG GTCACGCTGG ACGTGCCCGC CGGGCCGTTC GAGGCGCCCG CCGACGTGCC GCTGACGGCC GCCGCCAGCG ATCCGGACGG GACCATCAGC AAGGTTGACT TCTACCGCAA CGGCCTGCTG GTCGACACCG ACACCACGGC CCCGTACGCG TACACCCTGC AGGCGCTGCC GGCGGGCACC TACACCGTGC AGGCCAAGGC GTACGACGAC ACCGGGCGCA GCGCCGTCGC GGAGAAGTCG TTCACCGTCG AGCCCGCGTC CGGCCCACGA CTGGTCGCCA CCCCAGCCGC GGTGGGCGTA CCCGAGGGCG CCAGCGCCAC CGTCACGCTG ACGTTGAGCG AGGCGCCGGC CGCGAGCGTC CCGGTGAGCC TCACCCGCAC CGGTGACACT GACATCACTG TCGCGCCCAC GTCGCTGACG CTGACCACCG GCAACTGGAA CACCGGTGTC ACCGTGACCG TGTCGGCGGC CGAGGACGCC GACACCGCCG GGGGCACCGC GACGATCACC GCGTCCGCTG CCGGCCTCGC CGCACTGGCG ATCACCGCGA CGGAGATGGA CAACGACAAC CCGGGCGGCG ACAACGAGTA CATCGCGCGG TTCCTCACCC AGTACGGCAA GATCAAAAAT TCGGGGTACT TCAGCCCCGA GGGCGTGCCG TACCACGCCA TCGAGACCCT GATCGTCGAG GCGCCCGACC ACGGCCACGA GACCACGAGC GAGGCATTCA GCTTCTGGCT CTGGCTGGAG GCGCAGTACG GCCGGGTGAC GGAGAACTGG GCGCCGTTCA ACACCGCCTG GACGGTGCTG GAGAACTACA TCATCCCGTC GTCGGCCGAC CAGCTCACGG CCGGCGCTCC CGGTACCGCT CAGTACGCCG CCGAGTACGA CCTGCCCAGC CAGTACCCGT CGCAGCTGCA ACCGAACGTT CCGGTCGGCC AGGACCCGCT GCGGGGTGAG CTCCAGTCCA CCTATGGCAC CGGTGACATC TACGGCATGC ACTGGCTGCT CGACGTGGAC AACACCTACG GCTTCGGTCG GTGCGGCGAC GGCACCACCC GGCCGGCGTA CATCAACACC TTCCAGCGCG GTCAGCAGGA GTCGGTCTGG GAGACCGTCC CGCAGCCCTC CTGCGAGACC TTCACCCACG GCGGGCAGTA CGGCTTCCTG GACATCTCCG TCAAGGAGCA GAACGCCCCG GCGAAGCAGT GGAGGTACAC CAACGCGCCG GACGCCGACG CCCGCGCCGT GCAGGCTGCC TACTGGGCGC TGACCTGGGC CAAGCAGCAG GGCAAGGCGG CGGATGTGGC GGCCACTGTG GCCAGGGCCG CCAAGTTGGG CGACTACCTG CGCTACGCGA TGTTCGACAA GTACTTCAAG AAGATCGGCA ACTGTGTCGG GGCGTCCACC TGCCCGGCCG GCAGTGGCCG AGAGTCCGCG CACTACCTTT TGTCCTGGTA CTACGCCTGG GGCGGCGCGT ACGAGCCGGG TCAGGACTGG TCGTGGCGGA TCGGCTCCAG CCACAACCAC TTCGGCTACC AGAACCCGTT CGCCGCCTGG GCGTTGACCA ACGTGCCGGA ACTCGAGCCG AGGTCGCCGA GCGCGACCAC CGACTGGGCC AGGAGCCTGG AGCGGCAGCT GGAGTTGTAC ACCTGGCTGC AGTCCGCCGA GGGCGCGATC GCCGGCGGCG CGACCAACAG CTGGGGCGGC CGGTACGCCC AGCCACCGGC TGGCACACCG ACCTTCTACG GCATGTTCTA CGACGAGAAG CCCGTCTACC ACGACCCGCC GTCGAACCAG TGGTTCGGCA TGCAGGTCTG GTCGATGCAC CGCGTCGCCG AGCTGTATCT GCAGACCGGT GACGCCCGGG CCGAGGTGTT GCTGGACAGG TGGGTGCCGT GGGCGATCGC CAACACGAGC CTGGGCGCCG ACTGGTCGAT CCCGGCTGAA CTGACGTGGA CGGGCAAGCC GAACACGTGG AGCCCGACCA ACCCGCAGCC GAACACCGAC CTACACGTCG AGGTCACCGA CACCGGGCAG GACGTCGGTG CCGCGGCCGC CTACGCCCGG ACCCTGATCG CCTACGCGGC GAGGTCAGGA GACGTGGCCG CCAAGACCAC CGCCAAGGGG CTGTTGGACG CGCTGCACGC CGCCAGCGAT GCCCTGGGCG TGTCGACGGT GGAGAAGCGG GGCGACTACG AGCGCTTCGA CGACGTCTAC GACGCGAGCA CCGGGCAGGG TCTCTACCTC CCGCCGGGCT GGACGGGCAC GATGCCCAAC GGTGATGTGA TCGCGGCGGG TAGGAGTTTC GTCGACATCC GGTCGTTCTA CCTGAACGAC CCGGATTGGC CGAAGGTGCA GGCATACCTT GATGGTGGCG CCGAGCCGAC GTTCCGTTAC CACCGTTTCT GGGCCCAGGC CGACGTCGCG ATGGCGTACG CCGACTTCGG GCGGCTGTTC CCGACTGGTT GA
|
Protein sequence | MARRRRRAMI ITAMVVAAGG ATVPARAAQA TPACHVVYTT NDWGSGFTAN IALTNLGDPI QDWTLRFAFA GNQTITHGWS ATWSQDGSDV TATNESWNGD LGTGDAAHIG FNATYSGTNA EPTSFSVNGV SCGGVQQPPT VTLDVPAGPF EAPADVPLTA AASDPDGTIS KVDFYRNGLL VDTDTTAPYA YTLQALPAGT YTVQAKAYDD TGRSAVAEKS FTVEPASGPR LVATPAAVGV PEGASATVTL TLSEAPAASV PVSLTRTGDT DITVAPTSLT LTTGNWNTGV TVTVSAAEDA DTAGGTATIT ASAAGLAALA ITATEMDNDN PGGDNEYIAR FLTQYGKIKN SGYFSPEGVP YHAIETLIVE APDHGHETTS EAFSFWLWLE AQYGRVTENW APFNTAWTVL ENYIIPSSAD QLTAGAPGTA QYAAEYDLPS QYPSQLQPNV PVGQDPLRGE LQSTYGTGDI YGMHWLLDVD NTYGFGRCGD GTTRPAYINT FQRGQQESVW ETVPQPSCET FTHGGQYGFL DISVKEQNAP AKQWRYTNAP DADARAVQAA YWALTWAKQQ GKAADVAATV ARAAKLGDYL RYAMFDKYFK KIGNCVGAST CPAGSGRESA HYLLSWYYAW GGAYEPGQDW SWRIGSSHNH FGYQNPFAAW ALTNVPELEP RSPSATTDWA RSLERQLELY TWLQSAEGAI AGGATNSWGG RYAQPPAGTP TFYGMFYDEK PVYHDPPSNQ WFGMQVWSMH RVAELYLQTG DARAEVLLDR WVPWAIANTS LGADWSIPAE LTWTGKPNTW SPTNPQPNTD LHVEVTDTGQ DVGAAAAYAR TLIAYAARSG DVAAKTTAKG LLDALHAASD ALGVSTVEKR GDYERFDDVY DASTGQGLYL PPGWTGTMPN GDVIAAGRSF VDIRSFYLND PDWPKVQAYL DGGAEPTFRY HRFWAQADVA MAYADFGRLF PTG
|
| |