Gene Sare_3180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3180 
Symbol 
ID5705793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3665389 
End bp3668280 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content69% 
IMG OID641272611 
Productglycoside hydrolase family protein 
Protein accessionYP_001537978 
Protein GI159038725 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000076749 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCCCGAC GTCGCCGCCG CGCGATGATC ATCACGGCGA TGGTGGTGGC GGCCGGCGGA 
GCGACCGTGC CCGCCAGGGC CGCCCAAGCC ACGCCGGCCT GCCACGTGGT CTACACGACC
AACGACTGGG GCAGCGGCTT CACCGCGAAC ATCGCCCTCA CCAATCTCGG CGACCCGATC
CAGGACTGGA CCCTGCGGTT CGCGTTCGCC GGCAACCAGA CCATCACCCA CGGCTGGTCG
GCGACCTGGA GTCAGGACGG TAGCGACGTC ACCGCCACCA ACGAGTCGTG GAACGGTGAC
CTCGGCACCG GTGATGCCGC GCACATCGGA TTCAACGCCA CGTACAGCGG CACCAACGCC
GAGCCGACGT CCTTCTCCGT CAACGGCGTG AGCTGCGGCG GCGTACAGCA GCCGCCCACG
GTCACGCTGG ACGTGCCCGC CGGGCCGTTC GAGGCGCCCG CCGACGTGCC GCTGACGGCC
GCCGCCAGCG ATCCGGACGG GACCATCAGC AAGGTTGACT TCTACCGCAA CGGCCTGCTG
GTCGACACCG ACACCACGGC CCCGTACGCG TACACCCTGC AGGCGCTGCC GGCGGGCACC
TACACCGTGC AGGCCAAGGC GTACGACGAC ACCGGGCGCA GCGCCGTCGC GGAGAAGTCG
TTCACCGTCG AGCCCGCGTC CGGCCCACGA CTGGTCGCCA CCCCAGCCGC GGTGGGCGTA
CCCGAGGGCG CCAGCGCCAC CGTCACGCTG ACGTTGAGCG AGGCGCCGGC CGCGAGCGTC
CCGGTGAGCC TCACCCGCAC CGGTGACACT GACATCACTG TCGCGCCCAC GTCGCTGACG
CTGACCACCG GCAACTGGAA CACCGGTGTC ACCGTGACCG TGTCGGCGGC CGAGGACGCC
GACACCGCCG GGGGCACCGC GACGATCACC GCGTCCGCTG CCGGCCTCGC CGCACTGGCG
ATCACCGCGA CGGAGATGGA CAACGACAAC CCGGGCGGCG ACAACGAGTA CATCGCGCGG
TTCCTCACCC AGTACGGCAA GATCAAAAAT TCGGGGTACT TCAGCCCCGA GGGCGTGCCG
TACCACGCCA TCGAGACCCT GATCGTCGAG GCGCCCGACC ACGGCCACGA GACCACGAGC
GAGGCATTCA GCTTCTGGCT CTGGCTGGAG GCGCAGTACG GCCGGGTGAC GGAGAACTGG
GCGCCGTTCA ACACCGCCTG GACGGTGCTG GAGAACTACA TCATCCCGTC GTCGGCCGAC
CAGCTCACGG CCGGCGCTCC CGGTACCGCT CAGTACGCCG CCGAGTACGA CCTGCCCAGC
CAGTACCCGT CGCAGCTGCA ACCGAACGTT CCGGTCGGCC AGGACCCGCT GCGGGGTGAG
CTCCAGTCCA CCTATGGCAC CGGTGACATC TACGGCATGC ACTGGCTGCT CGACGTGGAC
AACACCTACG GCTTCGGTCG GTGCGGCGAC GGCACCACCC GGCCGGCGTA CATCAACACC
TTCCAGCGCG GTCAGCAGGA GTCGGTCTGG GAGACCGTCC CGCAGCCCTC CTGCGAGACC
TTCACCCACG GCGGGCAGTA CGGCTTCCTG GACATCTCCG TCAAGGAGCA GAACGCCCCG
GCGAAGCAGT GGAGGTACAC CAACGCGCCG GACGCCGACG CCCGCGCCGT GCAGGCTGCC
TACTGGGCGC TGACCTGGGC CAAGCAGCAG GGCAAGGCGG CGGATGTGGC GGCCACTGTG
GCCAGGGCCG CCAAGTTGGG CGACTACCTG CGCTACGCGA TGTTCGACAA GTACTTCAAG
AAGATCGGCA ACTGTGTCGG GGCGTCCACC TGCCCGGCCG GCAGTGGCCG AGAGTCCGCG
CACTACCTTT TGTCCTGGTA CTACGCCTGG GGCGGCGCGT ACGAGCCGGG TCAGGACTGG
TCGTGGCGGA TCGGCTCCAG CCACAACCAC TTCGGCTACC AGAACCCGTT CGCCGCCTGG
GCGTTGACCA ACGTGCCGGA ACTCGAGCCG AGGTCGCCGA GCGCGACCAC CGACTGGGCC
AGGAGCCTGG AGCGGCAGCT GGAGTTGTAC ACCTGGCTGC AGTCCGCCGA GGGCGCGATC
GCCGGCGGCG CGACCAACAG CTGGGGCGGC CGGTACGCCC AGCCACCGGC TGGCACACCG
ACCTTCTACG GCATGTTCTA CGACGAGAAG CCCGTCTACC ACGACCCGCC GTCGAACCAG
TGGTTCGGCA TGCAGGTCTG GTCGATGCAC CGCGTCGCCG AGCTGTATCT GCAGACCGGT
GACGCCCGGG CCGAGGTGTT GCTGGACAGG TGGGTGCCGT GGGCGATCGC CAACACGAGC
CTGGGCGCCG ACTGGTCGAT CCCGGCTGAA CTGACGTGGA CGGGCAAGCC GAACACGTGG
AGCCCGACCA ACCCGCAGCC GAACACCGAC CTACACGTCG AGGTCACCGA CACCGGGCAG
GACGTCGGTG CCGCGGCCGC CTACGCCCGG ACCCTGATCG CCTACGCGGC GAGGTCAGGA
GACGTGGCCG CCAAGACCAC CGCCAAGGGG CTGTTGGACG CGCTGCACGC CGCCAGCGAT
GCCCTGGGCG TGTCGACGGT GGAGAAGCGG GGCGACTACG AGCGCTTCGA CGACGTCTAC
GACGCGAGCA CCGGGCAGGG TCTCTACCTC CCGCCGGGCT GGACGGGCAC GATGCCCAAC
GGTGATGTGA TCGCGGCGGG TAGGAGTTTC GTCGACATCC GGTCGTTCTA CCTGAACGAC
CCGGATTGGC CGAAGGTGCA GGCATACCTT GATGGTGGCG CCGAGCCGAC GTTCCGTTAC
CACCGTTTCT GGGCCCAGGC CGACGTCGCG ATGGCGTACG CCGACTTCGG GCGGCTGTTC
CCGACTGGTT GA
 
Protein sequence
MARRRRRAMI ITAMVVAAGG ATVPARAAQA TPACHVVYTT NDWGSGFTAN IALTNLGDPI 
QDWTLRFAFA GNQTITHGWS ATWSQDGSDV TATNESWNGD LGTGDAAHIG FNATYSGTNA
EPTSFSVNGV SCGGVQQPPT VTLDVPAGPF EAPADVPLTA AASDPDGTIS KVDFYRNGLL
VDTDTTAPYA YTLQALPAGT YTVQAKAYDD TGRSAVAEKS FTVEPASGPR LVATPAAVGV
PEGASATVTL TLSEAPAASV PVSLTRTGDT DITVAPTSLT LTTGNWNTGV TVTVSAAEDA
DTAGGTATIT ASAAGLAALA ITATEMDNDN PGGDNEYIAR FLTQYGKIKN SGYFSPEGVP
YHAIETLIVE APDHGHETTS EAFSFWLWLE AQYGRVTENW APFNTAWTVL ENYIIPSSAD
QLTAGAPGTA QYAAEYDLPS QYPSQLQPNV PVGQDPLRGE LQSTYGTGDI YGMHWLLDVD
NTYGFGRCGD GTTRPAYINT FQRGQQESVW ETVPQPSCET FTHGGQYGFL DISVKEQNAP
AKQWRYTNAP DADARAVQAA YWALTWAKQQ GKAADVAATV ARAAKLGDYL RYAMFDKYFK
KIGNCVGAST CPAGSGRESA HYLLSWYYAW GGAYEPGQDW SWRIGSSHNH FGYQNPFAAW
ALTNVPELEP RSPSATTDWA RSLERQLELY TWLQSAEGAI AGGATNSWGG RYAQPPAGTP
TFYGMFYDEK PVYHDPPSNQ WFGMQVWSMH RVAELYLQTG DARAEVLLDR WVPWAIANTS
LGADWSIPAE LTWTGKPNTW SPTNPQPNTD LHVEVTDTGQ DVGAAAAYAR TLIAYAARSG
DVAAKTTAKG LLDALHAASD ALGVSTVEKR GDYERFDDVY DASTGQGLYL PPGWTGTMPN
GDVIAAGRSF VDIRSFYLND PDWPKVQAYL DGGAEPTFRY HRFWAQADVA MAYADFGRLF
PTG