Gene Sare_3983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3983 
Symbol 
ID5706658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4524324 
End bp4526093 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content65% 
IMG OID641273408 
Productglycoside hydrolase family protein 
Protein accessionYP_001538764 
Protein GI159039511 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000432599 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0119912 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTCGC CCAAGGAGAT CTGTGGCATG AACGTGTGGA GAAGGCTCTC CGGCCCACGT 
CCGGCCCTCG CGCTGACCGG TGTGAGTGCC CTTGTCATAG GTGGGTTGGT GACCCTGCCG
GGCACCATGG CCCACGCCGC CACCCAGTGC GAGGTGTCGT ACACCACGAA CGACTGGCCC
GGCGGGTTCA CCGCCGCCAT CAGCATCAAG AACACTGGAG ACGTGCTCGA CGGTTGGACG
CTCCGCTTCG CCTTTCCGGA CAGCAGCCAG CAGGTGGTGC ACGGCTGGTC GGCCCGGTAT
GGCCAGTCGG GTCAGGACGT CACCGCGCAG AATGAGTCGT ACAACGGTTC GGTGGCCAGC
GGCGCTACCG TCGTCATCGG CTTCAACGGC TCGTGGGACG GCAGTAACCC CAGGCCGACG
TCGTTCACTC TCAACGGGGT GGCTTGCAAC GGCGGCCCCA CCACGCCGCC GCCCACCACC
GCACCGCCGC CCACCACCCC GCCACCCGGT GCCCGGGTCG ACAATCCGTA CCTGAACGCA
GTGGGCTACG TGAACCCGGA GTGGAAGGCC AAGGCCGAGT CGGTTCCCGG CGGCGACCGG
GTGTCGAACA CGTCGACCGC CGTCTGGATC GACCGGATCG CAGCCATCGA GGGGACGGAT
GACAGCCAGT CCAACGGCCC GATGGGTGTG CGTGATCACC TAAACGAGGC GCTGCGTCAG
GGTGCCGACT ACATCCAGTT CGTCATCTAC AACCTGCCCG GCCGGGACTG CGCTGCGCTC
GCCTCGAACG GTGAGCTCGA GCCGGACGAG CTGCCCCGCT ACAAGGCCGA GTTCATCGAC
CCGATCGCGG CTATCCAGAG TGACGCGATG TACCAGGACC TGCGGATCAT AAACATCATC
GAAATCGACT CGCTGCCGAA CCTGCACGCC AACACCGGTA GCAACCCAGG TGCCACTCCG
ACCTGCGACC TTGTCAAGCA GAACGGCGCC TACGTCAACG GCATCGGCTA TGCGCTAGCC
ACGCTGGGTG CGATCAGCAA CGTCTACAAC TATGTGGACG CCGCGCACCA TGGTTGGATC
GGCTGGGACA GCAACTTCAG CCCGGTCGCC TCACTCCTGA AGGAGGCCGC CACGGCATCC
GGCAGTACGG TCGACAACGC GCACGGCTTC ATTGTCAACA CCGCCAACTA CTCGGCCTTG
CACGAGCCCC ATTTCCAGAT CACCGACATG GTCAACGGCC AGTCGATCCG CCAGTCCACG
TGGGTGGACT GGAACCAGTA CGTGGACGAG CTGTCCTTCG CCCAGGCGTT CCGCGACGAG
TTGGTCACCA AAGGCTTCGA CTCCGGAGTC GGGATGTTGA TTGATACTTC CCGAAACGGC
TGGGGTGGCA GCGCCCGGCC AACCGGTCCC GGTCCGATGA CTGACGTCGA CAGTTATGTC
GACGGTGGTC GCGTCGACCG ACGAATCCAC GCCGGTAACT GGTGCAATCA GTCTGGCGCG
GGCCTGGGTG AGCGGCCCAG GGCCGCGCCA GAGCCGGGCA TCGACGCCTA CGTCTGGGTG
AAGCCGCCGG GCGAGTCCGA CGGTTCCAGC GAGGAGATTC CGAACAACGA CGGCAAGGGC
TTCGACCGGA TGTGCGACCC AACGTACGAC GGCAATGCCC GTAACGGCTA CAACCCCAGT
GGAGCCCTGC CCGACGCACC GATCTCCGGC GCCTGGTTCC CCGCCCAGTT CCAGCAGCTC
ATGCAGAACG CCTACCCGCC GTTGCCCTGA
 
Protein sequence
MVSPKEICGM NVWRRLSGPR PALALTGVSA LVIGGLVTLP GTMAHAATQC EVSYTTNDWP 
GGFTAAISIK NTGDVLDGWT LRFAFPDSSQ QVVHGWSARY GQSGQDVTAQ NESYNGSVAS
GATVVIGFNG SWDGSNPRPT SFTLNGVACN GGPTTPPPTT APPPTTPPPG ARVDNPYLNA
VGYVNPEWKA KAESVPGGDR VSNTSTAVWI DRIAAIEGTD DSQSNGPMGV RDHLNEALRQ
GADYIQFVIY NLPGRDCAAL ASNGELEPDE LPRYKAEFID PIAAIQSDAM YQDLRIINII
EIDSLPNLHA NTGSNPGATP TCDLVKQNGA YVNGIGYALA TLGAISNVYN YVDAAHHGWI
GWDSNFSPVA SLLKEAATAS GSTVDNAHGF IVNTANYSAL HEPHFQITDM VNGQSIRQST
WVDWNQYVDE LSFAQAFRDE LVTKGFDSGV GMLIDTSRNG WGGSARPTGP GPMTDVDSYV
DGGRVDRRIH AGNWCNQSGA GLGERPRAAP EPGIDAYVWV KPPGESDGSS EEIPNNDGKG
FDRMCDPTYD GNARNGYNPS GALPDAPISG AWFPAQFQQL MQNAYPPLP