Gene Sde_3237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3237 
Symbol 
ID3965710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4125970 
End bp4127862 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content48% 
IMG OID637922334 
Productcellulase 
Protein accessionYP_528706 
Protein GI90022879 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000087335 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.349209 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAG CAACCACAAA TCAATCGAGG GCACGCAGTA GCGCCTTTAA AAATATGTTG 
GCGGCATCGC TCGCAGGTTT AGGGCTACTA TCAGCTTCTG CATTTGCCGA TGTAGCCCCG
CTAACCGTAG ACGGCAATAA AATTCTTAGC GGTGGCCAGC AAGCCAGTTT TGCCGGTAAT
AGCTTATTTT GGTCTAACAA TGGCTGGGGC GGTGAGAAGT ATTACACGGC CGGTACCGTT
GAATGGCTAA AGCAAGACTG GGGCAGTAAT TTAGTTCGCG CCGCAATGGG TGTCGATGAA
AACGGCGGCT ACTTAGAAGA CCCAGCAGGA AACAAAGCGA AAGTAACAAC CGTTGTAGAT
GCAGCCATCG CTAACGATAT GTATGTAATT ATCGATTGGC ACAGCCACCA CGCCGAAGAC
TACCAAAACC AAGCCATTAG CTTTTTCCAA GATATGGCTC GCACCTACGG TAACAACAAC
AACGTTATAT ACGAAATTTA TAACGAGCCA TTACAGGTTT CTTGGAGCGG CACCATCAAG
CCTTACGCAG AAGCGGTAAT TGGCGCAATT CGCGCAATCG ACCCAGATAA CCTTATTATT
GTGGGCACGC CTACTTGGTC GCAGGATGTA GACGTAGCCT CGCGCGACCC CATCACGCAG
TACAGCAACA TTGCCTACAC TATTCACTTT TATGCGGGCA CCCACAAACA ATCCCTACGC
GATAAAGCAC AAACCGCATT AAATAATGGT ATTGCTTTGT TTGCTACCGA ATGGGGTACA
GTAAATGCCA ACGGTGACGG CGGTGTAGAC GCAGCCGAAA CTGATCGTTG GATGCAGTTT
TTTAAAGCGA ATCATATAAG CCATGCCAAC TGGGCCTTAA ACGATAAAGC CGAAGGCTCT
TCTGCATTAA AGCCTGGCTC TAACGCAAAC GGCGGCTGGA GCAATTCCGA CTTAACCGCC
TCTGGTACCT ATGTTAAAAA CTTAATTAAA ACATGGAACG ACGGCTCACC GAGCAGCAGC
TCATCTAGCA GCACCAGTTC TTCTTCAAGC AGCTCCTCGT CTAGTAGCTC ATCATCTAGC
AGCTCTTCAT CTAGTAGTTC TGGCGGTACC AATTTACCCG CGCGCATTGA AGCAGAAAAC
TACGATAGCG CACCGGTAGA AACCACTGCA GGTAATAGCG GCTCACCCAC CAATTGTTCG
TATAAAGGTA TGGGCGTAGA TGTAGAAAAC TCTACTGAAG GTGCTTGTAA TATTGGCTGG
ACTGCGGCAG GCGAAAAAGT AACTTACAAC ATTGGCAATG CCGATGGCAC TTACGATATT
GCATTGCGCG TAGCCTCTAT GGATGCGGGC AAACGTATCT CTGTGCATGT AAACAACAGC
CTAGCAGATA CCGTAACCAC ACAAGGTGGC GGCTGGCAGG CATGGACTAC CGAAACCATT
TCTAACGTGT ATATCCCATC AAACTCGGTA ATTACCGTTG AGTTTTACGA TAGTGGCTCT
AACCTAAACT TTTTAAACAT TACCGAAAGC TCGGGTACCG AACCACCTGT AGAACCACCC
GTTGAGCCGC CAGTAGAACC ACCCGTAGAC AACGGTAACT TCCCATGTAA CGACGGTAAC
TCTACGCTTG CCAACAACGG CGCCTCCATT AACCTTAACC AAGGAGCGTG TGTTAAATAC
AATCACGGCT GGGGCGATAT TCGTTTAGGC ACCTGGAGCG GCAACGGTAC CATTCGATAC
GACGTACTAG ACTGCAATAA CAACGTAATG AGTGATATTG CACAAAAACT TAATGACTTT
ACTGCTGTAG ACACCGCAAC AATGAACTGC GCACACTACA TTTATGTAAA ACAAGCCCCT
AGCAGCTACA CCCTGCAATT TGGTAGCTGG TAG
 
Protein sequence
MKSATTNQSR ARSSAFKNML AASLAGLGLL SASAFADVAP LTVDGNKILS GGQQASFAGN 
SLFWSNNGWG GEKYYTAGTV EWLKQDWGSN LVRAAMGVDE NGGYLEDPAG NKAKVTTVVD
AAIANDMYVI IDWHSHHAED YQNQAISFFQ DMARTYGNNN NVIYEIYNEP LQVSWSGTIK
PYAEAVIGAI RAIDPDNLII VGTPTWSQDV DVASRDPITQ YSNIAYTIHF YAGTHKQSLR
DKAQTALNNG IALFATEWGT VNANGDGGVD AAETDRWMQF FKANHISHAN WALNDKAEGS
SALKPGSNAN GGWSNSDLTA SGTYVKNLIK TWNDGSPSSS SSSSTSSSSS SSSSSSSSSS
SSSSSSSGGT NLPARIEAEN YDSAPVETTA GNSGSPTNCS YKGMGVDVEN STEGACNIGW
TAAGEKVTYN IGNADGTYDI ALRVASMDAG KRISVHVNNS LADTVTTQGG GWQAWTTETI
SNVYIPSNSV ITVEFYDSGS NLNFLNITES SGTEPPVEPP VEPPVEPPVD NGNFPCNDGN
STLANNGASI NLNQGACVKY NHGWGDIRLG TWSGNGTIRY DVLDCNNNVM SDIAQKLNDF
TAVDTATMNC AHYIYVKQAP SSYTLQFGSW