Gene Sde_3239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3239 
Symbol 
ID3965729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4130713 
End bp4132629 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content47% 
IMG OID637922336 
Productcellulase 
Protein accessionYP_528708 
Protein GI90022881 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00277111 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.266356 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTCA CAAGAATGAA ATCATCACAC CAAGGCGCGT GTCGACCAAG GTCTTCCACC 
CTACAGCGAC TAATCGCCTC ATCACTTACC ACCGCATGTT TGCTAGCAGC GTCTACTTTT
GCCGACGTAG CGCCGTTAAC CGTAGATGGC AACCGCATTC TCAGCGGTGG CCAAGAGGCT
AGCTTTGCCG GTAACAGTTT GTTTTGGAGC AACAATTATT GGGGCGGTGA GAAATACTAC
ACAGCCGAAA CTGTTAACTG GTTAAAACAA GACTGGGGCG CAACACTAGT GCGCGCGGCC
ATGGGTGTAG AAGATAACGG CGGCTACCTA GATGACAAAG AAGGCAACAA ACAAAAGGTA
AAAACCGTTG TAGATGCTGC TATTGCCAAC GACATGTATG TAATTATCGA TTGGCACAGC
CACCACGCCG AAGACCACAA AAGTGAAGCC ATTGCTTTTT TTGAGGATAT GGCGCGCACC
TACGGCAATA AAAAACACGT TATTTACGAA ATTTATAACG AGCCTTTACA AATTTCGTGG
AGCAACACAA TTAAACCCTA CGCCGAAGAT GTAATTAGAG CTATTCGCGC GATAGACCCC
GACAACTTAA TTATTGTTGG TACGCCAACG TGGTCGCAAG ATGTAGACGT AGCATCGCAA
GACCCCATTA CCGGCTACGC CAATATTGCC TACACATTGC ACTTTTACGC AGGCACCCAC
AAACAATCTT TACGAGACAA AGCGCAAACC GCACTTAACA ACGGCATAGC GCTTTTCGCA
ACAGAGTGGG GAACAGTAAA TGCAAACGGT GATGGCGCTG TAAACACCAC CGAAACAGAC
AAGTGGATGA CGTTCTTTAA AACCAACCAC ATAAGCCACG CAAACTGGGC GCTAAACGAC
AAATCAGAAG GCGCTTCTGC ATTAAACCCC GGAGCCAGCC CCAATGGCAA CTGGAGCAAC
GCCGACTTAA CCACATCGGG TAAGTACGTA AAAAACATTA TCAAAAACTG GAACGACGGC
ACGCCGGGAG GCAGCTCTTC AAGCTCGTCC GGCGGCTCAA CCAGTTCCTC CTCAAGCTCA
TCTAGCTCTA ATTCCAGCTC TGGTGCTGGC AAAGTAAATT TACCCGCACG CATTGAAGCC
GAAAACTATA ACAGTGCACC GGTAGAAACA ACTGCAGGCA ATAGTGGCGG CAGCGTTTCA
CAATGTACAT ACAGAGGGCT AAATGTAGAC GTACAAGACG CAAGCGAAGG CACTTGTAAT
ATTGGCTGGA CAGCAGCAGG CGAAAAAGTT ACCTACAACA TAGGCACAGC AAATAATACT
TACAATATTG CACTTCGCAC CGCATCGCTT GATGCAGGCA AGCGCGTATC GGTATATGTA
GGCAACACCC TCGCCGACAC AATAAGCACC CAAGGTGGCG GCTGGCAAAA TTGGAAGACG
CAAACCATCC CCAATGTATA TATTCCATCA AACTCAGTTA TTACCGTGGA ATTCTACGAT
GGCCGCACCA ACCTTAACTA CTTAAACATT AGTGCAGCTT CGGGGTCTTC CTCTTCAAGC
TCCTCATCTA GCTCGTCAAC GTCTAGCTCT TCTTCGAGCT CATCTTCTAG CTCTTCAGGT
GGTGGCAGTT GTAGCAGCTA TATAGATATA CCTTGGAATA CTCGCACCGA AGTTACCCTA
ACAAGTGGCG CCTGCGTTCG CTTTAACCAA AACCTTTCGG GCAAAACCCT ACAAGTGTGG
GATAGCGATG CAAACTCATC GTGCGATTTC CGGGGCACAG TTACAACAGT AGGCGGCACT
GGCAGTTTAA ATGTAAGCAG CAACTATGTT TCGTCTAAGA GCCTAACAGG AACCAAACTT
ACATTTAATT CAGCAAGTAA TAACAATTGT AAGTACGTTA AAGTTCGTGC TTATTAG
 
Protein sequence
MTFTRMKSSH QGACRPRSST LQRLIASSLT TACLLAASTF ADVAPLTVDG NRILSGGQEA 
SFAGNSLFWS NNYWGGEKYY TAETVNWLKQ DWGATLVRAA MGVEDNGGYL DDKEGNKQKV
KTVVDAAIAN DMYVIIDWHS HHAEDHKSEA IAFFEDMART YGNKKHVIYE IYNEPLQISW
SNTIKPYAED VIRAIRAIDP DNLIIVGTPT WSQDVDVASQ DPITGYANIA YTLHFYAGTH
KQSLRDKAQT ALNNGIALFA TEWGTVNANG DGAVNTTETD KWMTFFKTNH ISHANWALND
KSEGASALNP GASPNGNWSN ADLTTSGKYV KNIIKNWNDG TPGGSSSSSS GGSTSSSSSS
SSSNSSSGAG KVNLPARIEA ENYNSAPVET TAGNSGGSVS QCTYRGLNVD VQDASEGTCN
IGWTAAGEKV TYNIGTANNT YNIALRTASL DAGKRVSVYV GNTLADTIST QGGGWQNWKT
QTIPNVYIPS NSVITVEFYD GRTNLNYLNI SAASGSSSSS SSSSSSTSSS SSSSSSSSSG
GGSCSSYIDI PWNTRTEVTL TSGACVRFNQ NLSGKTLQVW DSDANSSCDF RGTVTTVGGT
GSLNVSSNYV SSKSLTGTKL TFNSASNNNC KYVKVRAY