Gene Sde_0636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0636 
Symbol 
ID3964965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp803723 
End bp805459 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content50% 
IMG OID637919697 
Productcellulase 
Protein accessionYP_526110 
Protein GI90020283 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAG TTAAAGTTTT AGCGCTGTGT GCCAGTGTGG CTGTAATGAT AGGTTGCAGT 
GATGCCGACA CTAAATTAGC TAACTCGGCC AAGGCCGAGG TGGGCTTTAC CAAAGTGAAT
CAGCTGGGTT ATTTGCCCGC GGCCAAAAAG CTGGCGGTGG TACCCGCCGT TGCAGCTGCA
AAATTCGACA TAATCGATGT AACTAGCGGT AAAGTAGCGT TTACGGGGAG TTTAAGCGAC
GTAAAAAGCT GGAGCGCGAT GGGGGACGAA TCTTTCAAGT TGGCAGACTT TAGCGCCCTG
CAAGCCGAAG GGAGTTACCG CTTAGTTGTT CAGGGTGTGA GTGATTCTTA CACCTTCGAT
ATTAGCCCAA GTGTATATAG CCAAGCGCAC GATGGAGCCC TTAAAGCCTA TTACTATAAT
CGAGCGAGCA CAGAGTTAAC AGAACAGTAC GCCGGGGTGT ATGCGCGACC TGCGGGGCAC
CCAGATACCG ACGTACGCAT ATTCGATAAC GCCGCCTCAG CCGCGCGCCC AGCAGATACA
AGCTTTGCTG CACCAAAGGG TTGGTACGAT GCTGGCGATT ACGGCAAGTA CATTGTTAAC
AGTGGTATTT CCACTTACAC CCTAATGGCT GCGTACGAGC ATTTCCCGTC GTTTTACAAG
CAACGCGATA TAGATATTCC CGAATCTGGC GATGCCGTAC CGGATATTCT CGACGAGGTA
ATGTGGAACC TTGAATGGAT GCAGGTCATG CAAGACCCGA ACGACGGCGG TGTGTACCAC
AAGCTTACCA CCCTGAATTT TTCTGGCGCA GTCATGCCGC ACGAAGCGAC TGCGCAGCGC
TATTTTATTA AAAAATCTAC CGCTGCAACG CTAGATTTTG CCGCGGTTAT GGCCACTGCA
AGCCGAGTAT ACGCACCGTT CGAAGGTGCT TTTCCTGGTA AATCAGCTGC TTATCGACAG
GCGGCCATTG CTGCGTGGGA GTGGGCACAA GCAAACCCTA GTGAGACATA TTCGCAGACA
CCGCTGAGCA AAGTTCAAAC CGGCGCCTAT GGTGATAAAA AGTTAAACGA TGAATTTGCG
TGGGCGGCCG CAGAGTTGTT TATATTGACC GGCGAGCAAA AATACTGGCA GGCGTTTAAC
AAGCAAAAAG TGCAGGCGGG TGAGTCTAGC TGGGCGAATG TTGCGGGGTT GGGGTTTATT
TCCTTGGCCA ATAATGCGCG CAGCCTGTTA AACGAAGCTC AATACAAAAC CGTTACCGAT
TCAATTGTTC GCGCTGCAGA TAGCTTGCTT GTTACTTACA AAGAGAATGC CTACCAAGTA
CCCATTGGCA ACAAAGATTT TTTCTGGGGT GGCAATTCCG GCACGTTAAA TCGCGCTTGG
GTTTTGCTTG AGGCCAATAA AATTAAACCG CAGCAAGAAT ACATCGATGC TGCACTTGCC
GCGGTGGATT ATATTTATGG TCGCAACCCT ACCAACTACT CTTTTGTCAC TGGGTTTGGC
GATAACCCTG CGGTGGGTAT CCATCATCGT CCATCCTATG CCGATGGCAT TAAAGCCCCT
GTGCCTGGTT GGCTTGCGGG CGGTGCGCAC AATGGCAAGC AAGATGGTTG TGAGTACCCT
TCCGATGCAC CGGCAAAATC CTATCTAGAC GACTGGTGCA GTTACTCCAC CAACGAAATT
GCTATTAATT GGAATGCGCC GTTAGTTTAC ATACTGGCTG CGGTAAATAA TTTGTAG
 
Protein sequence
MNKVKVLALC ASVAVMIGCS DADTKLANSA KAEVGFTKVN QLGYLPAAKK LAVVPAVAAA 
KFDIIDVTSG KVAFTGSLSD VKSWSAMGDE SFKLADFSAL QAEGSYRLVV QGVSDSYTFD
ISPSVYSQAH DGALKAYYYN RASTELTEQY AGVYARPAGH PDTDVRIFDN AASAARPADT
SFAAPKGWYD AGDYGKYIVN SGISTYTLMA AYEHFPSFYK QRDIDIPESG DAVPDILDEV
MWNLEWMQVM QDPNDGGVYH KLTTLNFSGA VMPHEATAQR YFIKKSTAAT LDFAAVMATA
SRVYAPFEGA FPGKSAAYRQ AAIAAWEWAQ ANPSETYSQT PLSKVQTGAY GDKKLNDEFA
WAAAELFILT GEQKYWQAFN KQKVQAGESS WANVAGLGFI SLANNARSLL NEAQYKTVTD
SIVRAADSLL VTYKENAYQV PIGNKDFFWG GNSGTLNRAW VLLEANKIKP QQEYIDAALA
AVDYIYGRNP TNYSFVTGFG DNPAVGIHHR PSYADGIKAP VPGWLAGGAH NGKQDGCEYP
SDAPAKSYLD DWCSYSTNEI AINWNAPLVY ILAAVNNL