Gene Sde_2994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2994 
Symbol 
ID3967755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3811820 
End bp3813265 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content45% 
IMG OID637922091 
Productglucosylceramidase 
Protein accessionYP_528463 
Protein GI90022636 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.389806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000219808 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGTTAC TAAAAAATAT AAATAAGCAA CGCCTAGCAC GTAAAGTTAA ACTCGTCTGT 
AGCGCGATAA GCCTAGTGGT TATGGGGTTC AGCTGTGCCG CGTCAGCCAA TCAATACTGG
TTAACCAGCG GTGATCTAAG TGCGGCGTTC GAAGAGCAGG GCGAAAAGTA CGCGGTAGCT
CCCAGCCCCG AAATGCCCCT TATTACCATA GATAAAAGCC AAGCATTTCA AACCATGGAA
GGCTTTGGCT ATACCCTTAA TGGCGGCAGT GCAACTCACC TAGCCAATAT GAGTGACGCA
GCTAGAGCGC GGCTATTACA AGAAATATTT GGTCAAAGTG ATGGTGCGAA TACTAATAAG
CCCAGTATTG GCGTGTCTTA TTTGCGTTTA AGCCTAGGCG CATCAGATTT AGACCCCGCC
CCGTTTAGCT ACAACGATTT GCCGCCTGGC GAAGTGGATT TAAAGCTAGA AAAATTTACT
ATCGCCCAAG ATGAAAAAAC TCTTATCCCC ATACTTAAGC AAATATTAGC TATTAACCCA
AATATTACAT TTATGGCTAG CCCTTGGTCT CCGCCTGTAT GGATGAAAAC AAACGGCTCT
ACCATTGGTG GTGAGCTAAA CCCAGAATAC TACAGCGTAT ATGCACAATA TTTTGTTAAA
TATGTTCAAG CAATGGCTGA GCACGGCATA AACATAGATG CCATTACTAT TCAAAATGAA
CCTATGCACC CGGGTAATAA CCCAAGCTTG CTCATGCATG CAAAAGATCA AGCCGACTTT
ATTGCTAATC ACTTAGGCCC CGCGTTTAAG CAGGCAGAGC TAAAAACAAA AATCATTGTG
TGGGATCACA ACGCAGACAA ACCCGAATAC CCCATAGAGG TACTGAATCA CCCCGTTGCC
AATCAATATA TTCACGGCTC GGCATTCCAT TTATATGGCG GCGATGTAAA TGCCATAAGC
CAAGTGCACA ATGCTCACCC AGATAAGCAC TTATATTTTA CTGAGCAGTG GGTAGGCGCA
AATTCCAACT TTTGGGGCGA TGTAGCTTGG CATGTAGAAA ATTTAATTGT TGGGGCAACC
CGCAATTGGT GCAAAACGGT ATTGGAGTGG AATTTAGCCG CAGACAGTAA CTTACAGCCT
CACACTCTTG GTGGATGCGA CGCCTGCTTA GGCGCGCTAA CTATTGATGG CGATAACGTG
AAGCGCAATG CCGCGTATTA CATTATTGCC CACGCAGCTA AACATGTACC GCCAGGCTCG
GTGCGTATCC ATTCGCACCG TGTGGCGGGT TTACCTAATG TTGCTTTTCT TACGCCGCAG
AAAAAGGTTG TTGTAGTAGT GCTTAATAAT ACTACTCAAT TACAGTCATT TACATTGGTG
CACGACCGTC AAAAGTTTGC CTATTCCATG CCCGCACAGA GCGTTGTAAC GCTAGTTATA
GATTAA
 
Protein sequence
MTLLKNINKQ RLARKVKLVC SAISLVVMGF SCAASANQYW LTSGDLSAAF EEQGEKYAVA 
PSPEMPLITI DKSQAFQTME GFGYTLNGGS ATHLANMSDA ARARLLQEIF GQSDGANTNK
PSIGVSYLRL SLGASDLDPA PFSYNDLPPG EVDLKLEKFT IAQDEKTLIP ILKQILAINP
NITFMASPWS PPVWMKTNGS TIGGELNPEY YSVYAQYFVK YVQAMAEHGI NIDAITIQNE
PMHPGNNPSL LMHAKDQADF IANHLGPAFK QAELKTKIIV WDHNADKPEY PIEVLNHPVA
NQYIHGSAFH LYGGDVNAIS QVHNAHPDKH LYFTEQWVGA NSNFWGDVAW HVENLIVGAT
RNWCKTVLEW NLAADSNLQP HTLGGCDACL GALTIDGDNV KRNAAYYIIA HAAKHVPPGS
VRIHSHRVAG LPNVAFLTPQ KKVVVVVLNN TTQLQSFTLV HDRQKFAYSM PAQSVVTLVI
D