Gene Sde_1394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1394 
Symbol 
ID3968663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1805639 
End bp1806973 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content45% 
IMG OID637920470 
ProductBeta-glucosidase 
Protein accessionYP_526868 
Protein GI90021041 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.534606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0533857 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGAC TTACACTACC GCCTTCTTCT CGTTTGCGCA GCAAAGAGTT TACCTTTGGT 
GTTGCAACGT CGTCTTACCA AATTGAAGGC GGCATAGATT CTCGCCTGCC CTGTAATTGG
GATACGTTCT GTGAGCAGCC CAATACCATT ATTGATAACA CCAACGGCGC CATTGCTTGC
GACCACATAA ATAGATGGCA AGACGATATA GAACTTATTG CCAACCTAGG GGTAGATGCC
TACCGCTTTT CTATTGCGTG GGGCCGTGTT ATTAATTTAG ACGGCAGCCT CAATAATGAA
GGCGTTACAT TTTACAAAAA TATTTTAACT AAGCTTCGCG AAAAGAATTT AAAAGCTTAT
ATAACGCTAT ACCACTGGGA CTTGCCACAA CATTTAGAAG ATGCTGGCGG CTGGCTTAAC
CGCGATACCG CCTACAAGTT TCGCGACTAT GTAAACCTTA TAACCCAAGC GCTTGATGAC
GATGTATTTT GCTACACAAC GTTAAACGAG CCCTTTTGCA GTGCCTACCT TGGCTATGAA
ATTGGTGTAC ACGCACCGGG TATAAAAGAC TTAGCCAGTG GGCGCAAAGC CGCACACCAT
TTATTACTTG CCCATGGCTT AGCTATGCAA GTGCTGCGAA AAAACTGCCC CAATAGTTTA
AGCGGCATAG TGTTAAACAT GAGCCCTTGT TACGCCGGCA GCAACGCACA AGCAGATATA
GATGCAGCAA AACGCGCGGA CGATTTATTA TTTCAGTGGT ATGCACAACC GCTACTTACT
GGCTGCTACC CTGATGCAAT AAACAGCCTG CCAGACAATG CCAAACCACC TATTTGTGAA
GGCGACATGG CGTTAATAAG CCAACCTTTA GATTATTTAG GCCTTAACTA CTATACCCGC
GCAGTATTTT TTGCCGACGG TAATGGCGGT TTTACCGAAC AAGTACCTGA GGGTGTAGAG
CTAACCGATA TGGGCTGGGA AGTTTACCCG CAAGGCTTAA CCGATTTACT AATAGACCTA
AACCAACGCT ATACCCTACC CCCGTTACTT ATTACCGAAA ACGGCGCAGC AATGGTGGAC
GAACTTGTTA ACGGCGAAGT TAACGATATT GCCCGAATAA ATTATTTTCA AACCCATTTA
CAAGCGGTAC ACAACGCCAT TGAACAAGGT GTTGATGTAC GCGGTTATTT TGCTTGGAGC
CTAATGGATA ATTTTGAGTG GGCACTGGGT TACAGCAAAC GATTCGGTAT TACCTATGTA
GATTACCAAA CACAAAAGCG AACGCTAAAA GCCAGCGGCC ACGCATTTGC TGAGTTTGTC
TCGAGTAGGA GCTAA
 
Protein sequence
MNRLTLPPSS RLRSKEFTFG VATSSYQIEG GIDSRLPCNW DTFCEQPNTI IDNTNGAIAC 
DHINRWQDDI ELIANLGVDA YRFSIAWGRV INLDGSLNNE GVTFYKNILT KLREKNLKAY
ITLYHWDLPQ HLEDAGGWLN RDTAYKFRDY VNLITQALDD DVFCYTTLNE PFCSAYLGYE
IGVHAPGIKD LASGRKAAHH LLLAHGLAMQ VLRKNCPNSL SGIVLNMSPC YAGSNAQADI
DAAKRADDLL FQWYAQPLLT GCYPDAINSL PDNAKPPICE GDMALISQPL DYLGLNYYTR
AVFFADGNGG FTEQVPEGVE LTDMGWEVYP QGLTDLLIDL NQRYTLPPLL ITENGAAMVD
ELVNGEVNDI ARINYFQTHL QAVHNAIEQG VDVRGYFAWS LMDNFEWALG YSKRFGITYV
DYQTQKRTLK ASGHAFAEFV SSRS