Gene Sde_2497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2497 
Symbol 
ID3968574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3155564 
End bp3158782 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content48% 
IMG OID637921588 
Productexo-1,4-beta-glucosidase 
Protein accessionYP_527969 
Protein GI90022142 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases
[COG2755] Lysophospholipase L1 and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA CTTTATCCTT TAAAACATCC TTGCTTGCGG GCTTGGTGGC ATCCAGTTTA 
CTGGTTGCGG CCTGTCAGGG TGTTAAACAG CAAACGGAAG CTACTCAGAC AAAGCACAAT
ATTACCTTAT GGCCGCAGGC GTCTAGCCCT GTAATAAAGT CGCCAGATTA CGAAGCGGAA
GTGGAAGCCA AGGTAGAAGC GTTGTTAGGA CAAATGACGC TAGAGCAAAA AGTAGGGCAA
ATCCTACAGC CAGAAATTCA ATCTATTAAG CCGCATGAAG TAAAAGAATA CCACATTGGC
TCTGTACTAA ATGGTGGTGG CTCTATGCCT AACCGCATAG AAAATGCGCC GCCCATTGAA
TGGGTAAAAT TGGCCGATGC CTTTTACGAT GCCTCTATGG ACGATTCTGA CGGTGGAATC
GCAATTCCCA TTATTTGGGG TACCGATGCC GTACACGGTC ACGGCAATGT AACTGGCGCA
ACCATATTCC CGCATAACAT AGGCCTTGGT GCTGCACGCA ACCCAGCGCT TATCGAAAAA
ATTGGCGAAA TAACGGCAAA AGAAGTACGC GCAACCGGCA TTGAATGGAT ATTTGGCCCA
ACTTTGGCCG TAGCGCAAAA CGATTTATGG GGCCGCACTT ACGAAAGCTA CTCGGAAGAC
CCAGCCATAG TGGCCGACTA CGCCAGTGCC ATGGTGGTAG GTATGCAGGG CAAAGTGGAC
GACAGCGATT TTCTGTCCAC TAATCGCGTA GTTGCCACAG CAAAGCACTT TTTAGCTGAC
GGCGGTACCT TAGGAGGCAA CGATCAAGGT GATGCGCGCA TAAGCGAAGA AGAGTTGGTG
CAAATTCATA ATGCGGGCTA TGTGCCTGCC ATTGAATCGG GCGTGCAAAC GGTTATGGCC
AGTTTCTCTT TGTGGAATGG CGTAAAAATG CATGGTAACA ACTACCTACT TACCCAAGCA
CTTAAAGAGC GTATGGGGTT TGATGGTTTT ATAGTAGGGG ATTGGAATGG CCACGGGCAG
GTACCTGGGT GCACCAACGA ATCTTGCCCT CAATCGCTAA ACGCCGGTTT AGATATGTAC
ATGGTGCCTT ACGATTGGAA AAAACTGTAC AGAAACTTAA TTAGCCAAGT GCAATCGGGT
GAAATTGCCC CAAGCCGTTT AGATGACGCT GTACGCCGTA TTCTTCGGGT AAAAATTCGC
GCTAATTTGT GGGCTGCGAA ACCTTCAGAG CGAATTAATC TAGCCACTAT TGACGAGGTG
GTTGGCCACG CAAACCACCG TGAGGTAGCG CGGCAGGCGG TGCGAGAAAG TTTAGTATTG
TTAAAAAATA AAAATAGCGT ACTGCCTATT GCTGCCAATA AAACCGTGCT GGTTGCAGGT
GACGGCGCCG ATAATATTGG CAAACAATCT GGCGGTTGGA GTGTAAGCTG GCAGGGCACT
GGTAACACCA ATGCATCCTT CCCCGGTGGT ACATCTATTT ATAAAGGTAT TGCCGATGCA
GTCACTCAGG GCGGCGGTAA AGCTACGCTT TCTGTGGATG GCAGCTACAA AACTAAACCC
GATGTTGCCA TTGTGGTAAT AGGCGAAGAC CCTTACGCCG AAGGCCAAGG CGACCGCAAT
AGTTTAGAGT TCGAGCCGGT GAATAAAAAA TCGCTTGAGC TATTAAAAAA ATTAAAAGCA
GATGGCATAC CCGTTGTAAC AGTATTTATT TCTGGCCGAC CTATGTGGGC TAACCCAGAA
ATTAACGCGT CTGATGCATT TGTTGCCGCG TGGTTACCTG GCTCTGAAGG GCAGGGCGTA
GCAGATGTAC TTATAGGCAA CGCCAACGGC AAGCCTCGTT TTGATTTCAA GGGCACCTTG
TCGTTCTCTT GGCCTAAGCT GCCGACCCAA GGCTTGCTCA ACCCAACGCA CCCCAACTAC
GACCCGTTAT TTAAATTGGG ATACGGGCTA ACTTATGCCT CGAGTGAAAC TGGCCCAGAG
CAATTGGCGG AAGATGTTGA AGGTGTAGAT AAAGGCTCAA CCGGCGACAT TAATTTTTAT
GTTGGCCGCA CATTAGAGCC GTGGGAAGTG TTTGTTCGAA CTCCTGAAAG TTCGCAGCGT
TTAAGTGGCC CATTTGCAGA CTTAGGCAAT GCCAGTGTGC GTACCAGTGA TATGCAGGTA
CAAGAAGATG CCCTTACTTT TACTTGGGGC GGTAGCTGGA TGTCTATTCT GGGAATAGAA
GGAGGGCGCG GTTACGACCT TTCTTCGCAA TATAAAGAAG GCGGAGTAAT AAGCTTTAAC
TTCAATTCAA TAGATATGGC TAAAGGCGAT TTAAAAGTAC AAATGGCCTG TGGTGAAGGT
TGCACGCGTG AAGTAGATAT CACAACTATC GCACGCGACT TGGAAGGCAA AGGCTGGCAG
TCGTTAACAG TGCCCTTAGC GTGCTTTGCA CACGAAGGCG ACGATTTCAC CCATATTACT
GCGCCGTTTA ACTTATTTGC CGGTGGAAAA GGTCAAGTTG CTGTAGCCAA CATTCGCATA
CTGCGCGCCG GTACACAAAC CGTGCCGTGT GTATTGCCTA AAGATGTTTC CGTAACGCCA
GAGCCGCTGA ATGCTAGCTG GGCGATAGAT TGGTGGATGC CGCGCCACAA AGAAAAACTG
GCGCGTATCC AGCAAGGTAA TGTGGATTTA CTAATGATTG GCGATTCCAT TACCCACGGC
TGGGAAGATG CAGGTAAAGA CGTGTGGGCG CAATATTACG CGCACCGCAA TGCAGTGGAC
TTAGGCTTTA GTGGCGACCG AACCGAAAAC GTATTGTGGC GCTTACAGCA CGGCGAAGCA
GACGGTATTA AGCCTAAAGT GGCAGTGGTT ATGATTGGTA CCAACAATGC CGGCCATCGT
CACGAGCCTT CGCACTACAC AGCCAAGGGT GTTGCGGCTG TCGTTGCTGA ATTGCAAAAA
CGATTGCCTG AAACAAAGAT ATTATTACTG GGTATATTCC CTCGCGGCGA AACCAGTGAA
GACCCTTTGC GGGTATTAAA TGCCAAAACC AATACTCTTT TGGCGAAAAT GGCCGACGGA
GAGAAGGTGG TGTATTTGAA TATCAATAAA ACGTTTTTAG ATGAAAACGG CGTATTGCCT
AAAGATATAA TGCCCGACCT ATTGCACCCC AATGAAAAGG GGTACGCATT GTGGGCGAAA
GCGATGGAAC CCACCCTTAA AAAAATGCTG GGCGAATAG
 
Protein sequence
MKNTLSFKTS LLAGLVASSL LVAACQGVKQ QTEATQTKHN ITLWPQASSP VIKSPDYEAE 
VEAKVEALLG QMTLEQKVGQ ILQPEIQSIK PHEVKEYHIG SVLNGGGSMP NRIENAPPIE
WVKLADAFYD ASMDDSDGGI AIPIIWGTDA VHGHGNVTGA TIFPHNIGLG AARNPALIEK
IGEITAKEVR ATGIEWIFGP TLAVAQNDLW GRTYESYSED PAIVADYASA MVVGMQGKVD
DSDFLSTNRV VATAKHFLAD GGTLGGNDQG DARISEEELV QIHNAGYVPA IESGVQTVMA
SFSLWNGVKM HGNNYLLTQA LKERMGFDGF IVGDWNGHGQ VPGCTNESCP QSLNAGLDMY
MVPYDWKKLY RNLISQVQSG EIAPSRLDDA VRRILRVKIR ANLWAAKPSE RINLATIDEV
VGHANHREVA RQAVRESLVL LKNKNSVLPI AANKTVLVAG DGADNIGKQS GGWSVSWQGT
GNTNASFPGG TSIYKGIADA VTQGGGKATL SVDGSYKTKP DVAIVVIGED PYAEGQGDRN
SLEFEPVNKK SLELLKKLKA DGIPVVTVFI SGRPMWANPE INASDAFVAA WLPGSEGQGV
ADVLIGNANG KPRFDFKGTL SFSWPKLPTQ GLLNPTHPNY DPLFKLGYGL TYASSETGPE
QLAEDVEGVD KGSTGDINFY VGRTLEPWEV FVRTPESSQR LSGPFADLGN ASVRTSDMQV
QEDALTFTWG GSWMSILGIE GGRGYDLSSQ YKEGGVISFN FNSIDMAKGD LKVQMACGEG
CTREVDITTI ARDLEGKGWQ SLTVPLACFA HEGDDFTHIT APFNLFAGGK GQVAVANIRI
LRAGTQTVPC VLPKDVSVTP EPLNASWAID WWMPRHKEKL ARIQQGNVDL LMIGDSITHG
WEDAGKDVWA QYYAHRNAVD LGFSGDRTEN VLWRLQHGEA DGIKPKVAVV MIGTNNAGHR
HEPSHYTAKG VAAVVAELQK RLPETKILLL GIFPRGETSE DPLRVLNAKT NTLLAKMADG
EKVVYLNINK TFLDENGVLP KDIMPDLLHP NEKGYALWAK AMEPTLKKML GE