Gene Sde_3710 details

Gene Information

Locus tag: Sde_3710
Symbol:
ID: 3966726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4696567
End bp: 4697997
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 51%
IMG OID: 637922807
Product: arabinogalactan endo-1,4-beta-galactosidase
Protein accession: YP_529177
Protein GI: 90023350
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3867] Arabinogalactan endo-1,4-beta-galactosidase
TIGRFAM ID:
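
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal Python sketch of that arithmetic follows. It assumes 1-based inclusive coordinates and a single terminal stop codon, and the function name cds_lengths is hypothetical, not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not counted in the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon for the terminal stop.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(4696567, 4697997)
print(gene_len, protein_len)  # 1431 476
```

The result matches the Gene Length (1431 bp) and Protein Length (476 aa) fields of this record.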


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000857688
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.462081
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTAT CTATAAAGAA AGCGCTATGC GCTATGGCTG CGTCCACCTT TATTTTGCTG 
CAGGCTTCAA ACGCCCTTGC TATGGCAAAA GGTGCAGATG TGAGTTGGAT TTCTGAAATG
GAAGCAGAGG GCTACACGTT TTATAACGAT GCGGGCCAGC AGCAAGATGT GCTGCAAATT
CTAAAAGATC ACGGTATGGA TTCCATCCGT CTGCGCGTGT GGGTAAACCC CGCCGGTGGA
TGGTATAGCA GCATTAATGA CGTAATAGAA AAAGCTCAGC GCGCCAAAGC TGCGGGCATG
CGTATTATGA TCGATTTTCA CTACAGCGAC TCTTGGGCTG ACCCAGGCAA GCAATACAAG
CCCGCCGCGT GGACCAACTA TACCTTAGAC GGTTTAATGT CTGCGGTGTG GTGGCACACC
TACGATTCCC TCGTGGCCCT AAAGAATGCG GGTATTACCC CTGAATGGGT GCAAGTGGGC
AACGAAACAA ACAACGGTAT GTTATGGGAA GAGGGGCGCG CATCCGCCAA TATGCAAAAC
TATGCGTGGT TGGTGAATAG TGGCTACGAT GCCGTTAAAG AAGTGTTCCC TAATACCAAG
GCAGTGGTGC ACTTGGCAAA CTGCCACGAC AACGCAAACT TCCGCTGGAT ATTTGACGGC
TTACAAGCCA ATGGTGGTAA GTGGGATGTA ATAGGTGCCT CTATTTACCC TACCAACGCA
AGCGGTTATA GCTGGAGCCA AGCCAACAGT TTGTGCGAGG CAAACTTAAA CGATATGCAA
TCGCGCTATG GGTCCGAGGT GCTAATTGCC GAGGTTGGTG CGCCGTGGGA TCACCCAGAA
GCGAAAGCAA TCGTGAGCGA TGTAATTGCT AAGGCGCAAA ACGCCGGTGC AACAGGGGTA
TTTTATTGGG AGCCGCAGGC ATCAAACTGG CAGGGCTACA CGCTAGGTGC ATGGAACCCA
AACACCATGC GCCCCACCGA AGCATTAGAC GCGTTTATTG ACGGCAGCTC GAATGTGACA
ACCGCGCGTT TGCAATCGCG CAATAGCAAC CGCTGTATAG ATGTTAATGG CCGCAGTACA
GCAGATGGTG CCGATATCAT TCAGTGGAGT TGCCACAGCA ACGCCAACCA GCAATGGACT
TTTGAAGATA TGGGCAATAA CTACGTGCGA TTGCGCGTGG GCCACAGTAA TAAGTGCTTA
GATGTACTGG GTGCAGGCAC TGCCGATGGC GATAACGTAG TGCAGTGGGC ATGCCACAAT
AACGCCAATC AGCAATGGCT AAAAGAAGAC ATGGGCGATG GCTACTTCCG CTTAAAATCT
CGCGCCAGCG GTAAATGCGT AGATGTAAAC GCAGGCGGTG CTAACAACGG TGATTCTATT
ATTCAATGGA GTTGCCACAC TGGTTGGAAC CAGCAATGGA TGGTTTATTA G
 
Protein sequence
MTLSIKKALC AMAASTFILL QASNALAMAK GADVSWISEM EAEGYTFYND AGQQQDVLQI 
LKDHGMDSIR LRVWVNPAGG WYSSINDVIE KAQRAKAAGM RIMIDFHYSD SWADPGKQYK
PAAWTNYTLD GLMSAVWWHT YDSLVALKNA GITPEWVQVG NETNNGMLWE EGRASANMQN
YAWLVNSGYD AVKEVFPNTK AVVHLANCHD NANFRWIFDG LQANGGKWDV IGASIYPTNA
SGYSWSQANS LCEANLNDMQ SRYGSEVLIA EVGAPWDHPE AKAIVSDVIA KAQNAGATGV
FYWEPQASNW QGYTLGAWNP NTMRPTEALD AFIDGSSNVT TARLQSRNSN RCIDVNGRST
ADGADIIQWS CHSNANQQWT FEDMGNNYVR LRVGHSNKCL DVLGAGTADG DNVVQWACHN
NANQQWLKED MGDGYFRLKS RASGKCVDVN AGGANNGDSI IQWSCHTGWN QQWMVY
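
The protein sequence is the translation of the gene sequence under translation table 11, which assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. The following is a minimal Python sketch of how the two sequences could be cross-checked, not the database's own method. The helper names clean, gc_percent, and translate are hypothetical, and the input is assumed to be an unambiguous DNA string in the 10-base block layout shown above.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G). Translation
# table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACTTTAT CTATAAAGAA AGCGCTATGC"))  # MTLSIKKALC
```

Applied to the full gene sequence, translate should reproduce the protein sequence once the block spacing is removed, and gc_percent can be compared with the GC content field.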