Gene Sde_2827 details

Gene Information

Locus tag: Sde_2827
Symbol:
ID: 3968230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 3564704
End bp: 3566656
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 48%
IMG OID: 637921924
Product: arabinogalactan endo-1,4-beta-galactosidase
Protein accession: YP_528296
Protein GI: 90022469
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3867] Arabinogalactan endo-1,4-beta-galactosidase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000305574
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00171331
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAAGACAA AAATAATCCA GCTATTTCTA CTTGCCTTAA CGTGCAGTTT TATGGCGAGT 
TGTGGCGGTG CCAGTGACAG CAACCCGCCA CCGGTGACTG AGCCAGAACC CCAGCCTGAA
CAAGAACCAG AGCAGGAGCC AGAAGCTGAA CCAGAAGCGG AGCCTGTTGC ATTCTATTTC
GGTAACGACC TTTCCTACGT AAACGAAATG GAAGACTGCG GCGCAGTGTA TAAAGATGCA
GGCAGTGTAG TAGACCCCTA CGAAGTGATC GCCAATCACG GCGGCAACCT TGTGCGCGTG
CGTCACTGGA ACGACCCCTA CTGGCAGGCG CTTATTACCC AGCCAGAATC TGTAGCGGCA
AATTGGAAAG CTAATTACAG TGGGCTTGAA GATGTAACAG AAACAATTCG ACGATCAAAA
GCCGCCGGTA TGGAAGTGCT GCTCGACTTT CATTTCTCAG ATATTTGGGC AGACCCCGGC
AGGCAAACAA CGCCGCGCGC ATGGGAAAAT GACTTTGGCG ATGAAGACGC CATGGCTGCG
CATATATACG ATTACGTAAC ATCAGTACTC ACAGGGTTAA ACGACGAAGG TTTAATGCCC
GAGCTTATTC AAATTGGTAA CGAATCTAAC TCCGGCATGA TGACGACTCA AAACCTCATT
ATAGAAATGA ATGATGCGGG TACGGGGTTA AATGTGAGTA AAGGCGGGCA AACAAATTAC
TCAGATCAAT ATGTTGCGCG TATGTATAAC TCTGCGATTT CTGCTGTGCG CGATATTAGT
GAAGGGATGA CCAACGCTCC GCGTATTGCT ATCCACGTGG CAGGTGCAGA TAAAGCCGTC
GCATTTTTTG ATAAGTTAAA AAGCATCGGA GTAACGGATA TTGATATCGC GGGCTTCTCG
TTTTATTACG GTTGGGAGCA AGCACCAATA GAAGACGTTG CAAGCATGAT TGCAACCTTA
AAGGAACGTC ACCCTAATTT AGACCCGTTA ATGCTTGAAA CAGGCTACCT GTGGGATGAA
GAAAACATCG ATAGCTTAGG CAATATTATT GGCATTGCCG ACCCCGCATA TTTACCTGTG
AGCAAACAGA ACCAACTTAA ATATCTTACC GATTTATCGC AAGCGGTTGC TGATGCTGGT
GGTATTGGAG TGGTGTTTTG GGAGCCATCT TGGGTATCAA CCGAATGTCG CACACCTTGG
GGGCAGGGCT CATCTCACGA GCATGTTGCT TACTTCGATC ACCGCGACGG CTTAAACTTT
CATATTGGTG GCCAATGGAT GGAGGTTACC AAGTTAAGCG AAACCCCAGA AGCGGGGCTA
GCTACTACCT TTAGAGTGGA TATGACTGGC CAAGACACCA GCGCAGGGGT ATTTATTCGC
GGGGCGTTTA CCGAAGACAC ATTGCAGCCC ATGCTATATG AAGGCGAAAA CATTTATAGT
TACACCACGC ATATCCAAGC AGCGCAAAGC GGAAGCTACC ACTATGCGAT TGGCTTAAAA
AATGGTACGC GCGAAACGGT TCCTAGCGAA TGTGCAAACC CAGAAGATAC GTTAAATCGT
TTATATACGG TGGGCGAAAA TGGGGAGCAG TTAGTTACCG CAGTTTGGGC AAGCTGTGAT
GTTTTCGATC CGCAAGCGGC TGGGCCAACA ACCTTAACGC TTAATGTAGA TATGACTGGT
GTAGATGTAA GTGGTGGTGT GTATGTTGCA GGCGACTTAA ATGCTTGGAC AATCACCGAG
CTTACACAAG TTGGCGCTAG CGCAATTTAT ACCATTAGTT ACGATTTAGC CGTAGGTGCA
GAAGGTGGCT ACTACTTCTT GAATGGCAGC GATTGGGGCG ATAGGGAAAC AATACCAGAA
GAATGTGTGG GCTACTATGA TGCAGACCGC GGCTTTTTGG TGGAAGAGCA AAGCCCACAG
GTATTGGATT TAGTGTGGAG TAGTTGTCAA TAA
 
Protein sequence
MKTKIIQLFL LALTCSFMAS CGGASDSNPP PVTEPEPQPE QEPEQEPEAE PEAEPVAFYF 
GNDLSYVNEM EDCGAVYKDA GSVVDPYEVI ANHGGNLVRV RHWNDPYWQA LITQPESVAA
NWKANYSGLE DVTETIRRSK AAGMEVLLDF HFSDIWADPG RQTTPRAWEN DFGDEDAMAA
HIYDYVTSVL TGLNDEGLMP ELIQIGNESN SGMMTTQNLI IEMNDAGTGL NVSKGGQTNY
SDQYVARMYN SAISAVRDIS EGMTNAPRIA IHVAGADKAV AFFDKLKSIG VTDIDIAGFS
FYYGWEQAPI EDVASMIATL KERHPNLDPL MLETGYLWDE ENIDSLGNII GIADPAYLPV
SKQNQLKYLT DLSQAVADAG GIGVVFWEPS WVSTECRTPW GQGSSHEHVA YFDHRDGLNF
HIGGQWMEVT KLSETPEAGL ATTFRVDMTG QDTSAGVFIR GAFTEDTLQP MLYEGENIYS
YTTHIQAAQS GSYHYAIGLK NGTRETVPSE CANPEDTLNR LYTVGENGEQ LVTAVWASCD
VFDPQAAGPT TLTLNVDMTG VDVSGGVYVA GDLNAWTITE LTQVGASAIY TISYDLAVGA
EGGYYFLNGS DWGDRETIPE ECVGYYDADR GFLVEEQSPQ VLDLVWSSCQ
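
Consistency check (illustrative)

The coordinates and lengths listed under Gene Information can be cross-checked against each other. The sketch below is a minimal illustration and not part of the original record. The function names are hypothetical, and it assumes inclusive 1-based coordinates and a coding sequence that ends in a single stop codon. Both assumptions agree with the values listed above.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks of 10."""
    return "".join(text.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    length = gene_length(3564704, 3566656)
    print(length)                           # 1953
    print(expected_protein_length(length))  # 650
```

To check the GC content field, pass the full gene sequence text to `gc_percent`; the spacing and line breaks are stripped before counting.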