Gene Sde_3882 details

Gene Information

Locus tag: Sde_3882
Symbol:
ID: 3967107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4897212
End bp: 4900097
Gene Length: 2886 bp
Protein Length: 961 aa
Translation table: 11
GC content: 52%
IMG OID: 637922979
Product: beta-galactosidase
Protein accession: YP_529349
Protein GI: 90023522
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
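
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and that the stop codon is counted in the gene length but not in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4897212, 4900097)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the two results agree with the listed Gene Length (2886 bp) and Protein Length (961 aa).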


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0857074
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.216431
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTT CTCTCTATTA CCACGGGCGC GTAGTGGGGC GGCTGGTTTT GTTACTTGCT 
GTTCTGTTTG GCTCTAGTGC TGCGTTTGCA GCAGAGCAAG AGGACGGTCG TGAGCGGTTG
TCTTTAAATC GCGGCTGGTA TTTTCATTTG GGCGATGTGC CTATGGCGCC TATAAAGGGC
CACAGCGAAT CTTATATGAA TGCCAAGGCG GGCAATGCGC CAGGGCCTGC GGGTAGGGAG
TTTGACGATT CTGAATGGCG ACGTTTAGAT TTGCCGCACG ATTGGGCGGT AGAAGGGCCC
TTTGACCCCA ACGAAAATAT CTCGCAGGGC TACAGGCCGC GTGGCATAAG TTGGTACCGC
CGCTATTTGC GAGTGGAGGA GAGCGATAGA GGCCGCTATT TTGAACTGCA ATTTGATGCC
ATTGCCACCC ATGCCACCGT GTGGGTAAAC GGCAATGTGG TAAAGCGTAA CTATTCGGGC
TACAACGCGA GTTATATCGA TATTACCCCT TATATTCGTT ATGGCGATGC CATGAATACC
ATTGCGGTTA AGGTAGATGC CACGCAAATG GAAGGCTGGT GGTACGAGGG GGCGGGTATG
TATCGCCACA CTTGGTTGGT TAAAGCCAAC CCCGTGCATA TTGTTACCGA CGGCCTACAC
GCCACGCCGC GTTTAGAAAG CGAAACCCTA GAGGGTGGTA AATGGAGCAT ACCGGTTGAG
GTAACCCTAA ATAACAGCGG CGAAAATCTG CAAACCGTAA CGGTAGAGGT AACGGTTACG
GCACCCAATG GCAAGCAAGT GGCCAAGCAA AGTGGCAATG TAACTGTGCC GGTGTTGGGC
GAGGCGGTGG CTAAGTTACC CGTGACTATT CAGTCGCCTG CGCTGTGGAG CCACGAGCAA
ACCAATCTAT ATCGCGTAAA CGCTGTCGTT AAGCACGGCA AGCGCGTCAT TGATGAAATA
GCACTTTCTA CCGGCTTTAG AACGGTGCGT TTTGATTCGC AGCAGGGCTT CTTTTTAAAC
GATAAGCACG TGAAACTGCA GGGTGTGTGT ATTCATCAAG ATCACGCTGG CGTAGGGGTG
GCCGTGCCCA CTAGTATTTG GCAATACCGC CTGCGCCGTT TAAAAGAGCT GGGTGTAAAC
GCTATTCGTT TTTCACATAA TGCCCCCGCC GTCGAGGTGC TGGATTTGGT GGATTCCATG
GGCTTTTTGG TGATGGACGA AAACCGCAAC TTCAACCCAT CGCCCGATTA CATGCAGCAG
CTAGAATGGA TGGTTCGTCG CGATCGCCAT CACCCGGGCA TTATTTTGTG GTCGGTATTT
AACGAAGAGC CGGTACAGGC GTCGGAGGTA GGTTATCAAA TGGTGCGCCG CATGGTGGCC
GCCGTTAAAG CATTAGACGA TACACGCCCC GTAACCGCAG CTATGAACGG CGGTTTTTTT
AGCGACTTGA ATGTATCCCA CGCGGTGGAT GTACTGGGGG CAAACTACCA AGTGCCCGAT
TACGATCGCT TTCACGCGGC GCGTCCAGAA ATGCCCTTTA CCAGCTCGGA AGATACTTCT
GCGTTTATGA TGCGCGGCGA ATTCACCACC GATTACGATA AAAACCTTAT TGCCAGCTAC
GATGAAGACT TTGCCTTTTG GGGCAATAGT CACCGCGATG CTTGGCAGGC GGTAGCAACG
CGCGACTATG TAGCGGGTGC ATTTGTATGG ACAGGGTTTG ATTACCGCGG CGAACCCACG
CCACTGGCTT GGCCTTCGGT AAGCTCGTTT TTTGGCATTA TGGATTTAAA CGGCTTTGCC
AAAACCGCGT ATTACATTCA CCGCGCGCAA TGGGTGAAGG ATGAGCCGCA AGCGTATTTA
GCGCCCCATT GGAACTGGGC GGGTAAAGAA GGGCAGATTA TTCCCGTATT GGTGATGGCC
AATGTCGACA AAGTGCAATT GCTGCTTAAT GGCAAAAGCT TAGGCGAGCA AGTGGTTGAC
CCTTTTCAAA TGAACACCTT CGATGTGGCT TATCAGCCCG GCAAACTTGA AGTAATTGGC
TATAGCAGCG GTAAAGAGGT GGTGCGCAAT AGTGTAGAAA CCACCGGCAA AGCTGTGGCT
GTGCAGTTGG TGCCAGATCG CAAAGCGCTA GTGGGCGACG GTTTTGATGC TATGCCAATT
ACCGTGCAGG CGGTGGATGC AAAAGGGCGG GTTGTACCTA CCGATAACTC GCTTATCCAC
TTTGAAATAA GCGGTGCAGG GCAGTCGATT GGTCACGGCA ATGGCAACCC CAATTCCCAC
GAGGATGAGA AAGGCGCAAC GCGCCATTTA TTTAATGGGC TCGCCCAATT AATTGTGCAA
AGTCATTACG AGAGCAGCGG CAATATCACA GTAAAAGCCA GCTCGCCGGG GTTAAAAACC
GCAAAGGTAA GCATACCGGT TAAAAAAGTG GCGGCGGTGC CCTACGTGGC CTCGCAAACG
GCCCCCAATG TGCATTTGTC CGATTGGCGT ACGTCTCCCG TGGCGAGCCA GCGCCCTGAT
CCAACCCAAA AAGTGGCCGA TAACGATATG AATAGCTGGG GGTGGGGGCA GCCGCCGTTT
ATGTCGCAAG CCAAGGGCGA AACGGCACCC GCGCGCTACC GCTTGTATCG CACTAACTTT
ACCCCGCGCA AAAATTTAGC GCGCGGTGAT GGCGAGTTAT TTATAGGCGA TGTAGTCGGC
CGCGTGGAAG TATGGCTAAA CGACGAGCTG CTGTACAAAA AAGACAACGT GCGCAAACAA
AACCTACGAC TGCCCATCCC CAAAGGAGAA GGCAACCGCG AACTTACGTT TTTGCTAGAA
GATGAAGGCC AAGACAACGC CGTAATAGGC GGCCCAGTAA TTGTGCAGCC AAAGGGTAAG
AAGTAG
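
The gene sequence above is laid out in blocks of 10 bases, so it has to be joined before any computation. A minimal sketch follows, assuming the only non-sequence characters are spaces and line breaks. The helper names `clean_sequence` and `gc_content` are hypothetical. It shows how a GC fraction such as the GC content field could be recomputed from the sequence.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and upper-casing."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, as they appear in the record.
first_blocks = clean_sequence("ATGAAAATTT CTCTCTATTA")
print(len(first_blocks), gc_content(first_blocks))
```

Passing the full sequence text instead of the two example blocks gives the value for the whole gene.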
 
Protein sequence
MKISLYYHGR VVGRLVLLLA VLFGSSAAFA AEQEDGRERL SLNRGWYFHL GDVPMAPIKG 
HSESYMNAKA GNAPGPAGRE FDDSEWRRLD LPHDWAVEGP FDPNENISQG YRPRGISWYR
RYLRVEESDR GRYFELQFDA IATHATVWVN GNVVKRNYSG YNASYIDITP YIRYGDAMNT
IAVKVDATQM EGWWYEGAGM YRHTWLVKAN PVHIVTDGLH ATPRLESETL EGGKWSIPVE
VTLNNSGENL QTVTVEVTVT APNGKQVAKQ SGNVTVPVLG EAVAKLPVTI QSPALWSHEQ
TNLYRVNAVV KHGKRVIDEI ALSTGFRTVR FDSQQGFFLN DKHVKLQGVC IHQDHAGVGV
AVPTSIWQYR LRRLKELGVN AIRFSHNAPA VEVLDLVDSM GFLVMDENRN FNPSPDYMQQ
LEWMVRRDRH HPGIILWSVF NEEPVQASEV GYQMVRRMVA AVKALDDTRP VTAAMNGGFF
SDLNVSHAVD VLGANYQVPD YDRFHAARPE MPFTSSEDTS AFMMRGEFTT DYDKNLIASY
DEDFAFWGNS HRDAWQAVAT RDYVAGAFVW TGFDYRGEPT PLAWPSVSSF FGIMDLNGFA
KTAYYIHRAQ WVKDEPQAYL APHWNWAGKE GQIIPVLVMA NVDKVQLLLN GKSLGEQVVD
PFQMNTFDVA YQPGKLEVIG YSSGKEVVRN SVETTGKAVA VQLVPDRKAL VGDGFDAMPI
TVQAVDAKGR VVPTDNSLIH FEISGAGQSI GHGNGNPNSH EDEKGATRHL FNGLAQLIVQ
SHYESSGNIT VKASSPGLKT AKVSIPVKKV AAVPYVASQT APNVHLSDWR TSPVASQRPD
PTQKVADNDM NSWGWGQPPF MSQAKGETAP ARYRLYRTNF TPRKNLARGD GELFIGDVVG
RVEVWLNDEL LYKKDNVRKQ NLRLPIPKGE GNRELTFLLE DEGQDNAVIG GPVIVQPKGK
K