Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_3882 |
Symbol | |
ID | 3967107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 4897212 |
End bp | 4900097 |
Gene Length | 2886 bp |
Protein Length | 961 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637922979 |
Product | beta-galactosidase |
Protein accession | YP_529349 |
Protein GI | 90023522 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0857074 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.216431 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTT CTCTCTATTA CCACGGGCGC GTAGTGGGGC GGCTGGTTTT GTTACTTGCT GTTCTGTTTG GCTCTAGTGC TGCGTTTGCA GCAGAGCAAG AGGACGGTCG TGAGCGGTTG TCTTTAAATC GCGGCTGGTA TTTTCATTTG GGCGATGTGC CTATGGCGCC TATAAAGGGC CACAGCGAAT CTTATATGAA TGCCAAGGCG GGCAATGCGC CAGGGCCTGC GGGTAGGGAG TTTGACGATT CTGAATGGCG ACGTTTAGAT TTGCCGCACG ATTGGGCGGT AGAAGGGCCC TTTGACCCCA ACGAAAATAT CTCGCAGGGC TACAGGCCGC GTGGCATAAG TTGGTACCGC CGCTATTTGC GAGTGGAGGA GAGCGATAGA GGCCGCTATT TTGAACTGCA ATTTGATGCC ATTGCCACCC ATGCCACCGT GTGGGTAAAC GGCAATGTGG TAAAGCGTAA CTATTCGGGC TACAACGCGA GTTATATCGA TATTACCCCT TATATTCGTT ATGGCGATGC CATGAATACC ATTGCGGTTA AGGTAGATGC CACGCAAATG GAAGGCTGGT GGTACGAGGG GGCGGGTATG TATCGCCACA CTTGGTTGGT TAAAGCCAAC CCCGTGCATA TTGTTACCGA CGGCCTACAC GCCACGCCGC GTTTAGAAAG CGAAACCCTA GAGGGTGGTA AATGGAGCAT ACCGGTTGAG GTAACCCTAA ATAACAGCGG CGAAAATCTG CAAACCGTAA CGGTAGAGGT AACGGTTACG GCACCCAATG GCAAGCAAGT GGCCAAGCAA AGTGGCAATG TAACTGTGCC GGTGTTGGGC GAGGCGGTGG CTAAGTTACC CGTGACTATT CAGTCGCCTG CGCTGTGGAG CCACGAGCAA ACCAATCTAT ATCGCGTAAA CGCTGTCGTT AAGCACGGCA AGCGCGTCAT TGATGAAATA GCACTTTCTA CCGGCTTTAG AACGGTGCGT TTTGATTCGC AGCAGGGCTT CTTTTTAAAC GATAAGCACG TGAAACTGCA GGGTGTGTGT ATTCATCAAG ATCACGCTGG CGTAGGGGTG GCCGTGCCCA CTAGTATTTG GCAATACCGC CTGCGCCGTT TAAAAGAGCT GGGTGTAAAC GCTATTCGTT TTTCACATAA TGCCCCCGCC GTCGAGGTGC TGGATTTGGT GGATTCCATG GGCTTTTTGG TGATGGACGA AAACCGCAAC TTCAACCCAT CGCCCGATTA CATGCAGCAG CTAGAATGGA TGGTTCGTCG CGATCGCCAT CACCCGGGCA TTATTTTGTG GTCGGTATTT AACGAAGAGC CGGTACAGGC GTCGGAGGTA GGTTATCAAA TGGTGCGCCG CATGGTGGCC GCCGTTAAAG CATTAGACGA TACACGCCCC GTAACCGCAG CTATGAACGG CGGTTTTTTT AGCGACTTGA ATGTATCCCA CGCGGTGGAT GTACTGGGGG CAAACTACCA AGTGCCCGAT TACGATCGCT TTCACGCGGC GCGTCCAGAA ATGCCCTTTA CCAGCTCGGA AGATACTTCT GCGTTTATGA TGCGCGGCGA ATTCACCACC GATTACGATA AAAACCTTAT TGCCAGCTAC GATGAAGACT TTGCCTTTTG GGGCAATAGT CACCGCGATG CTTGGCAGGC GGTAGCAACG CGCGACTATG TAGCGGGTGC ATTTGTATGG ACAGGGTTTG ATTACCGCGG CGAACCCACG CCACTGGCTT GGCCTTCGGT AAGCTCGTTT TTTGGCATTA TGGATTTAAA CGGCTTTGCC AAAACCGCGT ATTACATTCA CCGCGCGCAA TGGGTGAAGG ATGAGCCGCA AGCGTATTTA GCGCCCCATT GGAACTGGGC GGGTAAAGAA GGGCAGATTA TTCCCGTATT GGTGATGGCC AATGTCGACA AAGTGCAATT GCTGCTTAAT GGCAAAAGCT TAGGCGAGCA AGTGGTTGAC CCTTTTCAAA TGAACACCTT CGATGTGGCT TATCAGCCCG GCAAACTTGA AGTAATTGGC TATAGCAGCG GTAAAGAGGT GGTGCGCAAT AGTGTAGAAA CCACCGGCAA AGCTGTGGCT GTGCAGTTGG TGCCAGATCG CAAAGCGCTA GTGGGCGACG GTTTTGATGC TATGCCAATT ACCGTGCAGG CGGTGGATGC AAAAGGGCGG GTTGTACCTA CCGATAACTC GCTTATCCAC TTTGAAATAA GCGGTGCAGG GCAGTCGATT GGTCACGGCA ATGGCAACCC CAATTCCCAC GAGGATGAGA AAGGCGCAAC GCGCCATTTA TTTAATGGGC TCGCCCAATT AATTGTGCAA AGTCATTACG AGAGCAGCGG CAATATCACA GTAAAAGCCA GCTCGCCGGG GTTAAAAACC GCAAAGGTAA GCATACCGGT TAAAAAAGTG GCGGCGGTGC CCTACGTGGC CTCGCAAACG GCCCCCAATG TGCATTTGTC CGATTGGCGT ACGTCTCCCG TGGCGAGCCA GCGCCCTGAT CCAACCCAAA AAGTGGCCGA TAACGATATG AATAGCTGGG GGTGGGGGCA GCCGCCGTTT ATGTCGCAAG CCAAGGGCGA AACGGCACCC GCGCGCTACC GCTTGTATCG CACTAACTTT ACCCCGCGCA AAAATTTAGC GCGCGGTGAT GGCGAGTTAT TTATAGGCGA TGTAGTCGGC CGCGTGGAAG TATGGCTAAA CGACGAGCTG CTGTACAAAA AAGACAACGT GCGCAAACAA AACCTACGAC TGCCCATCCC CAAAGGAGAA GGCAACCGCG AACTTACGTT TTTGCTAGAA GATGAAGGCC AAGACAACGC CGTAATAGGC GGCCCAGTAA TTGTGCAGCC AAAGGGTAAG AAGTAG
|
Protein sequence | MKISLYYHGR VVGRLVLLLA VLFGSSAAFA AEQEDGRERL SLNRGWYFHL GDVPMAPIKG HSESYMNAKA GNAPGPAGRE FDDSEWRRLD LPHDWAVEGP FDPNENISQG YRPRGISWYR RYLRVEESDR GRYFELQFDA IATHATVWVN GNVVKRNYSG YNASYIDITP YIRYGDAMNT IAVKVDATQM EGWWYEGAGM YRHTWLVKAN PVHIVTDGLH ATPRLESETL EGGKWSIPVE VTLNNSGENL QTVTVEVTVT APNGKQVAKQ SGNVTVPVLG EAVAKLPVTI QSPALWSHEQ TNLYRVNAVV KHGKRVIDEI ALSTGFRTVR FDSQQGFFLN DKHVKLQGVC IHQDHAGVGV AVPTSIWQYR LRRLKELGVN AIRFSHNAPA VEVLDLVDSM GFLVMDENRN FNPSPDYMQQ LEWMVRRDRH HPGIILWSVF NEEPVQASEV GYQMVRRMVA AVKALDDTRP VTAAMNGGFF SDLNVSHAVD VLGANYQVPD YDRFHAARPE MPFTSSEDTS AFMMRGEFTT DYDKNLIASY DEDFAFWGNS HRDAWQAVAT RDYVAGAFVW TGFDYRGEPT PLAWPSVSSF FGIMDLNGFA KTAYYIHRAQ WVKDEPQAYL APHWNWAGKE GQIIPVLVMA NVDKVQLLLN GKSLGEQVVD PFQMNTFDVA YQPGKLEVIG YSSGKEVVRN SVETTGKAVA VQLVPDRKAL VGDGFDAMPI TVQAVDAKGR VVPTDNSLIH FEISGAGQSI GHGNGNPNSH EDEKGATRHL FNGLAQLIVQ SHYESSGNIT VKASSPGLKT AKVSIPVKKV AAVPYVASQT APNVHLSDWR TSPVASQRPD PTQKVADNDM NSWGWGQPPF MSQAKGETAP ARYRLYRTNF TPRKNLARGD GELFIGDVVG RVEVWLNDEL LYKKDNVRKQ NLRLPIPKGE GNRELTFLLE DEGQDNAVIG GPVIVQPKGK K
|
| |