Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_2986 |
Symbol | |
ID | 3967747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 3792301 |
End bp | 3795429 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637922083 |
Product | hypothetical protein |
Protein accession | YP_528455 |
Protein GI | 90022628 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.587543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000328281 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGAATAA TCTGTGCTAA AGCGGTAATG TACTTGGTAT TGCTGTGCCT TGTAGGGTGT GGCGGTGGAG CAAGCGATAG CAGTATGGAG GGCGGCTCGC CTGCTATCGA TGGTGGTAGC GAAGGGGATA ATAGCAATCC AGACTCGTTG CCAGACCCAG ACCCAGACCC AGACCCAGAC CCAGACCCAG ACCCAGACCC AGACCCAGAC CCAGACCCAG ACCCAGACCC AGACCCAGAC CCAGACTCAG ACCCAGACCC AGATGCTGGT GAGACTGTGG ATAACGACGA TGTTGTGGAG GCCAGCGCGT GCGTAGCAAT CGAAGCTGTT AACTTCGACC AGTTTAGCGG TATTAATGGT TTTGCTAACG CCACAACAGA TACCAAACTT GGCTTGAGTG CATTGCAAGT GGCGGACCGA TTAGCAACGG CTGCAGCAAT ACAAACCTAC GGCGGCGATT CTGCGAGTGT AATTTTTAAA CTCCATACCA TGCAGGAATC TGATGGCGAG TCATCATATT CGGTAAAGGT AAATGGCACG CTTGTAGGGC AAGTACAAAA TACCCGTATT CATGGAACAG AGATTGCCGA TTACAGTTTG CAAACTCATG TTGTGAATAA TCAGTTTTTT GCCATTAATA CCGGTGATGA GATACAAGTT GAATTCACCA ACGCGACCAA TGGCTTAGTG GTTGAAGGTG AAACCTCTGC CACATCACGC GGTAGGTGGC ATAGTTTAGA GTTGTGTACC AATGGTGACC CTGTTGTTCT GGAGCCGGTA GAGCCGGCAA CAGGAAGTTG CGAAATAAGT GGCGACCTAC ATACTTATGA CCGTGTTGAG TTATTGTGTA ATGGTTTAGC TGCAGCCGAA AGCGAAGCGG CCACTTTTAC CGATTATCGT TTTAACGTTA CGTTCAGCAA GGGCGAAGAA AGTATTGTTG TACCCGGTCA CTTCGCGGCA GATGCGCAAG CGGCTGATTC GGGGGCAGAA GAGGGAGATA CTTGGCGCGC CTACTTTATG CCTCCATCCG CAGGGGAGTG GGGCTACAAC GTTTCGTTTC GCTCGGGCAA TAATATTGCG GTAAGTGCGG AAGCGAACGC GGGGACGCCG GTTGCTAGCC TCGACGGTAA GCAAGGGACT TTTACAATAA CGGAAGGCAG CTTTACTGCA CCGGATATGC GCGCGCGGGG TTTGTTACAG CACAAACAGG GCGAACGGTA TTTGCGTTTT AGCAAAGATA ATACGGTATT TATACAGGCA GGCTTAGATA GCCCTGAAAA TATTTTTGGT TATTCCGGTT TTGATAACAC TACAAAATAT TTTTCTGCCT CAAGCTGCAA AGGTATATTG CACGATTTTG AACCGCATTT AAGCGACTGG CAAACTGGTG ACCCCACATG GAATAACGGC AAGGGTAAGT CGTTAATTGG TTTGGTAAAT TATTTATCTG GCCGGGGCGT AAACAGTGTT TATATCATGG CGAACACCGT ACAAGGCGAC GGTTGTGATG CGCACCCGTG GGTTAACTAC AACGATACGG GCACAGAAAA AACCTTTGAT GTGAGTAAGC TCGATCAGTG GGAACGCGTG TTGCAGTTTA TGCAGCAAAA AGGAATGCTT ATTCACATTA TTACGCAAGA GCAAGAAAAC GATCAGCTGC TAAATGGCGG CGAGCTGGGG TTAGAGCGAA AACTTTATTA CCGTGAACTT ATCTCGCGTT TTGCACACCA CCCCGCATTG CAATGGAACT TAGGGGAAGA AAACGGCAAT ACGCTAGACC AGCAAAAAAG TTTTGCCGCG TTTTTTAAGC AAACAGACCC CTACGAACAC GCGGTGTTAA TGCACACCTA CCCGGGCGAG CACGATTTGT ACGAGGGGTT ACTGGGCGAT GAAAATTTTG ATGGCCCTAC CTTCCAATAT GGTGGTATCC CTAATTCTGC GTCGAACACC GAAAATGTGT ACGAAAAAGC AAAAACATGG CTAAACAAAT CTACCGACGC TGGCCGCCCA TGGGTAGTGA CGTTTACAGA GGCCTCTGGC GCAAACGCAC CGCAACCTAA CACAAGCGTA GAAAAAAGGC AGCGTGTATT TTGGATGTGG GCCAGTGTGA TGTCAGGCGG GGCAGGTTTT GAATGGTATT TGAAAAACCC TGGCGCTGGC CACGCGTACG ATTTAGCGGT AGAAGATTTG CGTGAGTTCG ACGAGTTTTG GCTGCAAGGT GGCTATTTGG CTACATTCTT CCGCGATATT TTACAGCGTG AATTGAATAT TGATTTACAA ACACTGATGG TGGCTAACGA CGTAACAGAA ACCGATAGTG ATTGGGTGTT GGCCAAAGAG GGTGAAGCAT ACGTAATTTA TTTGCGCGAT GGCGGTACTA GCGATATTAC GTTGCCCGAT AACAAAGTTT ACCAAGTGAT ATGGTTTAAC CCTCGTACCG GAGCACGCTA CCAAGGCGAT ACCTTACAGG GGCAGGGCAG TGTACCGCTG GGTGTGGCCC CTAATGAAAT CACATTAGAT TGGGCGCTAG TGGTATACCC CATTGCCGAT GCTCCGCAAC CGAGCGGCGG CTATGTAGAA AATAATGGCC TAGTGGTAAT GGAAGCAGAA AATACGCCTA GCCATTTAGA CCTGTGGCAG CAGCTAACCG AGGTAGATGG CTACACGGGG GATGGCTATA TTCAATTCAA TGGCAACGAA GTGACTAACG GCCCTGCGAA GTCGCCATTA ACTTATCAGT TTACCGTTAA TACGGCTGGC AACTATTATT TGCATTTACG TTGCGCGCGC GAAACCATTG GTGATCGCAC CGATGTGGCT AACGATGCAT TTATCCGCTT AGAAGGGGAT TTTGAAAGCG GCGCCGCCGA AACGCCATTA AATTATTTAA CAACCGATTC AAAATATTTT GGCGGGGCAG ATAACCGTTT TGTGTGGGCA ACAGGTAACC GCTTAGATAG GGATCACGCT AAATGGCCGG TTGTTTACAA TTTAAAAGCA GGCGAAACCT ATACCTTTAC CATGTCTGGT CGCTCCAAAT TATTTAAAGT GGATCGCATT GTTTTTAGGC ACGAAAGTGT TACCAAGCAA GTGGCGGAAT CGATAAATAA TGAAGAAACG TTAGATTGA
|
Protein sequence | MRIICAKAVM YLVLLCLVGC GGGASDSSME GGSPAIDGGS EGDNSNPDSL PDPDPDPDPD PDPDPDPDPD PDPDPDPDPD PDSDPDPDAG ETVDNDDVVE ASACVAIEAV NFDQFSGING FANATTDTKL GLSALQVADR LATAAAIQTY GGDSASVIFK LHTMQESDGE SSYSVKVNGT LVGQVQNTRI HGTEIADYSL QTHVVNNQFF AINTGDEIQV EFTNATNGLV VEGETSATSR GRWHSLELCT NGDPVVLEPV EPATGSCEIS GDLHTYDRVE LLCNGLAAAE SEAATFTDYR FNVTFSKGEE SIVVPGHFAA DAQAADSGAE EGDTWRAYFM PPSAGEWGYN VSFRSGNNIA VSAEANAGTP VASLDGKQGT FTITEGSFTA PDMRARGLLQ HKQGERYLRF SKDNTVFIQA GLDSPENIFG YSGFDNTTKY FSASSCKGIL HDFEPHLSDW QTGDPTWNNG KGKSLIGLVN YLSGRGVNSV YIMANTVQGD GCDAHPWVNY NDTGTEKTFD VSKLDQWERV LQFMQQKGML IHIITQEQEN DQLLNGGELG LERKLYYREL ISRFAHHPAL QWNLGEENGN TLDQQKSFAA FFKQTDPYEH AVLMHTYPGE HDLYEGLLGD ENFDGPTFQY GGIPNSASNT ENVYEKAKTW LNKSTDAGRP WVVTFTEASG ANAPQPNTSV EKRQRVFWMW ASVMSGGAGF EWYLKNPGAG HAYDLAVEDL REFDEFWLQG GYLATFFRDI LQRELNIDLQ TLMVANDVTE TDSDWVLAKE GEAYVIYLRD GGTSDITLPD NKVYQVIWFN PRTGARYQGD TLQGQGSVPL GVAPNEITLD WALVVYPIAD APQPSGGYVE NNGLVVMEAE NTPSHLDLWQ QLTEVDGYTG DGYIQFNGNE VTNGPAKSPL TYQFTVNTAG NYYLHLRCAR ETIGDRTDVA NDAFIRLEGD FESGAAETPL NYLTTDSKYF GGADNRFVWA TGNRLDRDHA KWPVVYNLKA GETYTFTMSG RSKLFKVDRI VFRHESVTKQ VAESINNEET LD
|
| |