Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_2992 |
Symbol | |
ID | 3967753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 3805391 |
End bp | 3808339 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637922089 |
Product | helix-turn-helix, AraC type |
Protein accession | YP_528461 |
Protein GI | 90022634 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5520] O-Glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0557584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000421127 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAAACA TATTTCATTT AAAGAAAGCC GCGTTTATTT TTTGCGCTTT AACCAGCTGT TATAATTCGA CAGCGTCATC TCAACAAGAC GATTACAGTG TTACAGCTAC TGTATCTACC GAATTTAATC CCATGAGCTC GAGTTGGTAT ACCAACCCGT GGCCCGAATC GGATATTCCT CGCAGATTAG AACAACTAAC ACCTAGCGTT ATTACTCAGC TTGGGCAAAC GTCTGGTACG TTATTAGAGG TGGACCCTTC CACAACGTAT CAAACGTTGC TTGGTTTAGG TGCGTCGCTA GAGCATACAA CGGTTTACGC TATTCGAAAA AACAAAACAG CAGAACAACA AAAAGAAGTA TTGCGTTCAC TTATCGACCC TGTGCAGGGC ATGGGAATGA ATTTTTTTCG TGTATCAATA GGCACGTCAG ATTTTGCAGA TGGAACACGC GCAATACCAG CGCCGGATAA TGCGAAGGGG TGGTATTCCT ATCAAGATAC ACCCACGTCG CCGTTTTCTA TTGCTCGCGA CGAAAGCTTA GGCATTATTG AAACTATTCG TATGGCGGTA GAGGTTGGCG TAGAAACAAA TAATGAATTA AAAATTCTCG CTTCCCCATG GAGCCCGCCG CGCTGGATGC GTGAAGGCGA TAACATGGTA GATGGCGGCC CGCTTAAAGC GGATATGCTC GATGACTACG CGGCCTATTT GCGTAAATTT GTAGAAGCTT ATCAAGCAGA GGGTATTCCC ATTTATGCCT TAAGCATGCA AAACGAACGT CAGTTCGAAC CAGGGGCTTA CCCCGGCATG GTTATAACAT GGCAAATGGA GCGCGACCTA CTTATAGAGG TATACGAAAA CTTTCACAAT ATTGATGGCA ACTATGGCCC AGAGCTTGAT GTAAAACTGT GGACACTAGA CCATAATTTT GATTATTGGC AGCAAGCGAA ATTGCAGTTG GATTCGTTCA AAGCAATGGG CAAAGACCAT TACGTAGATG CCACTGCATT CCATCACTAC GGTGGTGTGT CTGAAAATAT GGGGCAGCTG CACGATGCTC ACCCAGATAA AGACGTGGTG TTTACCGAAG GTACTATTTG GGGGTTAAGT TCAGATGGTA ATAAGCGCAG CTACGAAGCA CTTATACGTC ATTTTCGCAA TTGGGCTACT GGCTATCTTT CGTGGGTAAC AATGACAACC CAAACTCTAA ACGAAGCAAA CCAAGGGCCA TATAACGGCT TAGGTGCATT CGATCCCACG CTATTGGTTA AATATGATGG CGACAACGCC AATTGGTATA AAACGCCAGA ATATTGGTTG ATGAGTCAAT TCAGTAAATA CCTAAAGCCC GGCGCCCTGC GTATAGAAAG TAATTACGGT TCGTTGCAAA CCGTTACCAA TGTTGCGTTT TTGAACCCTG ATGGCTATGT CGTTTTAATT GTGGCTAATT CCACCAACGG CGTGCAGCAA TTTGATGTGA TTAGTGAGGG GAATCAATTT AATGCATCGG TACCTGCGCG TTCGATTGCA ACCTATCGCT GGAAAGCGGG GTTGGGGCAA AGCCCGCACT CGTGGCAAGC ACCGCCCGAA TTGCCTCAGT TCCCATACGC AGAAAACTCG ATTGAAATTC CTGGGTTGGT AGAGGCTGAG CATTACGACC TTGGTGGCGC GGGTGCAGCG TACGCAGATG TAAGCACTGG TAACAACGGC GGCGTATTGC GCGCAGATGA TGTGGATATA GAGGCAAACG CAAACGGATA TCACATTGGT TGGTTAGATG CCGGCGAGTG GCTTGAGTAT TCCGTTAATG TAAACCAAGG GCAAAGCTTT GATGTGTTAA TTGCTAGTGC GTCTGCAAGT AGCGGCGGGC AGTTTCATTT TGAAGTAAAC GGCGAGTCGG TTTCTCCAAT ATTAGTAACA CCTGCAACTG GTGGTTGGAA AACGTTTGTA AGCACGCTTC ACCGTGGGCT ACAGCTGAAT GCAGGTGAGC AAGTGTTGCG CTTGGTAATA GATGGTGGCG AATTTAATAT TGATTCATTT CACATTGTGC CAGCAGGGAG CATGGAGCCA CCTGTGCAAG AGGATATTTG CGAAGAGGCG ACGGTAAGTA TCCTAGCAGG AAAAATCCAA GCGGAAAGTT ATTGTCTAGC AAGCGGTATT CAAACAGAAA ATACTAGCGA CCAAGGCGGT GGCGAAAATA TTGGTTGGAT AGACGCTGGT GATTCGGTGG ACTACGGCGT GAGCGTGGCG AATGCGGGTA GTTACACTCT TAATTTACGC GTAGCAAGTC AAAATGGAGG TGGAGAAATT GCGCTAAGTG TTGGTGATAC TGTTTTGGCA AATGTGCAAA TACCAGGCAC AGGCGGTTGG CAGAATTGGC AAACTATTAG CGTGCCTGTA CAGCTCTCTG CTGGGTATCA ACAGCTACAT TTTGTTTTTA TTAATGGTGG CTTAAATATT AATTGGTTTG AATTTGTAGA AACTGATAAT CCAACAGATC CAACAGATCC AACAGATCCC GCAGCAGAGC TAGAAGAGGG TGCCTACTAC ATAATTAATG AGGCTTCTGG CAAGGCGCTA GATGTATCTG GTGTATCTAC CAGTAACGGT ACCAATGTTC AGCAGTGGTC ATACAGCGGC GGTTTGAATC AGCAGTGGAT TGCCCAGCAC GTAAGTGGTA ATACATTTGA GCTGGTTAGT TTAAACAGTG GCTCTTGTTT AGATGCAGAT AATGGAAGTG ATAATGCACA CCAGTGGGCT TGCGAAGGCA ACACCAACCA GCAGTGGGTT ATTGAAGGGC AATCGGACGG CACTTATTTA ATTCGTACCA AAGCCGGTAA CGAAGTATTG GAGGTGCAAG GTGGCAGCGC TAACAACGGT GCAAATGTGC GTACTGCCAG CTCAGTAAAT AATAATCGTC AGAAGTGGCG GTTTAATGAT GTTGAGTAG
|
Protein sequence | MENIFHLKKA AFIFCALTSC YNSTASSQQD DYSVTATVST EFNPMSSSWY TNPWPESDIP RRLEQLTPSV ITQLGQTSGT LLEVDPSTTY QTLLGLGASL EHTTVYAIRK NKTAEQQKEV LRSLIDPVQG MGMNFFRVSI GTSDFADGTR AIPAPDNAKG WYSYQDTPTS PFSIARDESL GIIETIRMAV EVGVETNNEL KILASPWSPP RWMREGDNMV DGGPLKADML DDYAAYLRKF VEAYQAEGIP IYALSMQNER QFEPGAYPGM VITWQMERDL LIEVYENFHN IDGNYGPELD VKLWTLDHNF DYWQQAKLQL DSFKAMGKDH YVDATAFHHY GGVSENMGQL HDAHPDKDVV FTEGTIWGLS SDGNKRSYEA LIRHFRNWAT GYLSWVTMTT QTLNEANQGP YNGLGAFDPT LLVKYDGDNA NWYKTPEYWL MSQFSKYLKP GALRIESNYG SLQTVTNVAF LNPDGYVVLI VANSTNGVQQ FDVISEGNQF NASVPARSIA TYRWKAGLGQ SPHSWQAPPE LPQFPYAENS IEIPGLVEAE HYDLGGAGAA YADVSTGNNG GVLRADDVDI EANANGYHIG WLDAGEWLEY SVNVNQGQSF DVLIASASAS SGGQFHFEVN GESVSPILVT PATGGWKTFV STLHRGLQLN AGEQVLRLVI DGGEFNIDSF HIVPAGSMEP PVQEDICEEA TVSILAGKIQ AESYCLASGI QTENTSDQGG GENIGWIDAG DSVDYGVSVA NAGSYTLNLR VASQNGGGEI ALSVGDTVLA NVQIPGTGGW QNWQTISVPV QLSAGYQQLH FVFINGGLNI NWFEFVETDN PTDPTDPTDP AAELEEGAYY IINEASGKAL DVSGVSTSNG TNVQQWSYSG GLNQQWIAQH VSGNTFELVS LNSGSCLDAD NGSDNAHQWA CEGNTNQQWV IEGQSDGTYL IRTKAGNEVL EVQGGSANNG ANVRTASSVN NNRQKWRFND VE
|
| |