Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_3099 |
Symbol | |
ID | 3965480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 3957034 |
End bp | 3960384 |
Gene Length | 3351 bp |
Protein Length | 1116 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637922196 |
Product | transglutaminase-like domain-containing protein |
Protein accession | YP_528568 |
Protein GI | 90022741 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.595628 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTATCC GTGTCGCTTT ACAGCACAAC ACGTATTATA AATTCGACCG CATGGTTAAC CTGTCGCCCC ATGTCGTGCG CTTGCGCCCC GCGCCCCATA GCCGTACGCC TATTACCAGC TACAGCCTTA AAGTAAAACC CGAAACGCAT TTTATAAACT GGCAGCAAGA CGCCTTTGGC AACTACCTTG CGCGGTTGGT GTTCCCCGAT AAAACCAAAG AGTTTTCGGT AGAAGTAGAA GTTATTGCCG ATATGACGGT AATCAACCCG TTCGATTTCT TCCTAGAAGA GTATGCCGAA AAATTTCCGT TTGAATACGA AGATCAACAA AAGAAAGAGT TGCTGCCCTA CCTAGAAGCC AAAGATTACG GCCCCAAGTT TGCAGAGCTA GTAAAAGGTG TATCGCTTAA ATCTAGGCCC ACCAACGACT TTTTAGTAGA GCTTAACCAA AGCATTCAAA AAATAGTGGA TTACACCATC CGCCTAGAGC CGGGTGTGCA AACGCCAGAA GAAACCCTAG AGAAAAAGCT GGGTTCCTGT CGCGACTCGG CGTGGTTACT GGTGCAGTTG TTCCGCCACT TGGGCGTAGC TGCGCGCTTT GCGTCGGGTT ATTTGGTGCA GTTAAAAGCC GACGAAAAAT CATTAGATGG CCCCTCGGGC GCAGAAGAAG ATTTCACTGA CTTACACGCG TGGTGCGAAG TATTTTTACC CGGTGCAGGC TGGGTAGGGC TAGACCCAAC CTCGGGCTTG TTTGCCAGCG AAGGCCATAT CCCACTTGCG TGTACGCCAG ACCCAAGTTC TGCCGCACCC ATTGCCGGCT TTACCGATAA GTGCGAAGTA GAATTCGACT TTTTAAATAC CGTCGCGCGC ATCCACGAAG ACCCGCGCGT AACCAAACCT TACACCGAGC AGCAGTGGCA AGAAGTACTT GCTCTAGGTA AATTTGTAGA TAGCAAATTA GAAGCCGGTG ATGTGCGCCT TACCATGGGT GGCGAGCCAA CGTTTGTTTC TATAGACGAT ATGGAATCGG CACAGTGGAA CACCGCCGCC CTAGGCGAAG ATAAGCTGCG CTTAGCGAAA ACCCTACTGT TAAAACTGCG CGATCACTTT GCCCCGCACG GCTTACTGCA TTACGGTCAA GGTAAGTGGT ACCCGGGCGA AGAAATTCCG CGCTGGGCGC TGGGTTTATT TTGGCGTAAA GATAACGAGC CACTGTGGGC CGATCACAAA CTCCTAGCGC GAGTAGATAA AGACTACGGC CACGGTTTAA CTACCAGCAC CAAATTTGCA CGCAAACTGG CTGCCAAACT GGGCCTGGAA AAAGACTTTG CCCAACCCGC CTATGAAGAT GGTTTGCATT ACCTGCTAGA AGAACGCTCG CTGCCCAACA ACATAGATAA ACTGTGCAAC TCGGTAATGC GCAACGATTT AAGCCGTAAA CGCTTAGTGC ATTTACTTGA AAAGGGCATA GATAAACCCA CAGGTTTTGT ACTGCCCCTA GCCAAAGACT TAGCCAACAA TTGGATAAGT AGCAAATGGC CAATGCGACG CGAGCGCATT GTGCTTATTC CAGGTGACTC CCCCATGGGG CTGCGTTTGC CATTGGGGTC GCTACCGCTC GAAGAAGAAA AAGAACTCGA TGTAAAACCC GATCCGTTCG AGCAACGCCA GTCGTTGGCT CCGCACAGCG AATTGCTACA AGGCGCAGAA AAAATTCAGC CGCAAGTGGC CGAGCCGGTA AAAGAACCAA ACGCAAAACA AACGCAAACC GTACCCGTTG TGCGCACCGC GTTATGTGTA GAAAGCCGCA AGGGCAAACT GCATTTATTT ATTCCGCCCA TTCCGCTATT GGATGATTAT GTAGCGCTTA TTGCGGCCAT AGAAGCCGTA GCCAAAGAAA TGGATGTGCC GGTAATTATT GAAGGCTACG AGCCACCGCG CGATTCGCGC CTTGTTAAAT TATTGGTTAC ACCAGACCCA GGTGTAATAG AAGTCAACAT TCACCCAGCG AACAACTGGG ATGAAATAGT CGCCACTACT TCCGAGCTAT ATAAAGCCGC ACGCGAGTCG CGTTTAGGCA CCGAGAAGTT TATGTTAGAT GGTCGTCACT CCGGTACCGG CGGCGGCAAC CACGTAACCC TAGGTGCGGC CACGCCCGCC GATAGCCCAT TTTTGCGCAG GCCAGATTTA CTGCGCAGCT TTGTGACCTA TTGGCAGCAC CACCCCAGCT TGTCGTATTT GTTCTCTGGT GGGTTTGTGG GGCCAACCAG TCAGGCGCCG CGCGCCGACG AAGGCCGCGA CGAAATGCTT TACGAAATGG AAATAGCTTT CGAGCAAATG CCAGATGGTT TTGTCAACGA ACCGTGGCTA GTCGATCGAC TAATGCGCAA CCTGCTAATA GACATTACCG GCAACACGCA CCGGGCAGAA TTTTGTATAG ACAAACTCTA CTCGCCAGAT TCACCCACAG GGCGGCTGGG CATATTGGAG TTCCGCGGTT TTGAAATGCC GCCGCACTAT CAAATGTCGC TGGTACAAAT GTTGTTAATT CGCGCCTTAA CCGCGCGTTT TTGGCAAAAC CCTTACAAGC AACCGCTGGT GCGCTGGGGT ACGTTGCTAC ACGATAAATT TATGTTGCCT CACCACGTGT GGGCCGACGT AAAAGATGTG GTTAAAGACC TAAACGACCA CGGTTTTCCG TTCCGCGAAG AGTGGTTGTT GCCGTTCCAA GAATTCCGCT TCCCCCATTA CGGTCGGGTA GAAGTAGACG ATATCGAATT GGAGCTGTGC TGGGCCGTAG AGCCTTGGCA TGTGCTGGGC GAAGAGATAG GCAGCTCGGG TACTGCGCGC TACGTCGACT CGTCGGTAGA GCGCTTACAG GTAAAACTTA CAGGCTTAAC TGAGGGGCGG CATGTGGTTG CTTGTAACGG TCGCCGTGTG CCGCTGCGCA ACACAGGTAA AAAAGGCGAG TATGTTGCTG GGGTACGCTA CCGCGCCTGG GCACCGCCTT CGGCATTGCA CCCTACATTG GGCACGCACA CACCATTGGT GTTCGATATT ATTGATGCAT GGAATGGCCG CGCCATTGGC GGTTGTACTT ACCATGTATC GCACCCGGGC GGTCGCACTT ACGAAACCTT CCCTGTAAAC GCGTTCGAAG CCGAGTCGCG CAGGGTAAAT AGGTTCGATC AAATGGGGCA CACCCCTGGG CCAATTACCC CGCGCCCAGA CCTAAACGCA GTACGCGAGT TCTTCCCGCA CGGTAAGTTA CCTAAGCCTA TGGCGCCGCC GCCAGAAGAA CCAGCAGGTG AATACCCATA CACCTTAGAC TTAAGACGAA AGCCTAGATA A
|
Protein sequence | MTIRVALQHN TYYKFDRMVN LSPHVVRLRP APHSRTPITS YSLKVKPETH FINWQQDAFG NYLARLVFPD KTKEFSVEVE VIADMTVINP FDFFLEEYAE KFPFEYEDQQ KKELLPYLEA KDYGPKFAEL VKGVSLKSRP TNDFLVELNQ SIQKIVDYTI RLEPGVQTPE ETLEKKLGSC RDSAWLLVQL FRHLGVAARF ASGYLVQLKA DEKSLDGPSG AEEDFTDLHA WCEVFLPGAG WVGLDPTSGL FASEGHIPLA CTPDPSSAAP IAGFTDKCEV EFDFLNTVAR IHEDPRVTKP YTEQQWQEVL ALGKFVDSKL EAGDVRLTMG GEPTFVSIDD MESAQWNTAA LGEDKLRLAK TLLLKLRDHF APHGLLHYGQ GKWYPGEEIP RWALGLFWRK DNEPLWADHK LLARVDKDYG HGLTTSTKFA RKLAAKLGLE KDFAQPAYED GLHYLLEERS LPNNIDKLCN SVMRNDLSRK RLVHLLEKGI DKPTGFVLPL AKDLANNWIS SKWPMRRERI VLIPGDSPMG LRLPLGSLPL EEEKELDVKP DPFEQRQSLA PHSELLQGAE KIQPQVAEPV KEPNAKQTQT VPVVRTALCV ESRKGKLHLF IPPIPLLDDY VALIAAIEAV AKEMDVPVII EGYEPPRDSR LVKLLVTPDP GVIEVNIHPA NNWDEIVATT SELYKAARES RLGTEKFMLD GRHSGTGGGN HVTLGAATPA DSPFLRRPDL LRSFVTYWQH HPSLSYLFSG GFVGPTSQAP RADEGRDEML YEMEIAFEQM PDGFVNEPWL VDRLMRNLLI DITGNTHRAE FCIDKLYSPD SPTGRLGILE FRGFEMPPHY QMSLVQMLLI RALTARFWQN PYKQPLVRWG TLLHDKFMLP HHVWADVKDV VKDLNDHGFP FREEWLLPFQ EFRFPHYGRV EVDDIELELC WAVEPWHVLG EEIGSSGTAR YVDSSVERLQ VKLTGLTEGR HVVACNGRRV PLRNTGKKGE YVAGVRYRAW APPSALHPTL GTHTPLVFDI IDAWNGRAIG GCTYHVSHPG GRTYETFPVN AFEAESRRVN RFDQMGHTPG PITPRPDLNA VREFFPHGKL PKPMAPPPEE PAGEYPYTLD LRRKPR
|
| |