Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0766 |
Symbol | |
ID | 8724496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 930027 |
End bp | 933032 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003385628 |
Protein GI | 284035698 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACTT TTTCGTATTT TTCGTTGAAA CACAAACCCT GTTCCTTCAT GAGACACTTC TGTCAACTAA TACTCTATTT CTTCCTTGCT TTTCCCCTCC TTGCCCAAAG CCAGAAACGA GCCCTTACCG CTGCCGACTA CGATCGCTGG CAAAGTGTGC GGTCTGAAAA AATATCGGAC GATGGTCAAT GGATCGCTTA TCAGATCGAT CCCCAGGAAG GCGATGGCCG ATTGGAAGTT GTATCGTCTA CAGCCAACAC AAACCGAAAG CAGTATGTTT TCCCAAGGGG TTATATGGCC CAGTTTACCC CCGACAGCAA GTTTCTGGTG ATGCGGCTCA AAGCGCCGTA TATGGCCACC CGCACTGCTA AGCTCAAGAA GAAAAAAGCG GATGAACTGC CTAAAGATAG TTTGCTGGTC CTGAATCTGA GTACGGGCAA ACCCACTAAA CTCCCCAATG CAAAATCGTT TGTCTTCGGC AAGGATGCTG GTTCGTGGCT GGCTGTAGTG CAGGAACGGA AGGATGAACG CGCTGCACCA GCGTCCAAAA GTGCTTCGCT CAAGGATACG CTCACACCCT CGGCACCCGT TTCAACGACC ACCATCGCCA CGCGCAAAGG ACCGGTAAAA AAGCCAAAAG GCGACGATTT AACGCTGCTG AACATGGCCG ATGGCACGCG CAAAACGGTT AGATATGTAT CGAACGTGGT AATGTCGGAC AACGGCCAGG CTATTTTTTA CAGCAAAGAG TCGGCCAACG ACTCCTTAAA AACAGGCGAA TCAAGCGTTC CCGGCGTGTT TATATTTAAT ACGGCCAATG GGCAGACCAT GCTCGTTGAC ACCAGTTCAA AACGAAAAAT ATACAAAGGA CTGGCTATCG ATAAAGTCGG CCAGCAACTT ACCTGGATGG CTTCCGCCGA TAGCGCCGGT AGTGATGTAA AAGTATTCTC GCTCTATTAT AAGAACCTAG CGCCAGTGGT CGCTGCCAAA GGCAAAAAGT CGAAAACAGC AGCCCCCGAA GCACCTTTCA AAGTAATTGC CGATACATTG ACGGCAGCCT TACCCAAACG GTGGTCGGTG AATGAGTATC GGGAGCCAAA ATTCTCGGAC GATGGCAAAC GGCTTTATTT CTCCACGTCA CCCATTGCGC CCAAACCAAC CAAAGACACC CTGACACCCG AGGATGAAAA AGTTAAAATG GATATATGGA CCTGGACCGA CACCCGGTTG CAGCCCATGC AGCAACGGCG GCTCAAGGAA GAAAAGGAAC GTGGCTTTCT GACGGTGTAT GACATGGCAA GCGGAAAAGT TACGCCCCTG GCCAGCCGGG AAGCCCCAAC GGTGAGCTTT GATGCCAAGA CAAACTCTCG CTACTTGTTG GGCCTGAGTG ATCTGCCCTA TCAGGTTCAG GCCTCCTGGG ACCCCGGCCA TACCGACCTG TATCTGATCG ACACGCAAAC GGGCGAAAAG AAGCGAATTG CCAATGATAT TATGGCGTCG CAACCCAAAT TATCGCCCGA AGGCAAGTAC GCTTACTGGT TCGATGAGCG CGACTCGCTC TGGCGGGCGT GGTCGATAGC CGATAGTCGA CGGCTGGATC TGACCCGTGG CTTGCCAAGC AAGTTTTTCG ATGAAGAACA CGACACACCT AATTTACCCG GTGCTTATGG CTCGGCTGGC TGGACAACCG GCGATCGTTA CCTGTGGCTT TACGACCGCT ACGATATCTG GCAGTTTGAC CCAACCGGTA AGGAAAAGCC AACCAATCTG ACGGCTGGCT GGGGGCGCAA AAATAAAATT CGCCTGCGCC GGGCTGAAGT AGAAAACGAT GACGATACCC CCGGCCGACG GTCGGGAGCG GCTGAAAAAG CCATCGACCC CAAAGCTGAA CTATTTCTGA CGGGTATCTG GGAAAGCGAT AAAGCGACCG GCATTTTAAA GAGTAAGAAC GGTCTGGCCG TTGCTGAGCC CACTGTATTG ACCCGAAGCA ACCACCGTTA TTTTGGGCTG AACAAAGCAA AAAATGCCTC CGTTATGACG TTTTATCGGG GTAATTACAA AGAGTCGACC AACCTTTTCC TGACGGATAC GACCCTTGCG GCTCCAGTTC AGCTGACCCG AACGAATCCC CAACAGGACA GCGTTCGGTG GGGAAGTGCC GAAATTGTGA GCTGGATTGG TACAAACGGT ATCAAGCTGG AAGGCTTGCT CTTTAAACCG GAGGGCTTCG ACCCGACCAA GAAATACCCC ATGCTGACGT ATTTCTACGA GCGCAACGCC GAAACCCTGA ACGACTACCG GGCTCCTTCG CCAAGCCGGT CGACCATCAA TATTCCGTAC TGCGTTTCGA ACGGGTATCT GGTTTTCGTA CCGGACATCG TGTACACAAC CGGGCAGCCG GGCCCCAATG CCTACGACTG TGTTGTTCCC GGCGTGTTGA GCCTGCTCGA CAAAGGCTTC GTAGATCGTG ACCGGCTGGG TATTCAGGGA CAAAGCTGGG GTGGTTACCA AACCGCCTAC ATTATTACCC GGACCAATCT TTTCCGGGCG GCAGAAGCCG GTGCGCCAGT GGCAAACATG ACATCAGCTT ACGGCGGTAT TCGGTGGGAA ACGGGCATCA GCCGGGCGTT CCAGTACGAA AAAACACAGA GCCGCATTGG TGGAACCCTG TGGGACAAAC CAATGAACTA CATCGAAAAC TCGCCACTTT TCTTCGCCAA CCGAATTGAA ACGCCCCTGA TGATGACCCA TAACGACGCC GACGGAGCCG TTCCCTGGTA CCAGGGTATC GAAATGTTCT CGGCACTGCG TCGGTTGAAC AAGCCGGTCT GGATGCTGGT CTATAACGGG GAAGGCCATA ACCTGACCCA ACGGCATAAC GCCAAGGACC TGAGTATTCG GTTATACCAG TATTTCGATT ACTACCTGAA AGATGCCCCC ATGCCGGTCT GGATGAAAGA AGGCCGGAGC GCGGTAGAGA AAGACCGGGG TGAAATGAAG TACTGA
|
Protein sequence | MMTFSYFSLK HKPCSFMRHF CQLILYFFLA FPLLAQSQKR ALTAADYDRW QSVRSEKISD DGQWIAYQID PQEGDGRLEV VSSTANTNRK QYVFPRGYMA QFTPDSKFLV MRLKAPYMAT RTAKLKKKKA DELPKDSLLV LNLSTGKPTK LPNAKSFVFG KDAGSWLAVV QERKDERAAP ASKSASLKDT LTPSAPVSTT TIATRKGPVK KPKGDDLTLL NMADGTRKTV RYVSNVVMSD NGQAIFYSKE SANDSLKTGE SSVPGVFIFN TANGQTMLVD TSSKRKIYKG LAIDKVGQQL TWMASADSAG SDVKVFSLYY KNLAPVVAAK GKKSKTAAPE APFKVIADTL TAALPKRWSV NEYREPKFSD DGKRLYFSTS PIAPKPTKDT LTPEDEKVKM DIWTWTDTRL QPMQQRRLKE EKERGFLTVY DMASGKVTPL ASREAPTVSF DAKTNSRYLL GLSDLPYQVQ ASWDPGHTDL YLIDTQTGEK KRIANDIMAS QPKLSPEGKY AYWFDERDSL WRAWSIADSR RLDLTRGLPS KFFDEEHDTP NLPGAYGSAG WTTGDRYLWL YDRYDIWQFD PTGKEKPTNL TAGWGRKNKI RLRRAEVEND DDTPGRRSGA AEKAIDPKAE LFLTGIWESD KATGILKSKN GLAVAEPTVL TRSNHRYFGL NKAKNASVMT FYRGNYKEST NLFLTDTTLA APVQLTRTNP QQDSVRWGSA EIVSWIGTNG IKLEGLLFKP EGFDPTKKYP MLTYFYERNA ETLNDYRAPS PSRSTINIPY CVSNGYLVFV PDIVYTTGQP GPNAYDCVVP GVLSLLDKGF VDRDRLGIQG QSWGGYQTAY IITRTNLFRA AEAGAPVANM TSAYGGIRWE TGISRAFQYE KTQSRIGGTL WDKPMNYIEN SPLFFANRIE TPLMMTHNDA DGAVPWYQGI EMFSALRRLN KPVWMLVYNG EGHNLTQRHN AKDLSIRLYQ YFDYYLKDAP MPVWMKEGRS AVEKDRGEMK Y
|
| |