Gene Slin_0766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0766 
Symbol 
ID8724496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp930027 
End bp933032 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content52% 
IMG OID 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003385628 
Protein GI284035698 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACTT TTTCGTATTT TTCGTTGAAA CACAAACCCT GTTCCTTCAT GAGACACTTC 
TGTCAACTAA TACTCTATTT CTTCCTTGCT TTTCCCCTCC TTGCCCAAAG CCAGAAACGA
GCCCTTACCG CTGCCGACTA CGATCGCTGG CAAAGTGTGC GGTCTGAAAA AATATCGGAC
GATGGTCAAT GGATCGCTTA TCAGATCGAT CCCCAGGAAG GCGATGGCCG ATTGGAAGTT
GTATCGTCTA CAGCCAACAC AAACCGAAAG CAGTATGTTT TCCCAAGGGG TTATATGGCC
CAGTTTACCC CCGACAGCAA GTTTCTGGTG ATGCGGCTCA AAGCGCCGTA TATGGCCACC
CGCACTGCTA AGCTCAAGAA GAAAAAAGCG GATGAACTGC CTAAAGATAG TTTGCTGGTC
CTGAATCTGA GTACGGGCAA ACCCACTAAA CTCCCCAATG CAAAATCGTT TGTCTTCGGC
AAGGATGCTG GTTCGTGGCT GGCTGTAGTG CAGGAACGGA AGGATGAACG CGCTGCACCA
GCGTCCAAAA GTGCTTCGCT CAAGGATACG CTCACACCCT CGGCACCCGT TTCAACGACC
ACCATCGCCA CGCGCAAAGG ACCGGTAAAA AAGCCAAAAG GCGACGATTT AACGCTGCTG
AACATGGCCG ATGGCACGCG CAAAACGGTT AGATATGTAT CGAACGTGGT AATGTCGGAC
AACGGCCAGG CTATTTTTTA CAGCAAAGAG TCGGCCAACG ACTCCTTAAA AACAGGCGAA
TCAAGCGTTC CCGGCGTGTT TATATTTAAT ACGGCCAATG GGCAGACCAT GCTCGTTGAC
ACCAGTTCAA AACGAAAAAT ATACAAAGGA CTGGCTATCG ATAAAGTCGG CCAGCAACTT
ACCTGGATGG CTTCCGCCGA TAGCGCCGGT AGTGATGTAA AAGTATTCTC GCTCTATTAT
AAGAACCTAG CGCCAGTGGT CGCTGCCAAA GGCAAAAAGT CGAAAACAGC AGCCCCCGAA
GCACCTTTCA AAGTAATTGC CGATACATTG ACGGCAGCCT TACCCAAACG GTGGTCGGTG
AATGAGTATC GGGAGCCAAA ATTCTCGGAC GATGGCAAAC GGCTTTATTT CTCCACGTCA
CCCATTGCGC CCAAACCAAC CAAAGACACC CTGACACCCG AGGATGAAAA AGTTAAAATG
GATATATGGA CCTGGACCGA CACCCGGTTG CAGCCCATGC AGCAACGGCG GCTCAAGGAA
GAAAAGGAAC GTGGCTTTCT GACGGTGTAT GACATGGCAA GCGGAAAAGT TACGCCCCTG
GCCAGCCGGG AAGCCCCAAC GGTGAGCTTT GATGCCAAGA CAAACTCTCG CTACTTGTTG
GGCCTGAGTG ATCTGCCCTA TCAGGTTCAG GCCTCCTGGG ACCCCGGCCA TACCGACCTG
TATCTGATCG ACACGCAAAC GGGCGAAAAG AAGCGAATTG CCAATGATAT TATGGCGTCG
CAACCCAAAT TATCGCCCGA AGGCAAGTAC GCTTACTGGT TCGATGAGCG CGACTCGCTC
TGGCGGGCGT GGTCGATAGC CGATAGTCGA CGGCTGGATC TGACCCGTGG CTTGCCAAGC
AAGTTTTTCG ATGAAGAACA CGACACACCT AATTTACCCG GTGCTTATGG CTCGGCTGGC
TGGACAACCG GCGATCGTTA CCTGTGGCTT TACGACCGCT ACGATATCTG GCAGTTTGAC
CCAACCGGTA AGGAAAAGCC AACCAATCTG ACGGCTGGCT GGGGGCGCAA AAATAAAATT
CGCCTGCGCC GGGCTGAAGT AGAAAACGAT GACGATACCC CCGGCCGACG GTCGGGAGCG
GCTGAAAAAG CCATCGACCC CAAAGCTGAA CTATTTCTGA CGGGTATCTG GGAAAGCGAT
AAAGCGACCG GCATTTTAAA GAGTAAGAAC GGTCTGGCCG TTGCTGAGCC CACTGTATTG
ACCCGAAGCA ACCACCGTTA TTTTGGGCTG AACAAAGCAA AAAATGCCTC CGTTATGACG
TTTTATCGGG GTAATTACAA AGAGTCGACC AACCTTTTCC TGACGGATAC GACCCTTGCG
GCTCCAGTTC AGCTGACCCG AACGAATCCC CAACAGGACA GCGTTCGGTG GGGAAGTGCC
GAAATTGTGA GCTGGATTGG TACAAACGGT ATCAAGCTGG AAGGCTTGCT CTTTAAACCG
GAGGGCTTCG ACCCGACCAA GAAATACCCC ATGCTGACGT ATTTCTACGA GCGCAACGCC
GAAACCCTGA ACGACTACCG GGCTCCTTCG CCAAGCCGGT CGACCATCAA TATTCCGTAC
TGCGTTTCGA ACGGGTATCT GGTTTTCGTA CCGGACATCG TGTACACAAC CGGGCAGCCG
GGCCCCAATG CCTACGACTG TGTTGTTCCC GGCGTGTTGA GCCTGCTCGA CAAAGGCTTC
GTAGATCGTG ACCGGCTGGG TATTCAGGGA CAAAGCTGGG GTGGTTACCA AACCGCCTAC
ATTATTACCC GGACCAATCT TTTCCGGGCG GCAGAAGCCG GTGCGCCAGT GGCAAACATG
ACATCAGCTT ACGGCGGTAT TCGGTGGGAA ACGGGCATCA GCCGGGCGTT CCAGTACGAA
AAAACACAGA GCCGCATTGG TGGAACCCTG TGGGACAAAC CAATGAACTA CATCGAAAAC
TCGCCACTTT TCTTCGCCAA CCGAATTGAA ACGCCCCTGA TGATGACCCA TAACGACGCC
GACGGAGCCG TTCCCTGGTA CCAGGGTATC GAAATGTTCT CGGCACTGCG TCGGTTGAAC
AAGCCGGTCT GGATGCTGGT CTATAACGGG GAAGGCCATA ACCTGACCCA ACGGCATAAC
GCCAAGGACC TGAGTATTCG GTTATACCAG TATTTCGATT ACTACCTGAA AGATGCCCCC
ATGCCGGTCT GGATGAAAGA AGGCCGGAGC GCGGTAGAGA AAGACCGGGG TGAAATGAAG
TACTGA
 
Protein sequence
MMTFSYFSLK HKPCSFMRHF CQLILYFFLA FPLLAQSQKR ALTAADYDRW QSVRSEKISD 
DGQWIAYQID PQEGDGRLEV VSSTANTNRK QYVFPRGYMA QFTPDSKFLV MRLKAPYMAT
RTAKLKKKKA DELPKDSLLV LNLSTGKPTK LPNAKSFVFG KDAGSWLAVV QERKDERAAP
ASKSASLKDT LTPSAPVSTT TIATRKGPVK KPKGDDLTLL NMADGTRKTV RYVSNVVMSD
NGQAIFYSKE SANDSLKTGE SSVPGVFIFN TANGQTMLVD TSSKRKIYKG LAIDKVGQQL
TWMASADSAG SDVKVFSLYY KNLAPVVAAK GKKSKTAAPE APFKVIADTL TAALPKRWSV
NEYREPKFSD DGKRLYFSTS PIAPKPTKDT LTPEDEKVKM DIWTWTDTRL QPMQQRRLKE
EKERGFLTVY DMASGKVTPL ASREAPTVSF DAKTNSRYLL GLSDLPYQVQ ASWDPGHTDL
YLIDTQTGEK KRIANDIMAS QPKLSPEGKY AYWFDERDSL WRAWSIADSR RLDLTRGLPS
KFFDEEHDTP NLPGAYGSAG WTTGDRYLWL YDRYDIWQFD PTGKEKPTNL TAGWGRKNKI
RLRRAEVEND DDTPGRRSGA AEKAIDPKAE LFLTGIWESD KATGILKSKN GLAVAEPTVL
TRSNHRYFGL NKAKNASVMT FYRGNYKEST NLFLTDTTLA APVQLTRTNP QQDSVRWGSA
EIVSWIGTNG IKLEGLLFKP EGFDPTKKYP MLTYFYERNA ETLNDYRAPS PSRSTINIPY
CVSNGYLVFV PDIVYTTGQP GPNAYDCVVP GVLSLLDKGF VDRDRLGIQG QSWGGYQTAY
IITRTNLFRA AEAGAPVANM TSAYGGIRWE TGISRAFQYE KTQSRIGGTL WDKPMNYIEN
SPLFFANRIE TPLMMTHNDA DGAVPWYQGI EMFSALRRLN KPVWMLVYNG EGHNLTQRHN
AKDLSIRLYQ YFDYYLKDAP MPVWMKEGRS AVEKDRGEMK Y