Gene Slin_5257 details

Gene Information

Locus tag: Slin_5257
Symbol:
ID: 8729023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6411747
End bp: 6413852
Gene Length: 2106 bp
Protein Length: 701 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: Prolyl oligopeptidase
Protein accession: YP_003390027
Protein GI: 284040097
COG category:
COG ID:
TIGRFAM ID:
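
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6411747
END_BP = 6413852
GENE_LENGTH_BP = 2106
PROTEIN_LENGTH_AA = 701


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 2106
print(coding_length_aa(GENE_LENGTH_BP))  # 701
```

Both results agree with the Gene Length and Protein Length fields of the record.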


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAGTT CGTCTGTCAT TCTAGCCCAG ACCACCGGCC CGCTTCCGTA CCCGGTCGCA 
AAAAAAACGG ATCAGGTCGA TACCTATCAT TCGACAACGG TTGCCGACCC CTACCGCTGG
CTCGAAGATG ACCGCTCGGC CGAAACAGCC GCCTGGGTCA AAGCCGAAAA CCAGGTTACG
TTCGACTACC TCTCGCAGAT TCCGTACCGC AAGCAGTTTC AGGATCGGCT TGAGCAGGTA
TATAATTACC CCAAATACTC GGCCCCAAAT CGTAAGGGGG ACTGGTTTTA CTTCTCAAAA
AATGACGGCT TACAAAATCA GGCCGTACTT TACCGGCAGA AGGGCCTCGA TGCCAAACCG
GAACTTGTCA TTGACCCCAA CAAACTTTCA GCCGATGGTA CCACCCGGCT TGGCGTTTTT
TCGCTTTCTA AAGATGGCAA ATACGCAGTT GTAGGCTTGT CGAAAGGCGG TTCTGACTGG
CAGGAATATC AGGTGATGGA ACTGGCGACC AAGACATATC TGCCCGATAA AATCGAGTGG
GTTAAGGTTT CCGGGGCAGC CTGGCAGGGC GACGGCTTCT ACTACAGCCG CTACCCAAAA
CCCGAAGGCA GCGCACTGGC CGCTAAAAAC GAGAACCACC AGGTTTATTT CCATAAGCTC
AACACCCCGC AATCGGCCGA CCGGCTGGTG TACGAAGACG CCAAAAACCC ACAGCGGTTT
CACACCGTCA GTACAACCGA CGATGAGCGG TTTGCGCTGC TGTCTGTCAG CGACCGAGGC
AACGGAAAAG ATGGCAACTC ACTGTTTTTT CTCGATGCCA AATCGGCGGT GAAGACGTTC
GCTCCCGTGG TGGCCGAGGT TACGAATTTC AGCTACGGCG TTGTCGATAA TGACGGTGAC
CGCCTGCTGA TCCTGACCAA CGAAAAAGCA CCGAACAGCA AAGTCATTGC CTTCGACACC
AAAAAGCGGA CGTTTTCGAC GCTCATCCCC GAAAAACCCG AGCCTATTGC CGAGAACAGC
GTTAGCGCGG CTGGTGGTAA ATTATTCGTT GAATACGCAA AAGACGTGAC CTCTAAAGTT
GCCGTATACG ACTACAGCGG CAAGTATGAG ACGGAGGTTC AGCTACCCGG CATTGGGTCA
TCGAGTGGGT TTGGGGGCGA AAAAGACGAT AAATTCGTTT TCTATTCGTT TACGTCCTTC
ACCTTCCCGC CTACCATCTA CCGCTACGAC ATCGCCAGCC GGAAAAGTAC CGTATTCCGC
GCCCCTGAAG TCGATTTCAA GCCGACCGAT TACGAAACCA AACAGGTCTT TTACACCAGT
AAAGACGGCA CCAAAGTTCC CATGTTTCTG ACGTACCGGA AGGGCCTGAA ACTGGATGGC
ACCAACCCAA CGCTGCTGTA CGGCTATGGT GGTTTCAATA TCAGCTTACC GCCCGCGTTC
AGTCCGTTCC GGATTCCGTT TCTGGAACAG GGCGGTGTGT ATGCACAGGC CAACTTACGG
GGCGGCAGCG AATACGGTGA GAAGTGGCAC GAGCAGGGGA TGAAGCACAA AAAACAGAAC
GTTTTCGACG ATTTCATTGC CGCAGCCGAA TACCTGATTG CCCAGCAGTA CACCAGCCCG
GCTAAACTGG CCATTCAGGG CGGTTCGAAC GGGGGCTTGC TCGTGGGGGC AGTGATGAAC
CAGCGGCCCG AACTGTTCCG GGTAGCTATT CCGCAGGTTG GTGTCATGGA CATGCTGCGA
TTCCATAAGT TCACCATCGG CTGGAACTGG ATTGCCGATT ACGGCAGCAG CGACAACGCG
GAGGAGTTCA AGGCGCTGTA TGCCTACTCG CCCCTGCACA ACATCAAGCC CGATATCAAG
TACCCCGCTA CGCTCATCAC CACCGCCGAT CATGACGACC GGGTGGTACC GGCTCACTCG
TTCAAGTATG CAGCCACCTT ACAGGCAACT TACAAAGGGC CGAATCCGGT ATTGATTCGA
ATCGACACGA ACTCGGGGCA CGGCGCCAGC AACACGAAGA AGAACATCGA AACAACGGCC
GACATTTACT CCTTCATTCT CTGGAATATG GGCGTAAAAA CCTTAAAAGA GATCGCCAGC
AAGTAG
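
A GC content figure like the one reported for this gene can be recomputed from a sequence printed in 10-base blocks, as above. This is a minimal sketch, not necessarily how the source database derived its value; `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small illustrative input; paste the full gene sequence to check the record.
print(gc_percent("ATGC"))  # 50.0
```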
 
Protein sequence
MLSSSVILAQ TTGPLPYPVA KKTDQVDTYH STTVADPYRW LEDDRSAETA AWVKAENQVT 
FDYLSQIPYR KQFQDRLEQV YNYPKYSAPN RKGDWFYFSK NDGLQNQAVL YRQKGLDAKP
ELVIDPNKLS ADGTTRLGVF SLSKDGKYAV VGLSKGGSDW QEYQVMELAT KTYLPDKIEW
VKVSGAAWQG DGFYYSRYPK PEGSALAAKN ENHQVYFHKL NTPQSADRLV YEDAKNPQRF
HTVSTTDDER FALLSVSDRG NGKDGNSLFF LDAKSAVKTF APVVAEVTNF SYGVVDNDGD
RLLILTNEKA PNSKVIAFDT KKRTFSTLIP EKPEPIAENS VSAAGGKLFV EYAKDVTSKV
AVYDYSGKYE TEVQLPGIGS SSGFGGEKDD KFVFYSFTSF TFPPTIYRYD IASRKSTVFR
APEVDFKPTD YETKQVFYTS KDGTKVPMFL TYRKGLKLDG TNPTLLYGYG GFNISLPPAF
SPFRIPFLEQ GGVYAQANLR GGSEYGEKWH EQGMKHKKQN VFDDFIAAAE YLIAQQYTSP
AKLAIQGGSN GGLLVGAVMN QRPELFRVAI PQVGVMDMLR FHKFTIGWNW IADYGSSDNA
EEFKALYAYS PLHNIKPDIK YPATLITTAD HDDRVVPAHS FKYAATLQAT YKGPNPVLIR
IDTNSGHGAS NTKKNIETTA DIYSFILWNM GVKTLKEIAS K
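
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes that the sense-codon assignments of translation table 11 match the standard genetic code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). `CODON_TABLE` and `translate` are illustrative names, not a library API.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGTTGAGTT CGTCTGTCAT TCTAGCCCAG ACCACCGGCC CGCTTCCGTA CCCGGTCGCA"
print(translate(first_line))  # MLSSSVILAQTTGPLPYPVA
```

The output matches the first 20 residues of the protein sequence listed above; the final six bases of the gene (AAGTAG) translate to a lysine followed by a stop codon, in line with the terminal K of the protein.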