Gene Slin_5713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5713 
Symbol 
ID8729488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6949021 
End bp6952314 
Gene Length3294 bp 
Protein Length1097 aa 
Translation table11 
GC content58% 
IMG OID 
Productpeptidase S41 
Protein accessionYP_003390477 
Protein GI284040547 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0183414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATACTT CTACAAAAAT TCTTCTCCGA TCTGCCCTGC TGATTTCCCT TGCGGCACCC 
GCTATAGCGC AAACACCACC GGCCACGCCA ACGCCAAAAT TCGCTTATGC TGAACCCAGC
CTTTCGCCCG ATGGGGCCGA AATAGCGTTC GTCTCCGGGG GCGATATCTG GACGGTACCC
GCTGCGGGTG GTGAAGCCCG GCTGCTGGTT TCGCATCCCG ATAATGAGTC CCGCCCGCTG
TATTCGCCAG ACGGCAAGTA TCTGGCGTTC GTATCCTCCC GCACCGGCAA CGGCGACGTG
TATCTGCTAA CGCTGGAAAC GGGTACCACC AAACGTCTGA CCTACGACGA CGGGGCCGAA
GTGCTCAACG CCTGGTCGCG GGATGGCAAG ATGATCTATT TCCAGTCGAC GGCCAAGGAT
ATTTCGGGTA TGAACGACAT CTACCGGGTG TCGGTCAACG GTGGTACGCC CATGGCCATT
ACCGCCGACC GCTATGCCAA TGAATTTTAC AGCGCTCCCT CGCCCGATGG CAAAACGCTG
GCTTTCTCGG CGCGGGGCAT TGCCTCCAAC CAGTGGTGGC GCAAAGGCCA TAGCCACCTC
GACGAATCTG AAATCTGGCT GCTCCATACC GATGCTAAAA AAGCGGCCAC GGCCTACGAA
CGCCTGACCG AAGGCGGAGC CAAAGAACTC TGGCCGATGT GGAGCGCCGA CGGGAAGGCC
CTCTTTTACG TGTCGGACCG TACCCGGGGA CAAAACCTGT GGACGCTCCC GCTGGGCGGC
AAACCTACTA TGCTAACGTC GTTCTCGAAC GGGCGGGTGC TGTGGCCATC CATCAGCTAC
GACGGAAAAA GCATCGTTTT CGAGCGTGAT TTTCAACTCT GGAAATACGA TGTTGCCAGT
AAGCAGACCG CTCCCCTATC GGTTAAGTTG CGGGGTGTTG CCGCCAGCCC GGCGGTTGAT
CGGCTGAAAC TAACGAACCA GTTCCGCGAA CTGGCCCTCT CACCCGATGG CAAGAAAGTG
GCCTTTGTAG CCCATGGCGA AGTGTTTGCC GCTTCTACCA AAGACGGGGG CGATGCAACC
CGGATCAGCA CTAGTGCCGC CAACGAATCA CAGCCAGTCT GGACACCCAA CAGCCGCAGC
CTGGTTTATG TATCCCAGCG CGACGGTGCT GCTCACCTCT ACCGTTACGA CTTTGCCACC
CGCGACGAAA CTCGTCTGAC CGACGGCGCA CTAGACGACA GTTCGCCCGT GTTTTCGCCC
GATGGAAAAT CGCTGGCTTA TGTTCGAAAT GGGCAGGAGT TGCGTGTTTT GGACATCGCT
ACCAAAAAAG ATCGGTTGCT GAAGAAAGCC TACCTCGGTC GTCCGCCGTT TGGCTCGAAT
GGCGCCGTTG TGTGGTCGCC GGATGGCAAA TGGATTGCGT TTGCGTCGTA CGGAGCCAAA
ACCTTCCGGA ATATCTCGGT CATTCCGGCC ATCGGGGGCG AAAGCAAGCC GGTTAGTTTT
CTGGCGAATA CCTTTGGCGG CAACGTAACC TGGAGTCCGG ATGGAAAATA CATTCTGTTC
AGCACCACCC AGCGTACCGA AACGGCGCAG ATTGCCCGCA TCGACCTGGT GCCCCGTTCA
CCCAAATTCC GGGAAGACCA GTTCCGGGAT CTGTTTAACG AAGAGGTGCC GAAAACGCTG
AAACCCGCCC CAACGTCACC CGCCGTCGCC AAAACCACAT TGCGCGACAC GAGCACATCG
GCCTTGGGCG ACACGGCCAA GCTGGGCAAA AGTGGTTCTC CAACAAAGAT CGTTTTCGAA
GACATTCGCC AGCGGTTGAG CCTGTTGCCG GTGGGTGTCG ATGTGGACGC GCATTCGATC
AGCAAGGACG GCAAAACGCT GCTGCTTATT GCTACGGTGG CCGGTCAGCA GAACTTATAC
ACGTACTCGC TCGATGAGTT GGGCCGAGAC CAGAGCGTTG CCCGGCAACT CACGTCCACT
GCTGGCAACA AAAACGGCGC GCAATTCTCA CCCGATGGCA AAGAGATTTA TTACCTCGAA
CAGGGCCGTA TCCAGTCCAT CACGCTCGAC AAACGCGAGC CCAGGCCGCT GGCCGTAACG
GCCGAAATGG ACGTTGATTT CGGCGAGGAG AAAGTGCAGG TTTTCCGCCA GGCGTGGGAC
ATTCAGAATA AAGGCTTTTA CGACTCCACC TTCCACGGGG TAGACTGGAA AGCCATTAAA
GCCGAATACG AACCACTAGC TGCCGGTGCC CGCACCCCCG ACGAACTACG CCGACTGATT
AGCCTGATGC TGGGCGAACT GAATGCATCG CACTCGGGCA TTTCGGGGCC AGCCGGTTCT
GTACAAACGA CGACGGGCCG CATTGGTCTG CGCTTCGACC GGGAGGAATA CGAGACAAAA
GGGAAACTGA AAATTACCGA GATCATTCCG CTGAGTCCGG CCGCGTTGTC GGGAACGATC
AAAGTCGGTG ATTATCTGGT TGGCGTTGAC GACACGAAAA TCACCGCCAC CACCAACCTC
GATCAGTTGC TGGAAAACCA GATCAACCGG CGCATCTCGC TTATGCTAGC CTCAACGCCC
GACGGCACCC CGCGCGAAGT GACCATCCGG CCGGTTAACC TGACGACGGA AAAGGGATTA
CTGTACAAGC AATGGGTGCA ACAACAACGG CAGTACGTGG ATAAAGTAAG TGGCGGTCGG
CTCGGCTACG TGCACATGTA CGACATGTCG GCGGAGTCGC TGGCGCAGTT GAGCCTTGAT
CTGGACGCCG ACAACCACGC TAAAGAAGGG GTAGTGGTCG ATGTGCGGAA CAACAACGGC
GGCTTCGTCA ATGCCTACGC GCTGGATGTG TTTACCCGTA AAGGCTACCT GACCATGACC
TCGCGTGGCC TGCCAGCCGC CCCGGCCCGT ACGCAACTGG GCCAGCGGGC GCTGGAAACG
CCAACCATTC TGGTCACGAA CCAGCACTCG TTATCTGATG CGGAAGATTT CACTGAAGGG
TACCGGACGC TGAAATTGGG CAAGGTCGTG GGCGAACCAA CGGCGGGCTG GATTATTTTC
ACCTCAGCCG CCCAACTCAT CGACGGCTCG AACCTCCGGT TGCCGTTCTC CAAAATCACC
GACAACACCG GCAAGAACAT GGAACTGGCC CCTCGCCCGG TCGACATAGC CGTGTCGCGG
CCCATCGGCG AGAGCTATAC GGACAAAAAT GTGCAGCTGG ATACGGCTGT AGCTGAGCTA
CTGAAGGAAC TGAACGAAGC CAAAACCTCG AAAGTGAGCA GCGGGAAGGA GTAA
 
Protein sequence
MYTSTKILLR SALLISLAAP AIAQTPPATP TPKFAYAEPS LSPDGAEIAF VSGGDIWTVP 
AAGGEARLLV SHPDNESRPL YSPDGKYLAF VSSRTGNGDV YLLTLETGTT KRLTYDDGAE
VLNAWSRDGK MIYFQSTAKD ISGMNDIYRV SVNGGTPMAI TADRYANEFY SAPSPDGKTL
AFSARGIASN QWWRKGHSHL DESEIWLLHT DAKKAATAYE RLTEGGAKEL WPMWSADGKA
LFYVSDRTRG QNLWTLPLGG KPTMLTSFSN GRVLWPSISY DGKSIVFERD FQLWKYDVAS
KQTAPLSVKL RGVAASPAVD RLKLTNQFRE LALSPDGKKV AFVAHGEVFA ASTKDGGDAT
RISTSAANES QPVWTPNSRS LVYVSQRDGA AHLYRYDFAT RDETRLTDGA LDDSSPVFSP
DGKSLAYVRN GQELRVLDIA TKKDRLLKKA YLGRPPFGSN GAVVWSPDGK WIAFASYGAK
TFRNISVIPA IGGESKPVSF LANTFGGNVT WSPDGKYILF STTQRTETAQ IARIDLVPRS
PKFREDQFRD LFNEEVPKTL KPAPTSPAVA KTTLRDTSTS ALGDTAKLGK SGSPTKIVFE
DIRQRLSLLP VGVDVDAHSI SKDGKTLLLI ATVAGQQNLY TYSLDELGRD QSVARQLTST
AGNKNGAQFS PDGKEIYYLE QGRIQSITLD KREPRPLAVT AEMDVDFGEE KVQVFRQAWD
IQNKGFYDST FHGVDWKAIK AEYEPLAAGA RTPDELRRLI SLMLGELNAS HSGISGPAGS
VQTTTGRIGL RFDREEYETK GKLKITEIIP LSPAALSGTI KVGDYLVGVD DTKITATTNL
DQLLENQINR RISLMLASTP DGTPREVTIR PVNLTTEKGL LYKQWVQQQR QYVDKVSGGR
LGYVHMYDMS AESLAQLSLD LDADNHAKEG VVVDVRNNNG GFVNAYALDV FTRKGYLTMT
SRGLPAAPAR TQLGQRALET PTILVTNQHS LSDAEDFTEG YRTLKLGKVV GEPTAGWIIF
TSAAQLIDGS NLRLPFSKIT DNTGKNMELA PRPVDIAVSR PIGESYTDKN VQLDTAVAEL
LKELNEAKTS KVSSGKE