Gene Slin_1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1087 
Symbol 
ID8724817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1327408 
End bp1328808 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content49% 
IMG OID 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003385937 
Protein GI284036007 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.471806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.498219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA CTATTAAAAA CATGGCTGTT CAGGCAGCTG CAGTAACCAT CCTAACGTCT 
ATGTCTATAA TTGGCTGTCA GCCAGTGGAT GTTGACTCAT CAGCAACCAA ATCTTCGTCA
ACTGCCAATG CACGTTTGAG TGCCGGACAG GATTATGTGG ATGGCGAACT TCTTTTACAA
TTTAAAGAAG GCACGAGTGA TGACATCAAG CAAAAAGCGT TCGACAAAGT CAAAGGCATC
CCCGCCGAAA AAATTCTGAC TAAAGCCATG GAGCGGGCCA GCAAAAAGGA AGGCCTTTTG
CTGGTAAAAG TTAACAAGAA CGTTCTCGAA GCTATTGCTG ACCTGTCAGG TGCAGAAGGT
GTTGCCTTCG CCGAACCTAA TTTCATTTAT TCGCACACGG CTGTATCCAC TGATCCTTAT
TTTACCAATG GTTCGCTTTG GGGTATGGAT GCAAAAAATA CATACGGTAG CCAGGCGTCT
ACAGCCTGGG CAGCGGGTAA TACGGGGTCA GCTTCTGTTG TGGTGGGGGT TATTGATGAA
GGCATTCAGT TCGACCACCC GGATTTAGCT GGTCAGGTCT GGACAAATCC ATTCGATCCT
GTCGATGGAA AAGATAATGA CGGCAACGGC TACGTTGATG ATATTCATGG CTGGGACTTT
GACGGCAACA ACAACACCAT CTATGATGGT ACTACCCGGG GTAGCTCCGA TGATCATGGT
ACCCACGTTT CGGGGACTAT CGGCGGCAAA GCCAACAATG GTCAGGGCGT GGTTGGCATG
AACTGGAACA TCACTATTAT TTCCTGTAAG TTCCTCGGCC GCAAAGGTGG AACCACCGCC
AACGCCGTGA AAGCCGTTGA CTACCTGAAT GACTTAAAAA CCCGTCATGG TCTCAATATC
GTAGCCAGTA ACAACTCCTG GGGCGGGGGT GGTTTTTCAC AGGCCCTTTA TGATGCCGTA
AACCGGGCAA ATACTAAAAA TATACTCTTT GTAGCGGCTG CCGGAAATGG CGGAAGCGAT
GGCGTGGGCG ACAATAATGA CGTTGTAGCC AGCTATCCTT CAAATATGGA TCTACCGAAT
GTTATTGCCG TAGCGGCCAT AACGTCAACC GGTGCCCTAT CGTCTTTTTC AAATTATGGT
GCAACCACAG TTGACATAGG GGCACCCGGT CAGGCCATCT GGTCGTCGAC GGCGTACAAT
ATTTATGAAT CGTATAGCGG CACCTCAATG GCTACTCCTC ACGTTACCGG TGCCGTTGCG
CTGTATGCCT CCGTAAAACC GGGTTCAACA GCCGCACAAA TTAAAGCCGC TATCTTAAAC
AGTGCCGTGT CTACCCCCTC TTTAAATGGA AAAACAGTAA CAGGGGGGCG GTTAAACGTC
AATGCAGCGC TGGCCCTGTA A
 
Protein sequence
MKRTIKNMAV QAAAVTILTS MSIIGCQPVD VDSSATKSSS TANARLSAGQ DYVDGELLLQ 
FKEGTSDDIK QKAFDKVKGI PAEKILTKAM ERASKKEGLL LVKVNKNVLE AIADLSGAEG
VAFAEPNFIY SHTAVSTDPY FTNGSLWGMD AKNTYGSQAS TAWAAGNTGS ASVVVGVIDE
GIQFDHPDLA GQVWTNPFDP VDGKDNDGNG YVDDIHGWDF DGNNNTIYDG TTRGSSDDHG
THVSGTIGGK ANNGQGVVGM NWNITIISCK FLGRKGGTTA NAVKAVDYLN DLKTRHGLNI
VASNNSWGGG GFSQALYDAV NRANTKNILF VAAAGNGGSD GVGDNNDVVA SYPSNMDLPN
VIAVAAITST GALSSFSNYG ATTVDIGAPG QAIWSSTAYN IYESYSGTSM ATPHVTGAVA
LYASVKPGST AAQIKAAILN SAVSTPSLNG KTVTGGRLNV NAALAL