Gene Slin_5989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5989 
Symbol 
ID8729770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7258324 
End bp7259931 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content50% 
IMG OID 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003390750 
Protein GI284040820 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.852027 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAC TCCTTATCAG CGCATCCGTT TGTCTGCTTT CTATGGCAGC ATCGGCGCAG 
GAAACGCAGT GGTATCTGCG TGATAAAACC GATAATACAG CCGGTATCAG TGTTGAACGC
ACGTACCGTG AACTTCTTAA GGATCGTAAA CCTACACCCG TCATCGTTGC CGTCATTGAC
GGTGGAATTG ATACTACCCA TGAAGATTTG CGTCGGGTAC TCTGGGTAAA TCCTAAGGAA
ATAGCCGGAA ATGGGAAAGA CGATGATAAA AACGGCTATG TCGATGATGT GCATGGCTGG
AACTTTATCG GTGGGAAAGA CGGCCGAAAT GTCGATTTTG AAACCGCCGA GGTTACCCGT
CTTTACGCAC AGCTGAAACC AAAATACGAG GGTAAAGACC GCAAAGCGTT AAAGCCGGAT
CAGCAGAAAG AGTACGATCT GTACGTAAAG ACCAAAGCTG AGGTCGAGAA AAATCAGACT
AAGTACAAAA CGGAATATCA GGGAATCAGC CAGTTTTACA AGCAGTATTC GGAGGCTGTG
ACTACCCTTA AGAAAGCCCT CAACGTATCT AAACTGGATA CGACTACCCT GAGTAAGGCG
GCTGATACCT TAACCGACGC TGCGCTGAAA CGTCCCGTTA TGGGCATACT TCGGTTACTG
CGCCAGCAGA ACGCACCGAA CACCGACGTG GTGATGGGTG AGCTGGAAAA ATACAATGAT
CAGCTCAAGT CGCGCGCCGA GTACAACTAC AATCCTGAAT TCAACAGCCG CACTATTGTA
GGCGATAATC CGGACGATAT GACTCAGCGG GATTACGGTA ACCCCGACAT TGCCGGGCCA
CGTCCTGACC ACGGTACGCA CGTAGCCGGT ATTATAGGTG CTGACCGTAC CAACAATCTG
GGTATTATGG GAATTGCCGA TGCGGTTCAG ATAATGGGCG TTCGGGCTGT GCCCGACGGC
GATGAGCGCG ATAAGGACGT AGCCAATGCT ATCCGGTATG CCGTCGATAA CGGAGCGAAA
ATCATCAACA TGAGCTTTGG CAAAGATTAT TCGCCCCAGC GCAAAACTGT TGAAGATGCC
GAACGCTATG CGTTATCGAA AGGGGTATTA ATGATTCATG CGGCTGGTAA CGACGGAAAA
GATATCGATA CCGCAGCCAA TTACCCTGCT CCCCGGTTTA TGGATGGGTC GGCCATTCCG
AACGTGATTA CGGTGGGTGC CAGCGCCGAG CCGAACACCG CCGATCTGGT GGCCAGTTTC
TCAAACTATG GCAAGCAGAA TGTTGATGTG TTCGCTCCGG GCAAAGATAT TTATTCGACT
GTGCCGGGTA GTAAGTACGA AAACAACAGC GGAACCAGCA TGGCCTCGCC CGTAGTGGCT
GGCGTGGCGG CTGTCCTGAA ATCGTACTTC CCGAAACTGA CTTACGCCGA TATTAAACGG
ATTATTCTGG AATCGGCAAC GCCTTACAAA ACCAAAGTAA CAAAACCCGA ATCGACGGAT
ACCGTTGACT TCTCGTCATT ATCGAAAACG GGTGGCGTTG TTAACCTGTA TGATGCTGTG
AAGTTAGCCC TGGCGCAGGA TGCGGCTTCT TCAGGCAAAG GAAAATAA
 
Protein sequence
MKKLLISASV CLLSMAASAQ ETQWYLRDKT DNTAGISVER TYRELLKDRK PTPVIVAVID 
GGIDTTHEDL RRVLWVNPKE IAGNGKDDDK NGYVDDVHGW NFIGGKDGRN VDFETAEVTR
LYAQLKPKYE GKDRKALKPD QQKEYDLYVK TKAEVEKNQT KYKTEYQGIS QFYKQYSEAV
TTLKKALNVS KLDTTTLSKA ADTLTDAALK RPVMGILRLL RQQNAPNTDV VMGELEKYND
QLKSRAEYNY NPEFNSRTIV GDNPDDMTQR DYGNPDIAGP RPDHGTHVAG IIGADRTNNL
GIMGIADAVQ IMGVRAVPDG DERDKDVANA IRYAVDNGAK IINMSFGKDY SPQRKTVEDA
ERYALSKGVL MIHAAGNDGK DIDTAANYPA PRFMDGSAIP NVITVGASAE PNTADLVASF
SNYGKQNVDV FAPGKDIYST VPGSKYENNS GTSMASPVVA GVAAVLKSYF PKLTYADIKR
IILESATPYK TKVTKPESTD TVDFSSLSKT GGVVNLYDAV KLALAQDAAS SGKGK