Gene Slin_1710 details

Gene Information

Locus tag: Slin_1710
Symbol:
ID: 8725447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2047203
End bp: 2048393
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003386555
Protein GI: 284036625
COG category:
COG ID:
TIGRFAM ID:
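
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the record; the coordinate convention
# (1-based, inclusive) is an assumption, not stated in the record.
START_BP = 2047203
END_BP = 2048393


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end positions."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1191 396
```

Under these assumptions the result agrees with the Gene Length (1191 bp) and Protein Length (396 aa) fields.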


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACGTT TTTTATCCCT AGCGACACTT ACTCATGTCA CCCTGCTTGG CGTTGGGCTG 
TTTGCCTGCC ACAACGATAG TAGTGTAGCC CCTACCATCT CGCCCGACTG TCTGGTAAAA
GCATCCTCCA ATAATGGGGT TGCCATTGCG GGTGAATATA TCGTTACCTA TCAGCCCACA
CAAACGCTAC CGGTGGCACC CAATGCCCGC GTTGCTGCTA CGGAAGCGCT GGCCGAATCA
CTGCTCAAAA CCTACAACGT AGCCAATCAC CAGACGGCTG TTTTAGCTGC GGGCGAACAG
ACAACCTTCC TGGCCCACCT CACCGAAAGT GAATCCCAAA AACTACGGCA GGACCCCTCT
GTGATGGTCA TTGAACCCGA CCGAATCATG GCCATGTGCA ATTGCGTCGA CGTAGCGATC
ACCTCTACGC TAACCTGGGA TGTGAAACAA ACGGGCTACG GTCGCGGTGA TTTGCAAACG
ACCAAAACAG CCTGGATTAT TGATACGGGC ATTGACCTCG ACCACCCTGA TCTGAATGTA
GATACCAACC GCAGCCGATC GTTTGTCAGC GGGCAAACCT CCGCTGATGA TGACAATGGG
CACGGCACCC ACGTGGCCGG TGTCATTGGG GCGAAGAATA ATAACATCGG CATAACGGGT
GTTGCTTCCG GGGCTACCTT AGTAGCGCTC CGGGTTCTGG ATGATGAAGG AGAAGGCCGC
TTGTCAGGCA TTATTCAGGC CGTAAATTAT GTAGCTCAGA ACGGCAAGGC CGGTGATGTA
GTCAATCTGA GTCTGGGTGG CGAGGGCACA TCAGCCGCGC TCGACCGGGC CATTACCCAG
GCCGCTAATT TGGGCATCTT GTTTGCCATT GCTTCGGGCA ACGATGGGAA GAACAGCGAC
AATTATTCAC CCGCCCGGGT CAACCACGCG AATGTTTTCA CGGTATCGGC CATGGACAGT
AAAAACCAGT TTGCGTCATT TTCCAACTTT GGCAATAGTG TCGATGTGTG TGCCTATGGG
GTTCGCATTA CCTCCACGTA TAAGGACGGG AAATATGCAA CCCTGAGTGG AACATCTATG
GCCGCACCGC ACGTAGCGGG TTTATTGCTG ATTCGGGGAA GCAAACTGCC CACCCACGGC
ACCGTTACCG GCGACCCGGA TGGTAAACCA GACCCCATGG CAGGCGAATA A
 
Protein sequence
MRRFLSLATL THVTLLGVGL FACHNDSSVA PTISPDCLVK ASSNNGVAIA GEYIVTYQPT 
QTLPVAPNAR VAATEALAES LLKTYNVANH QTAVLAAGEQ TTFLAHLTES ESQKLRQDPS
VMVIEPDRIM AMCNCVDVAI TSTLTWDVKQ TGYGRGDLQT TKTAWIIDTG IDLDHPDLNV
DTNRSRSFVS GQTSADDDNG HGTHVAGVIG AKNNNIGITG VASGATLVAL RVLDDEGEGR
LSGIIQAVNY VAQNGKAGDV VNLSLGGEGT SAALDRAITQ AANLGILFAI ASGNDGKNSD
NYSPARVNHA NVFTVSAMDS KNQFASFSNF GNSVDVCAYG VRITSTYKDG KYATLSGTSM
AAPHVAGLLL IRGSKLPTHG TVTGDPDGKP DPMAGE
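
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method: it strips the layout whitespace, computes GC content, and translates a coding sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation; the alternative start codons that table 11 permits are not special-cased here, which is harmless for this gene because it begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, first three blocks only.
prefix = clean_sequence("ATGCGACGTT TTTTATCCCT AGCGACACTT")
print(translate(prefix))  # MRRFLSLATL
```

The translated prefix matches the first ten residues of the protein sequence. Applying `gc_percent` and `translate` to the full cleaned gene sequence would give values to compare against the GC content and Protein sequence fields.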