Gene Slin_4678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4678 
Symbol 
ID8728442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5699307 
End bp5700530 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content57% 
IMG OID 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003389455 
Protein GI284039525 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.264127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.283017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAATC ACTCCCTGAA AAAAGTTCTA TGGTCGGCGT TAGTTCCGGC GTTACTTGCG 
GCCTGTCAGC CCACGGATAC GTTCCAGCGT CAGCCGGATG ATGTGGTTCG TTATGGTAGT
CAGACAGGGT CGGCGCGGGC CGCTGCCGAT GTGCAGAAGT ACATTGTTAC CTTTAAAGCC
GACCCACTCA TTACCCGCTC GCTGCCCGAC AATGCCGGAG CGTATGATGC CCGGGTACAA
CAGATGCAGG GGCTGATTTC CCGACTGGTA GGAGCCGATA TTGCCGGCAA AACCCAGGAG
GTGTACACAA CGGCGATCCG GGGCTTTGCC GTTGAACTAA CAGCCGCCGA ACTGGCCCGG
CTGCAGCGAC TGCCGTTCAT TGCCAGTATC GTACCCGATC AGGTTGTGTC ACTGGCCGTG
CCAACTGGTA CTGCCATAAC GATAGGAGCG CAGACTATAC CCTGGGGAAT TAGCCGGGTG
GGTGGCGTTC GAACCTATAC GGGTTCACAT AAGGCCTGGG TGCTGGATAC AGGTATTGAT
TTCGATCATC CCGACCTGAA TGTCGACCTG CCGCTGTGCC GTAATTTCAA TAACCCACGT
CGCGATGCCG ACGACGACAA CGGACATGGC TCGCACGTGG CCGGTACCAT TGGCGCTAAA
GACAATAACT TTGGTGTAGT GGGTGTTGCG CCGGGCGTAA AAGTAATCGC CGTGAAGGTA
TTATCGGCGA CGGGGAGTGG TTCTTACTCC GGTGTTATTG CCGGTATCGA CTACGTTGCC
ACGGCCGGTG CAGCGGGCGA TGTAGTGAAC ATGAGTCTGG GCGGACCCGT TTATACGCCG
ATCGATGAAG CGGTAAAAGG AGCAGCCAGC AAAGGAATCC TGTTTGCACT GGCTGCAGGC
AACGAGTCGC AGAATGCCAA TAACTCCTCT CCGGGCCGTA CGGAACACCC CAACGTGTAT
ACGGTTTCGG CCCACGACTA TAACGATAAA TTTGCGTCGT TCTCCAACTA CGGCAATCCG
CCCATCGACT GGTGCGCACC GGGTGTCGAT GTGCTGTCGA CCTGGCGTTC GGGCGGTTAC
CGGACTATCA GTGGTACCTC AATGGCAACG CCCCACGTAG CAGGTATCCT GCTCTACGGC
ACGCCGGCTT CCCGCGGACC AGTATCCGGC GACCGCGATA GCACTCCGGA CCAGATGGCC
AAGCTACCAA CGGTAACGCC CTAA
 
Protein sequence
MFNHSLKKVL WSALVPALLA ACQPTDTFQR QPDDVVRYGS QTGSARAAAD VQKYIVTFKA 
DPLITRSLPD NAGAYDARVQ QMQGLISRLV GADIAGKTQE VYTTAIRGFA VELTAAELAR
LQRLPFIASI VPDQVVSLAV PTGTAITIGA QTIPWGISRV GGVRTYTGSH KAWVLDTGID
FDHPDLNVDL PLCRNFNNPR RDADDDNGHG SHVAGTIGAK DNNFGVVGVA PGVKVIAVKV
LSATGSGSYS GVIAGIDYVA TAGAAGDVVN MSLGGPVYTP IDEAVKGAAS KGILFALAAG
NESQNANNSS PGRTEHPNVY TVSAHDYNDK FASFSNYGNP PIDWCAPGVD VLSTWRSGGY
RTISGTSMAT PHVAGILLYG TPASRGPVSG DRDSTPDQMA KLPTVTP