Gene Slin_4878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4878 
Symbol 
ID8728642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5941292 
End bp5944537 
Gene Length3246 bp 
Protein Length1081 aa 
Translation table11 
GC content50% 
IMG OID 
ProductPKD domain containing protein 
Protein accessionYP_003389655 
Protein GI284039725 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.846856 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCACA TATTTGGTAT GTATCTATCA GTCAGCAAAT TTCTGCCAAC AAGGCTTTTT 
AATTCTATTG TTTGTTTTAC CCTTCTACTG GCATACATCT CCACGAATAG TTTGCTGGTG
GCGCAACAAC TGCCGCCCGG GTTTTCGAAG AGTGTCAGTC AAAGTGGGTA TGTGGGTTCA
GTGGGTATGG TTTTTGCTAA AGATGGGAAT TCATTTTTTG TCTGGGAGAA GAGTGGTCTT
GTGTGGGCAT CAGTATGGAA CGGAACATCC TACAACCGGC AGGAATCGGT AACGCTTGAT
ATTCGGGAAG AAGTAGGCGA GTGGAATGAT TTCGGCTTAC ATAGCATGTG TCTGGATCCA
AATTTTGAGA CGAATGGGTT CATTTATCTC TTCTACGTTG TAGATCTTCA CCACCTCTTG
TATTTTGGTA CATCTCAGTA CAGCAGCACG GCTAATGAAT ACCAGAATGC GACGGTCAGC
CGGGTAACAC GCTATAAACT GAATAAGGTT GGGACGTCAT ATCTGACCGA TTATTCGAGT
CGTACCGTTT TGCTGGGAGA AAGTAAAACT ACCGGAGTTC CGATCACCTT CGAATCGCAT
GCGGGTGGAA CGATTTTGTT CGGTAACGAC GGTTCGTTGC TGGTAGCAAC GGGTGATGGG
GCTCATCACG AGGGGATTGA TGTGGGAAAC GATAGCCGGA CTAACTTTCA AACAGCACTG
AACCTCGGCA TAATGAGGCC CGAAGAGAAT GTAGGCGCGT TGCGGTCGCA GATGCTTAAT
TCTCACTGTG GCAAGGTATT GCGAATTGAC CCTACTACTG GAAATGGCTT GCCGAGCAAT
CCGTTTTATG ACCCGATGAA TCCACGGGCA CCTAAATCGC GCGTGTGGAC CCTGGGGGTG
CGTAACCCCT ACCGAATTTG TATTCAGCCT AATACCGGTA GTACAAACCC CGACGATGGT
AGCCCCGGTA CATTGCTCAT TGGGGATGTA GGCTGGTTTA AGTGGGAGGA TTTTCATGTG
ATAGATAAAG CAGGTTTGAA CTGCGGCTGG CCGGTCTATG AAGGTCTGCT ACCCACTTAC
CTCTACTATG GCACGAATGT CCATAACCTG GATGAACCCG GACAGCCCAC CTTTGAAAGT
CTATGTGTCC AGCCCTCTTC CTTCATTGAT AACCCCGATC CAACCCTCAG GCGGTTTACG
CACTCCCGCC CGGCCATGGA TTATAGTCAT AGCGCCAACA TTACCCGTGT TCCGGCTTTC
AACGGCACAA CGGCAATCGT GCGCGAGCTT GGCACGGTAG GGGCACCGGC TGGTACGCAA
TTTCTGGGCC ATTGTGCTAT AGGGGGAGCC TATTATACCG GAACTCAATT TCCGGCCATG
TATCAGAATA CCTTATTTTT TACCGACTAT GTAGAAGGCT GGATCAAAAG TATAGTGTTG
CACGATGAAG GAGACCACCA TATTCATGAA ATAAAGGACT TTGCATCGCT CGGCTTCGAT
ACCAACATAC TCGATTTAAA GGTGAACCCC CGCGATGGCT CTTTGTATTA TGTCCGGCTG
GATGGTGTAG TATCCCGAAT AAGTTACGGC GGCAATCAGC CTCCCGTAGC CAACGCAACG
GCCAGTGCCA ATTACGGCCT CTCGCCCCTC GTCATTCAAT TTACAGGTTC GAATTCTGTT
GACCCAGAAG GGCAGGCTCT ATCTTACCTT TGGAAGTTTG GCGATGGTAC CACATCCACC
AGCGCCAATC CTGTTAAAAC GTTTACGGCT GTCAGTACCC AAATGTACAC TGTAACACTG
GTCGTGACCG ACAATGAGCA GTTAACCAGC AGCCAGGAGG TCATCATCTC AGTAAACAAC
ACACCGCCAG CTGTTGAGAT AGTTACGCCC GCTAGTGGAA CGCTCTATCG AATGGATCAG
GCCACAACGT ATACCCTACA GGCCGCCGTT ACGGATACCG ATACGGCCGG TATGCAATAT
GCATGGCAGG TCACCTTACG GCACAATAGC CATACACATC CTGAACCTAT TCTCTATGAA
CGAACGCCAA CGGTTACCAT TACGCCCGCA GGCTGTAATC CTAACGAGAC ATTTTATTAC
GTCATTATTA TTAACGCAAC GGATAATGGC GGGTTGACAG CTACTCAGTC GCTCACCCTG
AACCCCGATT GTAGTTCGGC GAACGTAGCC GTAACTAACC TTCAGACCAC CTCTAAGTTA
AATTCAGTAC TCGTTAGCTG GATTAATCCT AACGTAACAT TTGATGAGGT CATGGTTGTG
GCTAAAGAAG CAACGGGCTT TCGGGGATCA CCAAGTGGGA CATCGTATAC CGCTAAGGCT
AGCTTTACCA GCGATGGGAC TGCTTTTGAA GCAGGTAAAG TGGTATACCG TGGCCAGAGT
AATTCGGTCA CAGTCACAAA TCTTGATCCA TTGAAGCAAT ATTACTTTCG GGTGTATACC
CGGGTCGGTA ATGTCTGGAA TGCAGGTGTT CAGGGGACGG CCACGCCCAA TCTGCCCCCC
ATAGCACCGG TCGTGGTGCC ACCCGCTGCT GAACTGTATA CGCTGTACTC GTATACAGTT
CCGGTGTTTA CGGACCCCGA AAATCAGCCA TTAACCGCTA CTACATCTCT TCCAGACTGG
TTAACCTACG ATGCGGATAC GGGTGTCTTA ACCGGTGTAC CAGTTGTGGC TGGTAGTTAC
ACGCTTACAA TCGGCGTGAC AGATCCCGGA AATCTAACCG CACGTGTTGT CATGGTCGTT
GTAGCCGGGC CTAATCAGCC GCCCGTTCCG CCGGTTGTGG GTGAACAGTT CGCCCAAATA
GGTCGACCGT TTAGCTTTAC AGTGCCTGCT TTCACCGATC CTGAGGGAAA AGCGCTGGCG
TATGCTTCGG GCGAGTTGCC TTACTGGTTA AGTTTCGATA CGAATACCCG TGTGATGAGC
GGCACACCCA CGCAGACTAA TAGCTATTCT GTCACCATAC ACGCCACAGA TCCACAAGGG
CTGACTGCCT CCGTTCGGGT TGTCATCAAT GCAGGCATCT GTACAATGGC CACCGTAAAG
CAGGGTAACT GGAATGACCC TACGGTATGG TATTGTCAGC GTATTCCGAC TGGTGCTGAG
ACGGTCTACA TCAACCATGC TGTAACCGTA CCGACTGGCT ATGATGCCTA TGCCAAAAGC
GTCGTTTATG CAGCCTCCGG CAGTCTGGCT TTTAGCGAGA ATGCCAGACT GAATGTCAAT
CCATGA
 
Protein sequence
MNHIFGMYLS VSKFLPTRLF NSIVCFTLLL AYISTNSLLV AQQLPPGFSK SVSQSGYVGS 
VGMVFAKDGN SFFVWEKSGL VWASVWNGTS YNRQESVTLD IREEVGEWND FGLHSMCLDP
NFETNGFIYL FYVVDLHHLL YFGTSQYSST ANEYQNATVS RVTRYKLNKV GTSYLTDYSS
RTVLLGESKT TGVPITFESH AGGTILFGND GSLLVATGDG AHHEGIDVGN DSRTNFQTAL
NLGIMRPEEN VGALRSQMLN SHCGKVLRID PTTGNGLPSN PFYDPMNPRA PKSRVWTLGV
RNPYRICIQP NTGSTNPDDG SPGTLLIGDV GWFKWEDFHV IDKAGLNCGW PVYEGLLPTY
LYYGTNVHNL DEPGQPTFES LCVQPSSFID NPDPTLRRFT HSRPAMDYSH SANITRVPAF
NGTTAIVREL GTVGAPAGTQ FLGHCAIGGA YYTGTQFPAM YQNTLFFTDY VEGWIKSIVL
HDEGDHHIHE IKDFASLGFD TNILDLKVNP RDGSLYYVRL DGVVSRISYG GNQPPVANAT
ASANYGLSPL VIQFTGSNSV DPEGQALSYL WKFGDGTTST SANPVKTFTA VSTQMYTVTL
VVTDNEQLTS SQEVIISVNN TPPAVEIVTP ASGTLYRMDQ ATTYTLQAAV TDTDTAGMQY
AWQVTLRHNS HTHPEPILYE RTPTVTITPA GCNPNETFYY VIIINATDNG GLTATQSLTL
NPDCSSANVA VTNLQTTSKL NSVLVSWINP NVTFDEVMVV AKEATGFRGS PSGTSYTAKA
SFTSDGTAFE AGKVVYRGQS NSVTVTNLDP LKQYYFRVYT RVGNVWNAGV QGTATPNLPP
IAPVVVPPAA ELYTLYSYTV PVFTDPENQP LTATTSLPDW LTYDADTGVL TGVPVVAGSY
TLTIGVTDPG NLTARVVMVV VAGPNQPPVP PVVGEQFAQI GRPFSFTVPA FTDPEGKALA
YASGELPYWL SFDTNTRVMS GTPTQTNSYS VTIHATDPQG LTASVRVVIN AGICTMATVK
QGNWNDPTVW YCQRIPTGAE TVYINHAVTV PTGYDAYAKS VVYAASGSLA FSENARLNVN
P