Gene Slin_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3034 
Symbol 
ID8726786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3675477 
End bp3678866 
Gene Length3390 bp 
Protein Length1129 aa 
Translation table11 
GC content49% 
IMG OID 
ProductASPIC/UnbV domain protein 
Protein accessionYP_003387844 
Protein GI284037914 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.721984 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGCC AATTCCCTCA AAGTACCAAT AGAGACCTGA TTGATCGTTT ACGGTATTCT 
GTTTGCTTCC CGTTTCTGAT TATATGCCTG TGTTGCCCGG TAGTGTATGC GCAGAATCCG
CTCTTCCAAC TCCTTGCTCC CAAGCAGACC CATATTGATT TTAAGAACGA TATTGATGAG
AACGAAAGCC TGAACGTGCT TTCCTACGAA TATTTCTACA ACGGTGGGGG CGTGGCCGTT
GGCGATATTA ACAATGACGG TTTACTGGAT TTGTTCTTCA CTGCTAATCT GAAAGCCAAT
AAATTATACC TTAATCTGGG TAAATTAACC TTTAAAGACA TTACCAGTGA GGCCGGTGCT
CAGTTAGGCG GTAGAGCGGG CGGCTGGAAA ACGGGCGTTA GTATGGCCGA TGTCAATGGC
GACGGCTGGC TGGATATTTA CGTTTGCTAC TCAGGCAAAG GAGACGAAAG CAAGCGCAGA
AATCAGCTGT TTATCAACCA GGGGGCTGGC CCTAAAGGCA TGGTTCGGTT TGTGGAACAG
GCAAAGGAAT ACGGTGTCGA CGATAACGGC TATAATACGC AGGCTACGTT CTTCGACTAT
GATCGGGATG GCGACCTCGA CCTGTTTCTA CTGCATCACA ATGTCAAGAA ATACGATAAC
ATGGAACTGG CCAAACTACA CGGTGAGACC GACCTGCTAG CTGGCAATAA ACTGCTCGAA
AATAGAAACG GTCACTTTGT CGATGTGTCC CAGAAAGAAG GGATCCATCA ATATCCGTTA
ACGTTTGGTC TGGGAATGGC CGTAGCAGAT GTTAATAAAG ATGGGTGGCC GGACATCTAC
GTGACGAACG ATTATAACGA GCCAGATTAC CTCTACATAA ATCAGAAAGG GATTGACCTA
GCCGCACGGC GGTCGGGTGA GCCGGCTTTT AAAGACGAAA CCCAGTCCTA CTTCCGGCAT
CTGGCGCAGT TCTCGATGGG CGTAGATATT GCCGATTATA ACAATGACGG TCTGCCCGAC
ATCATGTCGC TGGATATGCT ACCGGAAGAT AACCGGCGCC AGAAACTGTT GCAGCTTCAG
GAAAACTACG AGTCGTTTGA GTTGATGCAG CAGCAGAAAC TACAGCGGCA GTATATGCGG
AACATGCTGC AACTCAACAA CGGCGATGGT ACATTCAGTG AAATAGCGCA AACGGCGGGC
GTGTCGAATA CCGATTGGAG TTGGTCGCCT TTGCTGGCCG ATTTTGACAA CGACGGTTAT
AAGGACCTGT TTATTACCAA CGGTTACCTG CGTGATTATA CGAACAAAGA CTTCCTTAAA
TACTGGGGCG ACTATAAAAT CAAAAAAGCA ATTGACCGGG AGCCCGTGCA ACTGATGGAT
CTGGTAAAAG CCATGCCGTC AACCAAAATT GCTAATTACA TCTTCAGCAA CAATCACGAT
TTAACCTTCA CAAATAAACA GCGGGAATGG GGTTTTCAGA CCCCCTCAAT CTCGAACGGG
GCCGTCTACG CCGACCTTGA CAACGACGGT GATCTGGAAC TGGTAGTCAA CAACATTAAC
GAGCCGGCTT TCGTGTATCA GAACATGAGC CGGGAACAAT CTGCCAATGG GTTTCTTCAG
GTAAAACTGG TGCCTTCCGG GAAAAACAGG ACCGCTATCG GCGCTAAGGT CACGCTGTTT
GCTAATGGCA ATTTACAATA TCAGGAGGTA AATCCGGTGC GGGGTTATTT ATCAAGCCAA
CCGCTCACGC TTCATTTCGG CGTTGGAACG GCCAGTAGGG CTGACTCCAT CAACATTATC
TGGCCGGATC AATCTGTACA AAAATTAACC GGTGTTCCTG TCAACCAGCA ACTAGTTGTG
CAACAGCAGG CGACTCCCGC TAGCGTTAGT CAGGCTGCTG TGCAGAAAAT GGCCTCACCC
GTGTTCACGA AAGTAGACCC GGTGCTGGCC CATACGCACG AGGGGTTTCT GGAAAATGAT
TTCAAACGTC AGCCGTTGAT GCTCTGGATG TATTCGCACA CGGGGCCGAT ACTGGCGAAG
GGCGACATAA ACAAAGACGG TCTGGACGAT GTGTTCATCA GTGGCGACCA GAACAAGCCG
GGCAGCATCT GGAAGCAGCA GACAAACGGG ACGTTCAAGC AGGTAGAAGG ACTAGTTATT
GGCGACGAAT CCATATCATC CGTTTCGGCC GCTGTCTTTT TCGACGCCAA CGGCGACGGG
TATGACGATC TCTACGTAGC GAAAGGAGGC TACTCGTTAT TTGAAATCAA TACAACCTCC
TTTCAGGATG AGTTGTACAT GAATGACGGA AAAGGCAACC TGACATTGGC AAAAACAGCA
CTGCCCAACC TCGCTGCCAG CAGCAAAGCC TGTGTTCGGC CGTACGACTA CGATCAGGAT
GGCGATATCG ACTTGTTCGT GGGTGGCCGG GTTGTTCCGG GGCGCTACCC CGAAGCACCG
GTTTCCTATC TGCTCCAAAA CACCGGCACA GGCCAGTTTA GGGCGGTCCC CATTCCATTC
GCTAAAATAG GAATGGTGAC GGACGTAAAA TGGGCCGATA TGAATGGTGA TGGGCGGGCA
GACCTGGTCC TGTGTGGTGA GTTCATGCCC ATCAAAATCT ACCTGAATAC AAAAGATGGG
TTTGTCGATA AAACGAAGCA GTATTTCCCG GAGGAGGAGA AAGGCTTCTG GCTGTCGCTA
ACGATAGCTG ATGTAAATGG AGATGGACAG AATGATATTC TGGCCGGAAA TCTGGGTACT
AATTCACAGA TTAAGTATTC ATCTAAAGAG CCTGTCGAGC TTGTCTACGC CGATTTTGAT
AACAATGGCT CTATCGATCC CTTCGTTAGT TTCTACGTTC AGGGAACATC GTATCCGTTT
GTAAGCCGCG ATGAACTGAA TGATCAGATC TACGCGATGC GCAAGAAATT TGCGTTTTAC
AAAGACTACG CAAATGCGAC TATTAGCACT ATCCTACCTG CCGATGACCT GGCCAACGCG
CCGAAACTGA CGGCTACGGA GTGCCATTCT GTCTGTTTTC TGTCGAAAAA AGGACAATTT
GAAAAACAGA TTCTGCCAAT AGAAGCCCAG TTCGCTCCGG TCACAAACAT GCTATGTGAG
GATTTTGACC GGGATGGTCA ACTGGACGTA CTGCTGTTGG GAAACAAATC GGACAATCGG
TTAAAACTGG GTAGCATGGA CGCGAACTAC GGCTGTTTAC TGAAAGGAGA TGGTAAGGGA
GGTTTTACGT ATGTGAGCCA GCCTGCTTCG GGTTTATCCG TGATTGGTGA TGTAAAATCG
GTTGTTGATA TCGGAATCAA TCAGAACAGG TGTTTACTGA TCGGAGCCTT CAATCAGCCG
TTGCAGGTGT ACAAAAAGCA AAAACTATGA
 
Protein sequence
MNSQFPQSTN RDLIDRLRYS VCFPFLIICL CCPVVYAQNP LFQLLAPKQT HIDFKNDIDE 
NESLNVLSYE YFYNGGGVAV GDINNDGLLD LFFTANLKAN KLYLNLGKLT FKDITSEAGA
QLGGRAGGWK TGVSMADVNG DGWLDIYVCY SGKGDESKRR NQLFINQGAG PKGMVRFVEQ
AKEYGVDDNG YNTQATFFDY DRDGDLDLFL LHHNVKKYDN MELAKLHGET DLLAGNKLLE
NRNGHFVDVS QKEGIHQYPL TFGLGMAVAD VNKDGWPDIY VTNDYNEPDY LYINQKGIDL
AARRSGEPAF KDETQSYFRH LAQFSMGVDI ADYNNDGLPD IMSLDMLPED NRRQKLLQLQ
ENYESFELMQ QQKLQRQYMR NMLQLNNGDG TFSEIAQTAG VSNTDWSWSP LLADFDNDGY
KDLFITNGYL RDYTNKDFLK YWGDYKIKKA IDREPVQLMD LVKAMPSTKI ANYIFSNNHD
LTFTNKQREW GFQTPSISNG AVYADLDNDG DLELVVNNIN EPAFVYQNMS REQSANGFLQ
VKLVPSGKNR TAIGAKVTLF ANGNLQYQEV NPVRGYLSSQ PLTLHFGVGT ASRADSINII
WPDQSVQKLT GVPVNQQLVV QQQATPASVS QAAVQKMASP VFTKVDPVLA HTHEGFLEND
FKRQPLMLWM YSHTGPILAK GDINKDGLDD VFISGDQNKP GSIWKQQTNG TFKQVEGLVI
GDESISSVSA AVFFDANGDG YDDLYVAKGG YSLFEINTTS FQDELYMNDG KGNLTLAKTA
LPNLAASSKA CVRPYDYDQD GDIDLFVGGR VVPGRYPEAP VSYLLQNTGT GQFRAVPIPF
AKIGMVTDVK WADMNGDGRA DLVLCGEFMP IKIYLNTKDG FVDKTKQYFP EEEKGFWLSL
TIADVNGDGQ NDILAGNLGT NSQIKYSSKE PVELVYADFD NNGSIDPFVS FYVQGTSYPF
VSRDELNDQI YAMRKKFAFY KDYANATIST ILPADDLANA PKLTATECHS VCFLSKKGQF
EKQILPIEAQ FAPVTNMLCE DFDRDGQLDV LLLGNKSDNR LKLGSMDANY GCLLKGDGKG
GFTYVSQPAS GLSVIGDVKS VVDIGINQNR CLLIGAFNQP LQVYKKQKL