Gene Slin_1166 details


Gene Information

Locus tag: Slin_1166
Symbol:
ID: 8724899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1419615
End bp: 1422869
Gene Length: 3255 bp
Protein Length: 1084 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003386016
Protein GI: 284036086
COG category:
COG ID:
TIGRFAM ID:
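
The listed gene and protein lengths follow from the coordinates by simple arithmetic. A minimal sketch in Python, assuming the Start/End coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both assumptions are consistent with the values above but are not stated by the record); the function names are illustrative only:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1419615, 1422869)
print(length)                     # 3255
print(protein_length_aa(length))  # 1084
```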


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.689476
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAG CTCTAATAGG AAGCTGGCTG CTATTGCTGT TGGTTGGTTT GCCTGTGTTA 
GCCCAGGAAA TAGCCATAAC TGGCCGGGTC ACCTCATCAG ATGATGGCTC CGCATTGCCC
GGTGTGAGTG TTGTCGTCAA AGGATCGACC CGCGGCACCA CAACCGATGC CAATGGTACT
TATCAGATAA ATGCAGGCTC CGCCACTACA CTGACATTCT CGTTCGTTGG CTTCAAACCA
CAGGATGTAG CGGTGGCTAA TCGTACGACA ATCAACGTAG TTTTAGCCGC CGATGCCTCC
ACACTGAACG AAGTCGTTGT TACAGGATTC GGGATTCGGC GCAACGAGCG CGAAATTGGC
ACATCTGTTA CCAAGATCAA TAACACATTG ATCAACCAGG CAGCTCCCGT TAACCTGGCG
AATGGCCTGA CCGGTAAAGT GGCCGGGCTG CAGATCAACG CGGTCAATAA CGGCGTCGGT
TCGAACCCCC GCATAACCAT TCGGGGAAAT CGTTCGTTTC TGGGCAACAA CCAGGCGCTA
CTGGTCGTAG ACGGTGCTCT GACGGACGTG AGCTTTCTGT CGGCCATCAA CCCCAACGAC
ATTGAAAGCA GCTCCATCTT GAAAGGGCCG AGCGCAGCGG CTTTGTATGG TTCCGACGCG
GCTAACGGTG TACTGGTCAT TACAACCCGG CGCGGTACAA CCAACAACAA GCCCCAAATT
TCTTACACTA ACAACACCCA GTTGGAGAGT GTGTCGTACA TGCCTGATCT ACAGCGCCTG
TATGGCTCCA ATGGGGGCGA AGGTGCTCCT TTTCTGGATG CCAACGGCCA GCGGCTTTAC
GTACCTTATG AGAACCAGCA GTTCGGCCCT CTGTATGATG GCTCATTACA GCCGCTGGGT
TACGGGGTGC AGGTCATAAA CCCTGATGGG TCTATTCGGA TCGATACGCT GAAAGTTCCC
TATGCGTCGA CAGGAAAAGA CCCGCGCCGT GCGTTTTTTA ACACGGGGGT TACGCAGCAG
CACGACCTGG CCTACCGGGT GGGCGATGCA CAGAATTACT TTGGACTTAG TGTGCAGCGC
GTCGATCAGA AAGGGATTGT TCCTAACGAT AAATACAGCC GTACCAACTT CACGGTGAAC
GGTGGCCGGG CGGTAGACCG CTTTACGGCC AATGCCAAGA TGCAGTTCAC TTACGAAAAT
ACGGATCAGG AGAACGGCGA TTTCGGCCAG GGACGCCCGC TATACTGGAA CCTGCTGAAC
CAGCCCGCCC ATGCACCACT CACCGATCCG CGTATTAAGG ATATTAACTC CCCTTACGGC
GATGTGAACG GCTATTTCAA CGCCTACTAC CCAAACCCCT GGTGGCAGGT AACCGGCGAC
AACTCGCGGG CCGTTACCAA CAAATACTCG ATTCAGGGGA CGGCCGATGT TGGCTACAAA
TTCACCGATT GGCTCAATGT TACCTATCGG GTGAGTGGTC AGGTATCCAA CACACAGTTT
AAATCGCACC TGGCGGCTGT TTCGTTCAGC ACCTACGCGC TCGGCGACCC CTGGGGTGCG
GGGAACATTG CCTCATCGCT GAAACAGGTG AACGGTAACG TGAGTGATTA TAGTCGTACG
ACCTCCCGCG TGACAGGTGA CCTGCTGATT ACGATAGCTC CCAATTTTGG TGACTTCACT
ACCAAGCTGA TTCTGGGGCA GCAGGCCCGG GTCGACTATT CACGGTACAT CTCCACGACG
GCAACCTCAC TCGTAGTGCC CGGTACCTAT AACATCGCCA ACCGGCTGGG TAATGTGCTT
GCCAGTGAGA ACTCCTACCA GAGTCGGCTG TTAGGTTATT TTTATGATTT CACGGCCGGG
TTCCGGAATT TCGCCTTCAT CAACGCCACC GGTCGTTACG ACAACACCTC GTTGCTGGCT
GCTGGCAACC GGTCCTATTT TTACCCTGGT GTGAATGCGT CGGTTATCCT GACCGAAGCC
ATTCCTGCCC TGAAGGGGAG TAGCGTCCTT TCCTATCTGA AGGTGCGCGG GGGTATTGCC
AGAGCAGGGA ATATCAGCGT TGGGCCTTAC CAGTTGCAGA ATGTATTTAA CCCCGGATCA
GGCTTCCCCT ACGGTAGCCA GCCCGGCTTC TCGCTCAGCA CTCAGCAGAA CGACCCTAAC
CTGAAGCCCG AGTTCACCAC GAACAAGGAA GTAGGTGTTG AGTTTGGCCT GTTCGACCGG
GTCAATGCCG AAGTCGTGTA TTACACGATG GAAACCATTA ACCAAACGGT TCCCATTCAG
GTTTCGCGGG CTACGGGCTA TGGCAGCGCG CTGATCAATA CAGGTACTAT GGTCAACAAT
GGGCTTGAAG TGGAGTTGAA AACCCTCCGG CCGATTGTTA ATACCGGTGG CTTTACCTGG
AATGTCAATA CGAACTTTAC CTACCTCAAC AATACGGTAA CGTCCGTTTA TCCGGGCCTG
GATCGCATCA ATATTACGCA GTCGAATGGG GCGCAATCGG CTAACGTGTT TGCCGCTGTC
AATTATACGT ATCCAGCTTT ATTCGGTACT GATATTGCCC GTGTGCAAAA CACCGATCCC
AATGCGGCCT ATTACGATGC AACGGGCCAG TTTGTTGGCC AGCCGGTTAT TAACCCATCA
ACGGGCTACC CCATTCTGGA CGCCAATATC AAGTACCTGG GCAACACACA GCCAAAATAC
CGGTTCGGGT TCAACAACAC TTTCGCCTTT AAAGGACTGA CACTGAACGC CCTGGTCGAG
TACCGGGGGG GCAATGTGAT TTACAACCAG TTGGGTAACG CACTGGAGTT TACAGGTGCC
GGTATTCGGT CGACTTACAA CGGACGGCAG AACTTCGTCT ACCCGAACTC TGTGCTGGCC
ACCACCAACC CCGATGGAAC CACCACCTAT GCGCCCAATA CCAGCGTGTC GACCCGTGAT
GGCAATCTGG AGTTCTGGAC GAATTCGGGC TATCACAATG CCGTGTCGAG TTACGTGACA
AGTGCGGCCT TCTGGAAGCT TCGTGAAGTA GCGTTGAGCT ACAACTTCCC TACCCAGTTG
TTCAGCAATA TCAAGTTTAT CCGATCACTG ACCCTTGGCT TAACAGGCCG CAACCTGCTG
ATGCTTCGGC CGAAAACAAA CGTATTCACG GACCCCGAGT TTTCGGTGGA CAACAGCAAT
GCCCAGGGCG TTACGAACGA ATACCAGACA CCACCAACCC GCCAGTACGG TTTCCGGCTG
AGCGTTGGGT TTTAA
 
Protein sequence
MKKALIGSWL LLLLVGLPVL AQEIAITGRV TSSDDGSALP GVSVVVKGST RGTTTDANGT 
YQINAGSATT LTFSFVGFKP QDVAVANRTT INVVLAADAS TLNEVVVTGF GIRRNEREIG
TSVTKINNTL INQAAPVNLA NGLTGKVAGL QINAVNNGVG SNPRITIRGN RSFLGNNQAL
LVVDGALTDV SFLSAINPND IESSSILKGP SAAALYGSDA ANGVLVITTR RGTTNNKPQI
SYTNNTQLES VSYMPDLQRL YGSNGGEGAP FLDANGQRLY VPYENQQFGP LYDGSLQPLG
YGVQVINPDG SIRIDTLKVP YASTGKDPRR AFFNTGVTQQ HDLAYRVGDA QNYFGLSVQR
VDQKGIVPND KYSRTNFTVN GGRAVDRFTA NAKMQFTYEN TDQENGDFGQ GRPLYWNLLN
QPAHAPLTDP RIKDINSPYG DVNGYFNAYY PNPWWQVTGD NSRAVTNKYS IQGTADVGYK
FTDWLNVTYR VSGQVSNTQF KSHLAAVSFS TYALGDPWGA GNIASSLKQV NGNVSDYSRT
TSRVTGDLLI TIAPNFGDFT TKLILGQQAR VDYSRYISTT ATSLVVPGTY NIANRLGNVL
ASENSYQSRL LGYFYDFTAG FRNFAFINAT GRYDNTSLLA AGNRSYFYPG VNASVILTEA
IPALKGSSVL SYLKVRGGIA RAGNISVGPY QLQNVFNPGS GFPYGSQPGF SLSTQQNDPN
LKPEFTTNKE VGVEFGLFDR VNAEVVYYTM ETINQTVPIQ VSRATGYGSA LINTGTMVNN
GLEVELKTLR PIVNTGGFTW NVNTNFTYLN NTVTSVYPGL DRINITQSNG AQSANVFAAV
NYTYPALFGT DIARVQNTDP NAAYYDATGQ FVGQPVINPS TGYPILDANI KYLGNTQPKY
RFGFNNTFAF KGLTLNALVE YRGGNVIYNQ LGNALEFTGA GIRSTYNGRQ NFVYPNSVLA
TTNPDGTTTY APNTSVSTRD GNLEFWTNSG YHNAVSSYVT SAAFWKLREV ALSYNFPTQL
FSNIKFIRSL TLGLTGRNLL MLRPKTNVFT DPEFSVDNSN AQGVTNEYQT PPTRQYGFRL
SVGF
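
The sequences above are printed in 10-character blocks, so they need whitespace stripped before use. The following is a hedged sketch, not part of the original record, that cleans such text, computes GC content, and translates a CDS. It assumes only unambiguous A/C/G/T bases and uses the amino-acid assignments of NCBI translation table 11, which match the standard code; alternative start codons are not handled, which does not matter here because the gene starts with ATG. All function names are illustrative.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from block-formatted sequence text."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing incomplete codon is ignored; an ambiguous base such as N
    raises KeyError.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGAAAAAAG CTCTAATAGG AAGCTGGCTG CTATTGCTGT TGGTTGGTTT GCCTGTGTTA"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MKKALIGSWLLLLLVGLPVL
```

Pasting the full gene sequence into `clean_sequence` and comparing `translate` and `gc_percent` against the listed protein sequence and GC content is one way to sanity-check a copy of this record.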