Gene Slin_0206 details

Gene Information

Locus tag: Slin_0206
Symbol:
ID: 8723934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 274847
End bp: 278041
Gene Length: 3195 bp
Protein Length: 1064 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003385070
Protein GI: 284035140
COG category:
COG ID:
TIGRFAM ID:
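
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon (the gene sequence listed in this record ends in TAA). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 274847
end_bp = 278041

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
print(f"Length is a whole number of codons: {remainder == 0}")
```

Under these assumptions the computed values agree with the listed Gene Length (3195 bp) and Protein Length (1064 aa).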


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.676581
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAAC TTCTATTAGG AAGTTGGCTA CTCTTGTTAT TATCCTATTT TCCTGCATTG 
GCTCAGGAAG TGGCAGTGAG CGGCCGTGTC ACTTCATCCG ATGATGGCTC TGCACTTCCA
GGGGTGAGTG TACAGGTAAA GGGTCTTAGC CGGGGAGGCA CAACAGATGC CCAGGGTAAT
TATCGGGTAA CCGTACCGGC AAATGGTCGA TTGGTATTCA GTTTCATCGG CTATAAAAGT
CAGGAGTTGC CGATTGGCAA CCGCACTACC CTAAATGTGG TTTTACAGGC CGATGCCACC
AATCTGGGCG AGGTAGTTGT TACAGCATTG GGTATTGAGC GCGACACCCG GTCCTTATCC
TACGCTACTC AGCAGTTAAA TGGCGACCGC ATTTCGCAGC GGGGTGAACC AAACGTCTTG
AACGCCTTAC AGGGTAAAGT ATCCGGAGTT CAGATTACCG GAGCCAGTGG CGAAGCCGGT
GCTTCAACGA ACATCAACAT CCGGGGTATT CAGTCATTTA CCGGAAATAA CCAGCCGCTT
TTCGTAGTCG ATGGTATTCC CATCAGTAAC AGCGTAGACC GCACCAACCT TGGCCCGAAC
GGTACGCTGG GCGGTCCGCA GTCCTCAAAC CGGGCCCTCG ATATTGCCCC GGAGAATATT
GAGTCGCTGA ACGTATTGAA AGGACCTGCT GCGGCTGCAC TGTATGGTTC ACGTGCTGCT
TCGGGCGTCA TCGTAATTAC GACGAAGTCC GGCAAAAATG CCAAGAAGAA AACCGAGATC
ACGGTCAATT CAGGCATAAC GTTTCAGAAC GTATACGGCC TGCCGAACTT TCAGAACGAT
TATGGTCAGG GGTTGAATAA TATTTTCCAG AGCAATAGTG TCGCTTCCTG GGGACCTGCC
TATGGCTCTC CGGCTACGCT GGAAAATGGC TTTTTTACCC GAAATTCCAC AAACCCCACG
CAGTTGGATT CGACCGCACT GTTTCCGGGG GGCGCTTTAC CGGCCTACAA AGCCTACGCG
AATTACCGGG CCTATCCAGA CAACATCAAG AACTTCTTCA ACCAGGGCCG AATCCTGTCA
AACTCCGTCA ATATTGGTGG CGCGAGCGGC TCCAGCAACT ATAATTTCTC GGTAGCCAAT
ACCGCTCAGG AAGGTATTGT TCGAAACTCG AAATTTAACC GGACCAACGT AAGTTTTGGT
GGCAACACGG TATTGTCAAA CAAATTTCAC GTGGGCGCAT CGGTCAATTA CAGCAATACG
GGCCAGAGCA ACTCGCTGAC GGGTAATGGT GGTAGTGCCT TTGGACAGTT GGTATCCGTA
CCCCGAAGCT ACGATTTACC AAACCTTCCT TTCCAGGATG CTAATGGCCG TAGTGTATTT
GCGGGTACAC CCGGCTCCAA TGAGAACCCA ATCTGGGGCT TGGAAAACAA CCGGACAACC
AGTAATGTCG ACCGCCTGAT CGGTAATATC AACATCGGCG TTGACTTGGT CAAAGGCGTC
AATCTGTTCT ATCGGGCCGG TATCGATACG TATACCGACC GCCGGAAGCA GATTTTCGCG
CCGAGTGCGG CCCGCCTGCT GGTGGGTGGT GTTGGCGAAG ATATTTATTA CTGGCAGGAA
CTGAACAGTG ATCTGATTTT GACCGGCCAG AAAAATGACA TTCTGCCGGG ACTTAACATC
GAAGGCCGGT TAGGCTTTAA CGTCAACCAA CGGAAATCAC AGAACGTAAC GGCCAATGGA
CAAAGTCTGA CACTACCCAA TTTTTATAAC CTGAGCAATG CTACGGTCTT CTCAAACAAC
ACCGGAGAGA GCAACACACT GCGTCGACTG GCGGGTTATT ACGGACAGGC CAGTTTCTCG
TTCCGCAATT ATGCTTTCCT TGAGTTGACG GGTCGTTTCG ACAAATCGTC GACCTTACCG
CTCAGCAACA ATACCTATTT CTATCCGGCG GTGTCGGCTA ACGTCATTCT TTCCGACATC
TTCCCGATCA AATCATCGAC ACTTTCGTAC CTGAAAGTAC GGGGTAGTGC CGCAACGGTG
GGTAAAGATG CCGATCCGTA TCAGCTACAA TCGGTTTACG TGACCACGAG CCGGGGCAAT
AACGTATCGA ACATCGCCTT CCCCATCAAC GGGCAGAACG CCTTCTCGAT CAGCAACCGG
ATTGGGCCCG GCGATAACCT GAAACCTGAA TTTACAACCT CGTATGAAGG CGGTTTGAAC
CTTGGGTTGT TTGATAACCG CTTTAGCATT GACCTCACCT ATTATAATTC GGTCAGTACA
GACATGATCG TAAACGTAGG TATGGCCGCA TCGACCGGGT ATACGACACG GACCACGAAC
GTGGGTAAAA TGACCAATAA GGGTATCGAA GCCTTGCTGA CGGTAACCCC AATCCGGGTT
AAAGACTTCC GCTGGGATGT AACCGTCAAC TACACGCGCA ACGTAAACGA GGTAGTCGAC
ATCGCTCCGG GGGTGGAAAG TTTCTCCATT CCGGGAAGCG CCTTCACGGG TTCGATTCCG
TCGATCGTGA AAGGGCAGCC TTACGGAGTA ATTCTGGGGA ATAAAAAACC AACTAACCCC
GACGGTCAGT GGATCATCAA CCCCAACACA GGACTTTGGA ATCCGGAAGT GTCTAACCAG
AATATTTCCA ACCCAAACCC ACTCTGGATA GGCAGCATTC AGAACACCTT CAAGTACAAA
GGCATTGCAC TGGGCGTTCT CTTCGATACG CAGCAGGGCG GTCAGTTAGT AGAGTTCTCG
TCGGGTTCGT ATAAATCCAA CGGTACGCTT GACGTAACGG GAGTGAACCG GCAGGCTCCA
CGCATCATAC CGGGTGTATT TGAAATCAGC AATGCCGACG GCTCCAAGTC GTACACACCG
AACAACATCC AGGTGGATGC ACAAAGTTAC TGGCGTGCGT TTGGCTTGCA GAGCGACCTG
AACGTGTTCA GTGCAACCCG GTATAGTCTG CGCGAAGCGA CACTGAGCTA CGACCTGCCC
GCATCGCTGA TTGGCAAAAC ACCCTTTGGT GGTATATCCA TCTCTGCCGT TGGCCGTAAC
TTATGGTATT TTGCCCCCGG TTCGCCCATC GACCCAGAAG TTAATACACA GGGCGCAGGC
AACATCCGTG GTCTGGAACT ACAAAGCGCA CCCAACACCC GCAACTACGG CGTAAATCTG
AGATTTACTT TCTAA
 
Protein sequence
MRKLLLGSWL LLLLSYFPAL AQEVAVSGRV TSSDDGSALP GVSVQVKGLS RGGTTDAQGN 
YRVTVPANGR LVFSFIGYKS QELPIGNRTT LNVVLQADAT NLGEVVVTAL GIERDTRSLS
YATQQLNGDR ISQRGEPNVL NALQGKVSGV QITGASGEAG ASTNINIRGI QSFTGNNQPL
FVVDGIPISN SVDRTNLGPN GTLGGPQSSN RALDIAPENI ESLNVLKGPA AAALYGSRAA
SGVIVITTKS GKNAKKKTEI TVNSGITFQN VYGLPNFQND YGQGLNNIFQ SNSVASWGPA
YGSPATLENG FFTRNSTNPT QLDSTALFPG GALPAYKAYA NYRAYPDNIK NFFNQGRILS
NSVNIGGASG SSNYNFSVAN TAQEGIVRNS KFNRTNVSFG GNTVLSNKFH VGASVNYSNT
GQSNSLTGNG GSAFGQLVSV PRSYDLPNLP FQDANGRSVF AGTPGSNENP IWGLENNRTT
SNVDRLIGNI NIGVDLVKGV NLFYRAGIDT YTDRRKQIFA PSAARLLVGG VGEDIYYWQE
LNSDLILTGQ KNDILPGLNI EGRLGFNVNQ RKSQNVTANG QSLTLPNFYN LSNATVFSNN
TGESNTLRRL AGYYGQASFS FRNYAFLELT GRFDKSSTLP LSNNTYFYPA VSANVILSDI
FPIKSSTLSY LKVRGSAATV GKDADPYQLQ SVYVTTSRGN NVSNIAFPIN GQNAFSISNR
IGPGDNLKPE FTTSYEGGLN LGLFDNRFSI DLTYYNSVST DMIVNVGMAA STGYTTRTTN
VGKMTNKGIE ALLTVTPIRV KDFRWDVTVN YTRNVNEVVD IAPGVESFSI PGSAFTGSIP
SIVKGQPYGV ILGNKKPTNP DGQWIINPNT GLWNPEVSNQ NISNPNPLWI GSIQNTFKYK
GIALGVLFDT QQGGQLVEFS SGSYKSNGTL DVTGVNRQAP RIIPGVFEIS NADGSKSYTP
NNIQVDAQSY WRAFGLQSDL NVFSATRYSL REATLSYDLP ASLIGKTPFG GISISAVGRN
LWYFAPGSPI DPEVNTQGAG NIRGLELQSA PNTRNYGVNL RFTF