Gene Slin_2476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2476 
Symbol 
ID8726220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2996590 
End bp2999877 
Gene Length3288 bp 
Protein Length1095 aa 
Translation table11 
GC content51% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003387294 
Protein GI284037364 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.263017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.39845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAAAC AATTTACTCA ACAGGGGGCC TTCTTTCAAC GAGATGATGC TTTTCCATTT 
TACGCGACCG TCAGTCGACA GGTCGCTTTT TGTTCGTTAT TCCTACTGGG TCTTCTGGCC
TCTGTAGGTG CTTATGCCCA GACGAAAGTT TCGGGTAAGG TGGTCGATGC GCAGGGGCTG
GCCCTGCCGG GGGTGAGTAT CGTCGTGAAG GGCACGACAA CGGGTACGGT GTCGGGTGGA
GAGGGTGATT TTACACTTAA CGTAGCCAGA GGTAACGAGA CGCTGGTTTT CTCGTACATC
GGATTCATTA CGCAGGAAGT AGCCATCAAT AACCGCAGTA GCATCAATAT TACGCTGGCC
TCAGACGATA AAATGCTGAG CGAGGTTGTT GTTGTCGGGT ACGGTGAGCA GAAGAAAGAA
ACCGTTACGG GTGCCGTTGC AACTGTAAAG GGCACAGATC TGGTAAAGTC GCCAGCTGTA
AACCTGAGTA ATTCCATCGC GGGCCGTATG CCGGGCGTTA TCGCCACCAA TGCCAGTGGT
GAACCAGGCT ACGATGGAGC GGCAATCCGG ATTCGGGGCT CAAACACGTT GGGTAACAAC
GACGCGCTGA TCGTAATTGA CGGTGTACCA GCGCGGGCGG GTGGTATTGA CCGCTTGAAC
CCGGCGGACA TTGAGAGTAT GTCGGTATTG AAAGATGCTT CTGCAGCTAT TTATGGTTCA
CGGGCAGCCA ACGGGGTTAT CCTGGTAACT ACCAAGCGTG GTAAGAGCGG TAAACCGGAG
TTGTCGTACA GCTTCAACCA GGGCTTTGGT CAGCCAACCG TCATCCCTAA AATGGCATCT
GCGGCTGAAT ATGCACAACT GAACAACGAG ATCAACGTGT ATAATCTGCC TTCGCAGTAC
TGGAAAGATG CGAATACGGC GTTCAATACG ACGGGAAGTT ATACAAGGCC CGATAATGGA
TCTATTGCCA AAGCCGCTTT TACGCCGGAT GACATCAAGA AGTTTCAGGA CGGATCTGAC
CCGTGGGGAC ACCCCAATAC CGACTGGTTT GGTGCAGCTC TGAAAACCTG GTCGCCACAG
TCGCGGCATA CGCTGCAACT GGTGGGTGGC AACGAGAACG TTAAGTACCT AACATCCGTC
AACTACCAGA ATCAGGATGC CTATTACAAG AACTCGGCAA CGGGCTATAA GCAATACGAC
TTTCGTCTGA ACCTGGATGC CAAAGTGAGT AAATACATCA ACCTGGTAAC GGGCGTGGTA
GGTCGTCAGG AAAACCGTTT TTTCCCAACG GTAGGGGCCG GAGATATTTT TCGTATGCTG
GCTCGGGGGT ATCCAAACAA ACCCGCTTTC TGGCCTAACG GTCAGCCCGC TCCCGACATC
GAGAATGGCC AGCAGCCTGT ACTCGTTACG ACCAGCGCTA CGGGTTACGA TCGAGATACC
CGGTATTATC TGCAAAGTAA TGCCAGCGTT AACATCACAA ACCCCTGGGT TCCGGGCCTG
AAGTTAACCG CTAGTGTCGC GTTGGATAAA TACATTCAGC AAGGGAAACG GTGGCAGACC
CCCTGGTTCG TGTATAGCTG GGATTATACC TCCTACGACC CAACAACGAA AGAGCCTCTT
CTCCAGCGCG TACAGAAAGG ACCGGCCCAG TCTACCCTCA ATCAATATAC CAACGATCAG
CTTAATTCGT TGCTGTCGGG TATTTTGTCC TACGACCATA CCTTCGGTGC CAGTCATGCT
ATTACACTGC TGGCCGGTAT TACCAAAGAA CAGTCTAACT CAAACGGTTT TTCGGGCTTC
CGCCAGTATT TTAATTCGAC GGCCATTGAT CAGTTGTTCG CGGGTAGTCA AACCCAGCAG
GTTGCCAATA CAACGGCTGC CTGGCAGCGG GCCCGCATGA GTTATTTTGG CCGGGCTGCG
TATAACTATA AAGAAAAATA CCTGGCCGAA TTCCTGTGGC GTTATGATGG GTCGTATATG
TTCCCGTCGG CCAGCCGGTG GGGCTTCTTC CCCGGCGTAA CGGCGGGCTG GCGTATTTCG
GAAGAAGATT TCTTCAAGAA AGCCTTACCC GTTGTTAGCT CTCTGAAACT TCGCGCATCG
TGGGGTCAAT TAGGTAACGA CCAAGTGTAT TTTAACAACA CCCTGCGCGA GTACGATTAC
CTACCTACCT ACGCTTACGG CGACGCAGTC AATTCAGGCT GGGGTTATGT AATCAACGGG
CAAGTGGCCC AGACGCTGTA TGAAAATGGT GTTCCTAACA GGAAGTTAAC CTGGGAAGTT
GCCAACAACG CCGACATCGG TCTGGAAGGG TCACTGCTGA ACGGAAAGGT CTTCTTCGAA
TTCGACGTTT TCCAGAACAA GCGGTCCAAT ATTCTATGGC GCCAGAGTGC TTCTATTCCG
CAAACGACAG GTGCTACGTT GCCCGCAACG AACATTGGTA GAGTAACCAA TAAAGGGTAT
GAATTCCGGG TTGGCTATAA TGGCCAGGTG GGCGACCTGA AATATAACGT GAGCGTGAAC
GGTGGTTATG CCAAAAATAC CATTACGTTC TGGGATGAAA CGCCGGGTGC ACCAGAATGG
CAGCGGTCAA CGGGTAAGCC TATTCCAACT GACGTAAACA ACCCGAACAA TGCCAACGGC
ACACTCATGT ATCAATATGA TGGTATTTTC TCGACGCAAG CCGATATTGA TGCCAACAAA
CTGGATTACA GCGGTGTAGG AGCCAGCCTG CTACGTCCTG GCGACATGAA ACTCAAGGAT
ATTGACGGGA ATGGGAAAAT TGACGGAAAT GACCGGGTTC GGGCCGACCG CAACAACCAG
CCTCGTTTTC AGGGCGGTTT GAACGCCGGT GTACGGTATA AAAACTTCGA TCTGAGCATT
CTGTTCCAGG CATCAGCCGG TGGCCAGATC TTCCTTCAAA CGGAATCCGG TACCATTGGT
AACTTCCTGC AATACAGCTA CGATCACCGC TGGACGGTCG ATAACCCAAG CACCGTTGAT
CCCCGTATTG TTGACCGGAG CAATCAGTAC TTCTCCAACG GTACCACCTA CTGGTTGAAG
AGCACCGACT ATATCCGGTT GAAAAACCTG GAGTTAGGCT ACACATTACC GAGCACCATT
GGTAGCAAAA TTGGCCTGAA CAACCTGCGC GTTTATGTCA ACGGCTTGAA CCTGGCTACC
TACGCGCCAG CCATGAAAGG CATCTACGAC CCTGAGTCGA CTAATAGTGC GGGACAGTAT
TACCCACAGG CACGAGTTAT CAATATGGGT GTATCACTTA GTTTCTAA
 
Protein sequence
MYKQFTQQGA FFQRDDAFPF YATVSRQVAF CSLFLLGLLA SVGAYAQTKV SGKVVDAQGL 
ALPGVSIVVK GTTTGTVSGG EGDFTLNVAR GNETLVFSYI GFITQEVAIN NRSSINITLA
SDDKMLSEVV VVGYGEQKKE TVTGAVATVK GTDLVKSPAV NLSNSIAGRM PGVIATNASG
EPGYDGAAIR IRGSNTLGNN DALIVIDGVP ARAGGIDRLN PADIESMSVL KDASAAIYGS
RAANGVILVT TKRGKSGKPE LSYSFNQGFG QPTVIPKMAS AAEYAQLNNE INVYNLPSQY
WKDANTAFNT TGSYTRPDNG SIAKAAFTPD DIKKFQDGSD PWGHPNTDWF GAALKTWSPQ
SRHTLQLVGG NENVKYLTSV NYQNQDAYYK NSATGYKQYD FRLNLDAKVS KYINLVTGVV
GRQENRFFPT VGAGDIFRML ARGYPNKPAF WPNGQPAPDI ENGQQPVLVT TSATGYDRDT
RYYLQSNASV NITNPWVPGL KLTASVALDK YIQQGKRWQT PWFVYSWDYT SYDPTTKEPL
LQRVQKGPAQ STLNQYTNDQ LNSLLSGILS YDHTFGASHA ITLLAGITKE QSNSNGFSGF
RQYFNSTAID QLFAGSQTQQ VANTTAAWQR ARMSYFGRAA YNYKEKYLAE FLWRYDGSYM
FPSASRWGFF PGVTAGWRIS EEDFFKKALP VVSSLKLRAS WGQLGNDQVY FNNTLREYDY
LPTYAYGDAV NSGWGYVING QVAQTLYENG VPNRKLTWEV ANNADIGLEG SLLNGKVFFE
FDVFQNKRSN ILWRQSASIP QTTGATLPAT NIGRVTNKGY EFRVGYNGQV GDLKYNVSVN
GGYAKNTITF WDETPGAPEW QRSTGKPIPT DVNNPNNANG TLMYQYDGIF STQADIDANK
LDYSGVGASL LRPGDMKLKD IDGNGKIDGN DRVRADRNNQ PRFQGGLNAG VRYKNFDLSI
LFQASAGGQI FLQTESGTIG NFLQYSYDHR WTVDNPSTVD PRIVDRSNQY FSNGTTYWLK
STDYIRLKNL ELGYTLPSTI GSKIGLNNLR VYVNGLNLAT YAPAMKGIYD PESTNSAGQY
YPQARVINMG VSLSF