Gene Slin_3420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3420 
Symbol 
ID8727173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4144486 
End bp4147716 
Gene Length3231 bp 
Protein Length1076 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003388227 
Protein GI284038297 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0439065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAC TCTTACCACC TAAACTCCTG TCCGATAGGT TAATTAGAGT TTCAAGCACT 
CAACTCCTAC TGGCAGCTTT GTGTGTTAGT TTTACTTATG CGGGAAAGCC GGTTAATCCA
AAATTCCCCG TCAATCAGTC AGTGAAGCAG GCTGACCGTA CACTCACAGG CCGGGTAACA
GACGAAAAGT CCGAAGGACT TCCCGGCGTG AGTGTTATCC TGAAAGGAAC CCAGCGCGGA
ACCGTAACCG ATGCCGATGG ACAGTATAAA CTTGACGTAC CCGATGGGGC CTCTACACTT
GTGTTCTCCT TTGTTGGTTA CCTGCCACAG GAAGTTAGTG TTGGGAATCA AACGTCAATC
AACGTCAGCC TGAAAACCGA CAGTAAAGTA CTGGATGAGA TCGTAGTCAT CGGCTATGGT
ACGACGCGCA AATCCGACCT TACCGGCGCT GTCACCGGCG TGAAGGAGGC CCAGTTGCAA
GAGCGCCCTG CGCCTTCGTT GAACCAGGCC CTGTCAGGTC GCATGCCCGG CGTGCAGGTC
AACACCAACT CGGGACGACC CGGCGGTCGG ACCACCGTCC GTATCCGGGG CTTCAGCTCC
ATCAACTCCT CCAACAACCC CCTCTACGTC GTTGATGGCG TCATGCTCCC CCAAGGTACC
GGCGACCAGT TCAGTAACCC AATCGATTAC ATCAACCCCA ACGACATCGT CAACGTGGAG
GTCCTGAAAG ATGCCTCTTC GACGGCTATC TACGGGGCTC GTGGCGCCAA CGGCGTTATT
CTGGTACAAA CTCGTAAAGG GAAAGCCGGT GAAAGCCGGG TCACCTACGA CGGTCAGTTC
AGCGTTAACA CCATCGGACC CAACAAGCCA AAGGTGCTCA ACGCCAAGGA GTACCTGGCT
ACCGAAGACC TCGCCTATGC CAACATGGCC AAGTATGACC CCGTTGGCTG GGCCGCGGGT
AAGTGGTCTT ACCTGGACCC GATAGCCCGG CGCAAAGCCT TCAGCGCGGC TCACCCTGGT
GTGTTTGATG CCAACCTGAA CCCACTCTAC GACACCGACT GGTTCAAGGA GTCGGCTCAG
AACAAGCTTT CCCAGAACCA CCAGTTAGGT TTCAGTGGCG GTAACGAGCG CACCCAGTAC
TCCCTCTCGC TGAACTACCG CGACGATCAG GGTCTGATCA AGACCTCCTA CATGAAGCGT
TACTCGGGTC GTTTCTCGAT CGATGATCAG GTCAAGAGCT GGCTTAAGAT TGGCGGGACA
ATGAGCTACA ACTACCAGAC GGAAAACCTG GTGGACATCA ACGATGCGGT GGCCCGTCAG
ATCGTCGAAG ACTTCCCCTT CCTGCCCGTA CGCTACCCGG ACACCGGCGT CTTCGCCGAG
AACCGGGACT ACCCTTATGC AGAAGGCACC ATGAGTTCGG TACACCGCCT GATGGACCGT
AAGTACATCC AGAACACCCA GACTATTCTG GGCAGTTTGT TTACCAACAT CACTTTCGGC
AAAGGGCTGG AGATGCGTAC AGTACTGGGT ACCAACGTCC AGACGCAGGA GATTAACCAG
TCGCAAACCC GTACGCTTAA CATTGGCAAT AACGGTAACG CATCGACCAA CAACAACCGG
CAGAATTTCT GGTCGTTGGA GAACTACCTG ACCTACAACA AACAGTTTGG TCAGGATCAC
TCCTTCACCG GACTACTAGG TCTGTCGTGG CAGGAGACTA ACACCTTTGG CATCGGTGCC
AGCGTAAGCG GTTTTGCCAC CGACTACTTC GGCTTTAACA ACCTGGGCGC TGGTGCGATC
AACCCATCGG TGAGTTCGGG TGCTTCACGG TTTGCCTTTA ACTCCTACTT CGGTCGGATC
AACTACGGCT ACAAGAACAA GTACCTCTTC ACCGCTACCG GCCGGGCTGA TGGCTCCTCG
AAGTTCGGAG AAAACCACAA GTTTGCCTTC TTCCCCTCGG CGGCTCTGGC CTGGCGGGTA
TCGGAAGAAG ACTTCCTGAA AGGCAACCCC GTTATCTCGA ATTTGAAAGT GCGCACCAGC
TACGGTCTGA CGGGTAACTC CGAGATTCCG CCTTACTCCT CGCTCTCGCT GTTGAGTTCA
AACTACGCTA CGATCTATAA TGATACGAAG GTGAGTGGCA CGGGTATCAA CCGTCTGGCT
AACCCCGACC TGCGCTGGGA AAAAACCGCT CAGACTGATG TAGGTCTGGA AGTTGGCTTC
CTCAAAGGAC GCATCTCGCT GGAAGCCGAT TACTACTACC GTCTGACAAC CGACATGCTC
CTGGATGCCC CCGTACCACA ATCGAGCGGC TATGCCACCA TTCGGCGTAA CGTAGGCTCG
ATGGAGAACA AAGGTTTTGA GTTCGGAGTG AACACGGTTA ACATCAACCG GGGTACTTTC
AGTTGGAACA CCTCCTTCAA CATCTCCCTT AACCGCAACA AAGTCCTCTC CCTGGCTACT
CCGTCTGACA TCTTCAACGT AGGTGGTCCT AACTTCACTA ACCCCACCAA TGTCATCCGG
GTAGGTGAAG CAGTAGGTTC GTTCTGGGGT CTGACCCGGG TAGGCGTATG GAGTGAAGCG
GAGCGGGAAG AAGCGGCCAA GTTTACCAGC TACCGCAACG GTCTGACCAT TCTGCCCGGC
GACATCAAGT ACCTCGACGT AAACGGCGAC AAGGCCATCA CCGATGCTGA CCGCAGCATC
ATTGGCAACG GTAGTCCTAA AGGCTGGGGT GCCATGACCA ACAACATTCG TCTGGGCAAC
TTCGATGCCA CCCTGGAACT TCAGTACATG TTTGGTAACG ACGTCATGCT GATGAACTTA
CACCCCAGTG AAGACCGGCA GGCTCTGGCC AACAGCTACT CGTCGGTGCT CAACGCCTGG
ACGCCAACCA ATCAGGGTAG CCAGATTGCT CAGGTACGCG ACACACGGGC GGGCTACGTA
ACCAACGTCG ACAGTCACTG GATTAAGGAC GGTTCGTTCC TGCGGGGCCG CAACCTCCTG
TTTGGCTATA CGCTGCCGGC TAACGTAACG TCTAAATTAA AGATGAACCG GTTACGGGTA
TACGTTTCCG CTCAGAACTT CTTCCTGTTG CTGAAAGATC CTATTGTTGG TGATCCGGAA
GTAACGCCCA CCAACCAGGG AACAGGCAAC AGCGCCTTCT CACAGGGCAT GATCTGGCAC
AACTACCCTA AACCAACTAC CTATCTCCTT GGTCTGCAAA TTGGCTTGTA G
 
Protein sequence
MAKLLPPKLL SDRLIRVSST QLLLAALCVS FTYAGKPVNP KFPVNQSVKQ ADRTLTGRVT 
DEKSEGLPGV SVILKGTQRG TVTDADGQYK LDVPDGASTL VFSFVGYLPQ EVSVGNQTSI
NVSLKTDSKV LDEIVVIGYG TTRKSDLTGA VTGVKEAQLQ ERPAPSLNQA LSGRMPGVQV
NTNSGRPGGR TTVRIRGFSS INSSNNPLYV VDGVMLPQGT GDQFSNPIDY INPNDIVNVE
VLKDASSTAI YGARGANGVI LVQTRKGKAG ESRVTYDGQF SVNTIGPNKP KVLNAKEYLA
TEDLAYANMA KYDPVGWAAG KWSYLDPIAR RKAFSAAHPG VFDANLNPLY DTDWFKESAQ
NKLSQNHQLG FSGGNERTQY SLSLNYRDDQ GLIKTSYMKR YSGRFSIDDQ VKSWLKIGGT
MSYNYQTENL VDINDAVARQ IVEDFPFLPV RYPDTGVFAE NRDYPYAEGT MSSVHRLMDR
KYIQNTQTIL GSLFTNITFG KGLEMRTVLG TNVQTQEINQ SQTRTLNIGN NGNASTNNNR
QNFWSLENYL TYNKQFGQDH SFTGLLGLSW QETNTFGIGA SVSGFATDYF GFNNLGAGAI
NPSVSSGASR FAFNSYFGRI NYGYKNKYLF TATGRADGSS KFGENHKFAF FPSAALAWRV
SEEDFLKGNP VISNLKVRTS YGLTGNSEIP PYSSLSLLSS NYATIYNDTK VSGTGINRLA
NPDLRWEKTA QTDVGLEVGF LKGRISLEAD YYYRLTTDML LDAPVPQSSG YATIRRNVGS
MENKGFEFGV NTVNINRGTF SWNTSFNISL NRNKVLSLAT PSDIFNVGGP NFTNPTNVIR
VGEAVGSFWG LTRVGVWSEA EREEAAKFTS YRNGLTILPG DIKYLDVNGD KAITDADRSI
IGNGSPKGWG AMTNNIRLGN FDATLELQYM FGNDVMLMNL HPSEDRQALA NSYSSVLNAW
TPTNQGSQIA QVRDTRAGYV TNVDSHWIKD GSFLRGRNLL FGYTLPANVT SKLKMNRLRV
YVSAQNFFLL LKDPIVGDPE VTPTNQGTGN SAFSQGMIWH NYPKPTTYLL GLQIGL