Gene Slin_3418 details


Gene Information

Locus tag: Slin_3418
Symbol:
ID: 8727171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4139187
End bp: 4142417
Gene Length: 3231 bp
Protein Length: 1076 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003388225
Protein GI: 284038295
COG category:
COG ID:
TIGRFAM ID:
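
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4139187
end_bp = 4142417

# Assumption: both coordinates are inclusive, hence the +1.
gene_length_bp = end_bp - start_bp + 1

# One codon (3 bp) per residue. Assumption: the last codon is a stop
# codon, which does not contribute an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 3231 1076
```

Under these assumptions the computed values agree with the listed Gene Length (3231 bp) and Protein Length (1076 aa).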


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC ACTTACCTCC TCAAAGCCTG CTGGGCAGGT TAGTCAGCGT AGTAATTACT 
CAGCTACTAC TGACTGCAAT GTGCGTTAAT TTTACTTATG CCAAGGTCCC TCTGGCATTA
AAAACAGTGG CCGACCAACG TGCCATTACA GCTGATCGTA CACTCACGGG TCGGGTGACA
GATGAAAAAG ATGAAGCCTT ACCCGGTGTG AGTGTTATCC TGAAGGGAAC CCAGCGCGGA
ACCGTAACCG ATGCCGATGG CCGGTATAAA GTGGATGTTC CCACGGGTGG CGCTACGCTT
GTGTTCTCCT TTGTCGGATA TGTCCCTCAG GAAGTACGCG TTGGCAACCA AACCTCACTC
AATATCAGCC TGAAAGCCGA CAGCAAAGTG CTCGACGAAA TCGTCGTGAT CGGGTATGGT
ACCGCCAAAA AGTCTGACCT TACCGGCGCT GTCACCAGCG TGAAGGAGGC TCAGCTTCAG
GAACGGCCTA CATCCTCATT GAACCAGGCC CTGTCAGGTC GCATGCCCGG CGTGCAGGTC
AACACCAACT CGGGACGACC CGGCGGTCGG ACCACCGTCC GTATCCGGGG CTTCAGCTCC
ATCAACTCCT CCAACAACCC CCTCTACGTC GTTGATGGCG TCATGCTTCC CCAAGGTACC
GGCGACCAGT TCAGTAACCC AATCGATTAC ATCAACCCCA ACGACATCGT TAACGTAGAG
GTCCTGAAAG ATGCCTCTTC GACGGCCATC TACGGAGCAC GCGGTGCCAA CGGCGTTATT
CTGGTCTCCA CTAGAAAGGG GAAGGCCGGT GAAAGCCGGG TTACCTACGA CGGTCAGTTC
AGCGTTAACA CCATCGGACC CAACAAGCCA AAGGTGCTCA ACGCCAAGGA GTACCTGGCT
ACCGAAGACC TCGCCTATGC CAACATGGCC AAGTATGACC CCGTCGGCTG GGCCGCAGGT
AAGTGGTCTT ACCTGGACCC GATAGCCCGG CGCAAAGCCT TCAGCGCGGC TCACCCTGGT
GTGTTTGATG CCAACCTGAA CCCACTCTAC GACACCGACT GGTTCAAGGA GTCGGCTCAG
AACAAGCTTT CCCAGAACCA CCAGTTAGGT TTCAGTGGCG GTAACGAGCG CACCCAGTAC
TCCCTCTCGC TGAACTATCG CGACGATCAG GGTCTGATCA AGACCTCCTA CATGAAGCGT
TACTCGGGTC GTTTCTCGAT CGATGATCAG GTCAAGAGCT GGCTCAAGAT CGGTGGTACA
CTGAGTTATA ATAACCAGAC GGAAAACCTG GTGGACATCA ACGATGCGGT GGCCCGTCAG
ATCGTGGAGG ACTTCCCCTT CCTACCCGTG CGCTACCCGG ACACTGGCGT CTTCGCCGAG
AACCGGGACT ACCCCTATGC AGAAGGCACC ATGAGTTCGG TGCACCGCCT GATGGACCGT
AAGTACATCC AGAACACCCA GACCACTTTG GGTAGCCTCT TCACCAACAT CACGTTAGGC
AAAGGGCTGG AGATGCGTAC GGTATTGGGT GCCAACGTTC AGACGCAGGA GATCAACCAG
TCGCAAACCC GTACGCTTAA CATCGGCGGT AACGGTAACG CATCGACCAA CAACAATAAG
ACCTCGTTCT GGTCGCTGGA ACATTACCTG ACCTACAACA AACAGTTTGG TCAGGACCAC
TCCTTCACCG GACTGCTGGG TCTTTCGTGG CAGGAGACTA ACACCTTTGG CATCGGTGCC
AGTGTGAGCG GTTTTGCCAC CGACTACTTT GGCTTCAACA ACCTGGGTGC TGGTGCTACC
AACCCATCGG TGAGTTCAAG CGCATCACGG TTTGCCTTTA ACTCCTACTT CGGTCGGATC
AACTACGGCT ACAAGAACAA GTACCTCTTC ACGGCTACCG GCCGGGCCGA TGGCTCCTCG
AAGTTCGGAG AGAATTACAA GTTTGCCTTC TTCCCCTCGG CGGCTCTGGC CTGGCGGGTA
TCGGAAGAAG ACTTCCTGAA GGGCAATCCC GTTATCTCGA ATTTGAAGGT CCGCGCCAGC
TACGGCTTGA CGGGTAACTC TGAAATTCCA CCGTATCAGT CACTGTCGTT GCTTAGCTCG
AACTATTCGA CGATCTACAA CGACGGCCGC GTTGGTGGTA CGGGTATCAG CCGTTTGGCT
AACCCCGACC TGCGCTGGGA AAAAACCGCT CAGACTGATG TAGGTCTGGA AGTTAGCTTC
CTCAAAGGAC GCATCTCGCT GGAAGCCGAC TACTACTACC GTCTGACAAC CGACATGCTC
CTGGATGCCC CCGTACCACA ATCGAGCGGC TATGCAACCA TCCGGCGTAA CGTAGGCTCG
ATGGAGAACA AAGGCTTCGA GTTCGGTTTG AACACGGTCA ACATCAACCG GGGTACTTTC
AGCTGGAATA CAAACTTCAA CATCTCGTTG AACCGCAACA AAGTCCTCTC CCTGGCTACT
CCATCCGATA TTTTTGGGGT AGGTGGTCCT AACTTCACCA ACCAGACGAA TATCATTCGT
ATTGGTGAAT CAGTAGGTTC GTTCTGGGGT CTGACCCGCG TGGGTGTATG GAGTGAAGCG
GAGCGGGAAG AAGCGGCCAA GTTCACCAGC TACCGCAACG GTTTGACCAT TCTGCCCGGC
GACATCAAGT ATCTCGACGT AAACGGCGAC AAGGCCATCA CCGATGCTGA CCGCAGCATC
ATTGGCAACG GTAGTCCTAA AGGCTGGGGT GCCATGACCA ACAACATTCG TCTGGGCAAC
TTCGATGCCA CCCTGGAACT TCAGTACATG TTTGGTAACG ACGTCATGCT GATGAACTTA
CACCCCAGTG AAGACCGGCA GGCTCTGGCC AACAGCTACT CGTCGGTGCT CAACGCCTGG
ACGCCAACCA ATCAGGGTAG CCAGATTGCT CAGGTACGCG ACACACGGGC GGGCTACGTA
ACCAACGTCG ACAGCCACTG GATCAAGAAT GGTTCGTTCC TGCGGGGTCG TAACCTGCTA
TTCGGTTACA CCTTCCCGGT TGAGATGACT AACAAGCTTA AGATGAACCG TCTGCGGATG
TATGTGTCGG CTCAGAACTT CTTCCTGTCA GTTGAAGACC CCATCGTAGG TGATCCGGAA
GTAACGCCCA CCAACCAGGG CTCAGGCAGC AGTGCCTTCT CACAAGGTCA AATCTGGCAT
AACTACCCCA AACCAACCAC GTACATGCTG GGCCTCCAGA TTGGCTTGTA A
 
Protein sequence
MKKHLPPQSL LGRLVSVVIT QLLLTAMCVN FTYAKVPLAL KTVADQRAIT ADRTLTGRVT 
DEKDEALPGV SVILKGTQRG TVTDADGRYK VDVPTGGATL VFSFVGYVPQ EVRVGNQTSL
NISLKADSKV LDEIVVIGYG TAKKSDLTGA VTSVKEAQLQ ERPTSSLNQA LSGRMPGVQV
NTNSGRPGGR TTVRIRGFSS INSSNNPLYV VDGVMLPQGT GDQFSNPIDY INPNDIVNVE
VLKDASSTAI YGARGANGVI LVSTRKGKAG ESRVTYDGQF SVNTIGPNKP KVLNAKEYLA
TEDLAYANMA KYDPVGWAAG KWSYLDPIAR RKAFSAAHPG VFDANLNPLY DTDWFKESAQ
NKLSQNHQLG FSGGNERTQY SLSLNYRDDQ GLIKTSYMKR YSGRFSIDDQ VKSWLKIGGT
LSYNNQTENL VDINDAVARQ IVEDFPFLPV RYPDTGVFAE NRDYPYAEGT MSSVHRLMDR
KYIQNTQTTL GSLFTNITLG KGLEMRTVLG ANVQTQEINQ SQTRTLNIGG NGNASTNNNK
TSFWSLEHYL TYNKQFGQDH SFTGLLGLSW QETNTFGIGA SVSGFATDYF GFNNLGAGAT
NPSVSSSASR FAFNSYFGRI NYGYKNKYLF TATGRADGSS KFGENYKFAF FPSAALAWRV
SEEDFLKGNP VISNLKVRAS YGLTGNSEIP PYQSLSLLSS NYSTIYNDGR VGGTGISRLA
NPDLRWEKTA QTDVGLEVSF LKGRISLEAD YYYRLTTDML LDAPVPQSSG YATIRRNVGS
MENKGFEFGL NTVNINRGTF SWNTNFNISL NRNKVLSLAT PSDIFGVGGP NFTNQTNIIR
IGESVGSFWG LTRVGVWSEA EREEAAKFTS YRNGLTILPG DIKYLDVNGD KAITDADRSI
IGNGSPKGWG AMTNNIRLGN FDATLELQYM FGNDVMLMNL HPSEDRQALA NSYSSVLNAW
TPTNQGSQIA QVRDTRAGYV TNVDSHWIKN GSFLRGRNLL FGYTFPVEMT NKLKMNRLRM
YVSAQNFFLS VEDPIVGDPE VTPTNQGSGS SAFSQGQIWH NYPKPTTYML GLQIGL
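
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way such a block-formatted gene sequence could be joined, translated, and checked for GC content. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Assumption: table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text):
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listed above.
first_line = "ATGAAAAAAC ACTTACCTCC TCAAAGCCTG CTGGGCAGGT TAGTCAGCGT AGTAATTACT"
dna = clean(first_line)
print(translate(dna))  # prints: MKKHLPPQSLLGRLVSVVIT
```

The translated fragment matches the first twenty residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.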