Gene Slin_0106 details


Gene Information

Locus tag: Slin_0106
Symbol:
ID: 8723834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 125617
End bp: 129102
Gene Length: 3486 bp
Protein Length: 1161 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003384977
Protein GI: 284035047
COG category:
COG ID:
TIGRFAM ID:
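
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 125617
end_bp = 129102

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# A CDS is read in 3-nt codons; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp")      # 3486 bp
print(remainder)              # 0, so the CDS is a whole number of codons
print(protein_length, "aa")   # 1161 aa
```

Under these assumptions the result agrees with the listed Gene Length (3486 bp) and Protein Length (1161 aa).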


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.813272
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAT CGTTACCGCT CCTCCTGACA ACCGGCTGGT TGATTAGCGT GGGCTATGGG 
CATACCGTCG CTCAAACGCT GGCATTTGCC CGCCCTCCGC AGCATACCGA TAACAAACAA
CCCGGCAAAG GCACGGTTGG CATCGGGGCC AGTCAGCCGC TTAAGCAAAC GCTATTGCAG
CTAAAAGAGC ATTATGGAGT CGATATTCTG TTCGAAGAAT CGGTCATCTC GCGGCACCTG
AGCCCGCTCG AACCGCTCAA CTTCAGTGCC CGGCTCGAAA CAAACCTCGA TTTGCTTCTT
AAACCGTATG GACTGCGGTT CAAGAAGCTG CGGTCGGGGG CGTACCTGAT TCTGCCCCCT
AAAAACGGCA ATCGCGCAAT TATCAACCTG CCTGCGCCAA TTCCCATGAC GACCGCCGGA
ACAGGTCCTG TTTCACTGGC CGGTCTTGCA AACCTGGCAA CGGTACAGCC GATGGAAACA
CCGCAGACCG ATGTACGGAT AACCGGCCGC GTAACGGGCG AAACGGGAGA GGATCTGCCG
GGCGTAAGTG TCGTGGTGAA AGGGTCGTCG CGCGGTACAA CGACCGATGC CCAGGGGCGT
TATCAACTGA GCATCCCAAC GGATGCGTCG GCCGTTACGC TGGTGTTCAG CTTTGTCGGC
TACGTCAGCC AGGAGCGGGC TGTTCGGAAC CAGACAATTA TTAATGTGCA GTTACTGCCT
GATACAAAAT CACTCAACGA AGTGGTAGTC GTGGGCTACG GGCAAGTGAA GAAAAGCGAC
CTGACGGGCG CCGTATCGAC CGTACCCGTC GACGAAATTC GGAAGGTGGC CGTTACGTCT
TTAGATCAGG CCTTGCAGGG GCGGGCCGCC GGGGTGCAGA TCACCCAGAA CTCGGGCGCA
CCGGGTGGCA CGACAAGTAT CCGCATTCGG GGTGGCAACT CCATCCAGGG CGATAACGAG
CCGCTGTACG TCATCGACGG CATTCCATTC AAGAACGACG GGGCCAGCAG CGGGTCTAGT
TTCAACGTGC TGAGTACGCT CAACCCCAGC GACATCGAGT CAATTTCAGT CCTGAAAGAT
GCCTCGTCGA CGGCCATTTA CGGATCGCGC GGCTCGAATG GCGTGGTGAT CATCACCACC
AAACGGGGGA AGGCTGGTAA ATCGACCATC AACCTGGATA CCTATTACGG CATCCAGACG
GTTCGCCGGA AGTACCCCGT GCTCAATGGA CGCGAGTACG CCCAGTTGGT GAACGATGCC
AACACCAACG AAGGTCGTCC GGCGGTGTAC ACCCAGGATC AGGTCAACGC TTTTGGCGAA
GGAACCGACT GGCAGGATGA GATTTTCCGG CAGGCACCCA TTGCCAACTA CCAACTGTCG
ATGAGCGGTG GCGACGAGAA AACGCAGTAT GCCATTGCTG GTGGGTATTT CAAACAGGGC
GGCATCATCG TCAATTCCGA TTTTGACCGG TATTCGTTCC GTATCAATCT GGACCGCAAG
CTGACCAACA AAATAAAGAT CGGCAATAGC CTGACGGTCA ACCGGACGGT GACTAATCAG
GCGCGTTCGG ATGGGGACTT GGGTAGCGCG GGGTTGGTGA CCATCGCGGC CCTGCAATTT
CCGTCCATTC TGCCCGTCAC CAATCCCGAT GGCTCGTACC TGCTGACCAG CCCGGCGCTG
GCTTTCACCG CCGATAACCC CGTGGCACTG GCCCGCGACA ACAAAAACCG GACTACCGCC
TACCGAATCT TCGGCAATGT ATTCGGCGAC TACCAGATTA TCGACGGACT CAGTCTGCGG
GTGCTGCTGG GTATCGACGG AGTGTTGCAG AAGCAGGATT CGTATCTGCC CCGCTCGGTA
TCCAGCGGGC TGGCGCAGGG GGGAGCCGCT TCGATCTTCA ACGGGCAGTC GGTAACCTGG
CTGAACGAAA ACTTGCTGAC CTACACCCGT ACGTTCAACA CGGTGCATAA TGTAACGGCC
CTGCTCGGCT ATACCCAGCA GGCAAACCGC ACGGAAAATA GCCAGGCGCA GGCTCGTAAC
TTCGTAAACG ATAACCTGGG GTCGAGCAAC CTGGGTTCCG GTTCGGTGCC GCTCACGCCC
TCGTCGGGCA TTGGCACCTG GGGGCTACAG TCCTATCTGG CCCGGATCAA CTATGGCTAT
AAAGACAAAT ACCTGCTCAC GGCTTCGTTC CGGAGCGATG GTTCGTCGCG CTTCGGGGCT
AATAAGCAGT ACGGCTATTT CCCGTCGGCG GCTTTGGCCT GGCGGGTGTC GGAAGAAGCG
TTCCTGAAGA ATAATCCGGT AATCAACGAT CTCAAATTTC GGGTCACGTA TGGCGCAACG
GGTAACCAGG ATGGGGTGGG CAACTACCCG GCCTACTCGC TGCTGGGTAC CCAAAACTAC
GTGTTCGGCA ACACCGTGTC GACGGGCTTG GGTCCCAATC AGATTGCCAA CCCCGACCTG
TCGTGGGAAA CGACGACCCA GGCTGACATG GGTGTAGACG TTGGTCTGCT TAACAACCGC
ATCACCTTAA CGGCCGATCT GTACCTGAAA CGCACCAAAG ACCTGCTGCT CAACGTAACT
GTACCCAGCA CATCCGGCTT TTCCAGTGCC TTCAAAAACC TTGGCAAAGT GGAGAACAAA
GGGTTTGAAC TCAGCATCTC CTCACGAAAT ATTGACGGCG CCTTCAAGTG GAACACCGAC
CTGAACTTTG CTCTCAACCG GAACAAAGTG CTCGACATTG GCGGGGCTCC GCAGATTTTT
GCCGGTAGTG TAGCCAACAT TGGTCAGGGG CTGAATTCGG GGATTATCCG GGTAGGGGAG
CCGCTGGGGT CATTCTTCGG CTATGTTACG AACGGCCTTT ACCAAACCAC CGATGAATTG
GCCGCCCTCA CCGACCCGCA GGCCCGTAAA CCCGGCGACC GCAAGTACCT CGACCTCAAC
GGCGATAAGA AAATTGACGA CAACGACCGC ACCATTATCG GTCGGGCACA ACCCAAGTTC
CTGGGTGGAC TCAGCAACAC CTTTTCCTAC AAAGGCATTG AGCTGACGGC TTTTCTACAA
GGTGTTTATG GGAATAACAT CCTGAATGCC AATCGGTATG AACTCGAATA CCTTAACGCC
ACCACCAATC AGGACCGCGA TATGCTGAAC CGCTGGACAC CCACCAACAC CAATACCGAC
ATACCGAGGG CTTCAACCAC TCGTCCGGCC AACCGCGTGT CGACCCGGCA GATCGAGGAT
GGGTCTTACC TCCGGCTGAA GAACGTTCAA CTGGCCTACA ACCTCCCCGC ATCGGTGCTT
AAAACGCTGA AGATTCAGTC GCTGCGGGTG TACGTGACGG CGCAGAACTA CCTGACCTGG
ACCAGCTACT CGGGCTACGA CCCGGAGGTA AACCGGTTCG GGCAGGACAG CCGTAGCCAG
GGCTTCGATT ATGCCAGCTA CCCATCGGCC AAAACCATCC TGTTCGGCCT TAACGTAGGG
TTCTAA
 
Protein sequence
MKKSLPLLLT TGWLISVGYG HTVAQTLAFA RPPQHTDNKQ PGKGTVGIGA SQPLKQTLLQ 
LKEHYGVDIL FEESVISRHL SPLEPLNFSA RLETNLDLLL KPYGLRFKKL RSGAYLILPP
KNGNRAIINL PAPIPMTTAG TGPVSLAGLA NLATVQPMET PQTDVRITGR VTGETGEDLP
GVSVVVKGSS RGTTTDAQGR YQLSIPTDAS AVTLVFSFVG YVSQERAVRN QTIINVQLLP
DTKSLNEVVV VGYGQVKKSD LTGAVSTVPV DEIRKVAVTS LDQALQGRAA GVQITQNSGA
PGGTTSIRIR GGNSIQGDNE PLYVIDGIPF KNDGASSGSS FNVLSTLNPS DIESISVLKD
ASSTAIYGSR GSNGVVIITT KRGKAGKSTI NLDTYYGIQT VRRKYPVLNG REYAQLVNDA
NTNEGRPAVY TQDQVNAFGE GTDWQDEIFR QAPIANYQLS MSGGDEKTQY AIAGGYFKQG
GIIVNSDFDR YSFRINLDRK LTNKIKIGNS LTVNRTVTNQ ARSDGDLGSA GLVTIAALQF
PSILPVTNPD GSYLLTSPAL AFTADNPVAL ARDNKNRTTA YRIFGNVFGD YQIIDGLSLR
VLLGIDGVLQ KQDSYLPRSV SSGLAQGGAA SIFNGQSVTW LNENLLTYTR TFNTVHNVTA
LLGYTQQANR TENSQAQARN FVNDNLGSSN LGSGSVPLTP SSGIGTWGLQ SYLARINYGY
KDKYLLTASF RSDGSSRFGA NKQYGYFPSA ALAWRVSEEA FLKNNPVIND LKFRVTYGAT
GNQDGVGNYP AYSLLGTQNY VFGNTVSTGL GPNQIANPDL SWETTTQADM GVDVGLLNNR
ITLTADLYLK RTKDLLLNVT VPSTSGFSSA FKNLGKVENK GFELSISSRN IDGAFKWNTD
LNFALNRNKV LDIGGAPQIF AGSVANIGQG LNSGIIRVGE PLGSFFGYVT NGLYQTTDEL
AALTDPQARK PGDRKYLDLN GDKKIDDNDR TIIGRAQPKF LGGLSNTFSY KGIELTAFLQ
GVYGNNILNA NRYELEYLNA TTNQDRDMLN RWTPTNTNTD IPRASTTRPA NRVSTRQIED
GSYLRLKNVQ LAYNLPASVL KTLKIQSLRV YVTAQNYLTW TSYSGYDPEV NRFGQDSRSQ
GFDYASYPSA KTILFGLNVG F
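
The sequences above are printed in space-separated groups of 10 characters, so they need cleaning before any analysis. The following is a minimal sketch, not the method used to produce this record: it strips the grouping, computes a GC fraction, and translates a nucleotide string codon by codon. It relies on the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch ignores). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-character grouping (spaces, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codons in frame 1, stopping at the first stop codon.

    Only unambiguous A/C/G/T codons are handled; any other symbol raises KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and first two groups of the protein sequence.
first_dna_line = "ATGAAGAAAT CGTTACCGCT CCTCCTGACA ACCGGCTGGT TGATTAGCGT GGGCTATGGG"
first_protein_groups = "MKKSLPLLLT TGWLISVGYG"

print(translate(first_dna_line) == clean(first_protein_groups))  # True
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the same kind of comparison against the GC content field and the complete protein sequence.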