Gene Slin_5059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5059 
Symbol 
ID8728824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6169053 
End bp6172196 
Gene Length3144 bp 
Protein Length1047 aa 
Translation table11 
GC content49% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003389833 
Protein GI284039903 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.25287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACA ACTCCTACAA AATGAACCAC TTTAGTGGCT ACGTAGGTCA GGTTGGTGAA 
CGCCAACCCC AGTCATTCAA ACGGAAAGCG TTGGTTAGCT TTCTGTCGAT GCTGGTAATG
GTGCTACTGC TGAGCAATTC GGCAGCATGG GCACAAGGCC GTACTGTTAC GGGTAAAGTG
ACCGATCCGG CGGGTACAGC GTTGCCCGGT GTGAGTGTGC AGTTGAAAGG TACGCAGCGT
GGCACCAACA CCGACGCTGA TGGCAAGTAT TCGCTGGCCA ACGTGCCAGA CAATGCAACG
CTTGTTCTGA GCTTTATTGG TTATACCTCG CAGGAGGTTG TTGTAGGAAA CCGGTCAACG
GTAGATGTAA AGTTAGCCGA CGATACCAAA GCACTCGACG AAGTAGTCGT TGTGGGTTAT
GGTACCGCAA AACGGAAAGA CCTGACCGGC TCGGTTGTTC AGGTATCGGC TAAAGATTTC
AACGCGGGCG TAAACCCCAA CCCACTGCAG GCCATTCAGG GTAAAGTAGC CGGTCTGGTA
ATTACTTCGC CATCTGGTGA CCCCAATCAA CAACCAACCG TCCGTTTGCG CGGGTACACC
TCGCTGGCGG GTGGTTCTGA CCCACTGTAT GTGGTTGATG GTATGATTGG TGTGCCCATT
AGTACCATTT CCCCTTCCGA CATCGAATCG ATGGATGTAT TGAAAGATGC ATCGGCTTCT
GCTATTTATG GCTCACGCGC AGCTAACGGT GTTATTCTTG TAACCACCAA GCGTGGTAAA
GCGGGCAAAA CGACCGTAAC CTTTAACAAC TATGTTAGTG CCGCCGTTAT TTCAAGACGT
CTTGATCTGC TTGACGGCCC GGGCTATCGC GATGCCGTAA CGTCCATCAA AGGCTCATCG
GCGCTGGGCG ATTTACAGCG TTTCCCAGCG GGTAATTACA ATACCGACTG GATTAAAGAA
ATCACTCGTA CCGCCATTGT GAATAACCAC GATTTGGCTA TTGCGGGTGG TTCGCCAACG
TTTAGCTACC GTGGTTCGTT AAATTACATC AACAACCAGG GTATTGTTAA GAAAACGGGT
TTCGATCGTA TCACAGGGCG TATTAACCTC GATCAGAAAG CGTTGGATAA CCGTCTGAAT
ATCCAATATA ACCTTTCTTA TTCTGAGACA AATAAGGATT TCCCTGACAA TGGGAGCCAG
GGTGCACCAA GTGTAGGTTC CTTGTTGAAT CGTGCCACTA CCTTTCTGCC AACACTGCCT
ATTCGTAATG CGGACGGTTC TTACTACGAA GTAGGAGGGA GCTTCGACCT GTTCAACCCA
GTTGCTATGC TCAATAATTC AGTGAATACG GCTGTGCAGC GTTATTTACA GGCGGGCGCT
AACCTTCGAT ACGAAATTCT GGATGGCCTG ACCTTAGGCG TCAGCGGGCA AATTCAGCGT
GACAATTCGA CAACAAGCTT TTACACCAAT CCTATTATTA AGGCGTTTTC TGCTAACAAT
GGCCGTGCTG GGAGGGGCTT TTCCGAATCA AACAGCCGAC TACTGGAAAC GACGCTTAAT
TATGTAAAAG GCTTTGGTAC TCAAAATAGT AACTATTCCT TATTAGCTGG TTATTCGTAT
CAGCAGTTCG ACAACGACGG CTTCAACGCG TCCAACACTG GTTACTTAAC TAGTGAAATT
AACTACAATA ACCTTAATTT AGGATCAGGT ACAATCATTC TGCCAGGAAG TGGTTATGTT
GGCTCCTACC GTAACCAGTC GAAACTCATC TCGTTCTTCG GACGGGCCAG TGTTAACCTG
AACGACAAGT ACAATGTAAC CGCCACGATC CGCCGGGATG GCAGTTCTAA GTTCGGGGTC
AACAATAAAT GGGGTATCTT CCCATCCATT GGTGCAGGCT GGACAATCAG CAACGAGTCG
TTCTTCCCAA AGGGTAATTC TCTTAATTAC CTGAAATTGC GGGCCGGTTG GGGACAAACG
GGTAATTCGG AAGGTATTGC TGCCTATAAC TCGATTCAAT TGTATGGTCA GAGTGGTAGC
TATTACGATG GAACGATCAG TGACTTCCTG CCAGGCTATG GTATTACCCA GAATGCTAAT
CCGAACCTGA AATGGGAAGT ACTGACTCAG TCGAACGTAG GGCTGGATTT CCAACTGCTG
GGTGGGCGCT TCTCGGGTAC GCTTGAGTAT TACAACAAGC TGACCAAAGA CATGCTGTAC
CCCTACTCAG TACCGGCCGA TGGTAAAAAA TACTTTACCA ACATCATTCT GGCCAACGTA
GGTTCGATGC GGAACAGCGG AGTTGAATTA TCGTTTGGTG GCGACGTAAT CCAGAAAGGG
TCTTTCTCCT GGAATGCCCG TGTGGTGGGA GCTTACAACA AAAACACGAT TGTTAACCTG
AAAAACGATG AGTTTGATTC AGGTACCGTT CGTTTCAATG CCTTCGGTGG CCGTGGCTTG
TCGGACGTAT TTGCTTCGTT CATCTGGCCG GGTCAGTCGC TGGGTCAGTT CAACAACGTA
CCCACCTTCA CCGGTGCTTA CTCGGCAGAT GGCCAGCCGC TGCTGAAAGC AGCTTCGGGC
GATACGCCGG TAACAGACGT ATCGAAAGCT GATGCAGCCG CTGCTTTTGC TGCGGGTAGT
CCGTTGAAGC AGGGTAACCC ACAGCCGTTC CTGAACGCTT CGTTCATCAA CACGTTCCGT
TACAAAGGTT TTGATTTCTA CTTCCAACTG CGGGGAACCT TTGGCAACAG CATTCTGAAC
AACCTGCGCT CGAATCTAAT GATTCCTGGC TCAATTCTGG AAACCAACAT GCTGAAAGAC
GTAACGACAC TGCCCAAAAA CTATGGTGTG AACGTTCTGT CGACCAACTG GCTCGAAAAA
GGCTCATTTG TTCGGTTCGA TAACTGGCAA ATTGGTTACA GTATCCCACT GCCAGCCAGC
AAGTACATCT CGAATGCCCG CGTTTATGTA GGTGGTAATA ACCTGTTCAT CATTACCAAA
TACAAAGGTA TCGATCCGGA ATTGCAGGTT AAAGGTGATC TGCCTAACAG CCTCACCCAG
GCACCCAACT CGGTTGGCAT TGATGCCAGT GGTATTTATC CCAAAACGCG CACATTCCAG
TTAGGCCTTA ACCTGACGTT CTAG
 
Protein sequence
MMNNSYKMNH FSGYVGQVGE RQPQSFKRKA LVSFLSMLVM VLLLSNSAAW AQGRTVTGKV 
TDPAGTALPG VSVQLKGTQR GTNTDADGKY SLANVPDNAT LVLSFIGYTS QEVVVGNRST
VDVKLADDTK ALDEVVVVGY GTAKRKDLTG SVVQVSAKDF NAGVNPNPLQ AIQGKVAGLV
ITSPSGDPNQ QPTVRLRGYT SLAGGSDPLY VVDGMIGVPI STISPSDIES MDVLKDASAS
AIYGSRAANG VILVTTKRGK AGKTTVTFNN YVSAAVISRR LDLLDGPGYR DAVTSIKGSS
ALGDLQRFPA GNYNTDWIKE ITRTAIVNNH DLAIAGGSPT FSYRGSLNYI NNQGIVKKTG
FDRITGRINL DQKALDNRLN IQYNLSYSET NKDFPDNGSQ GAPSVGSLLN RATTFLPTLP
IRNADGSYYE VGGSFDLFNP VAMLNNSVNT AVQRYLQAGA NLRYEILDGL TLGVSGQIQR
DNSTTSFYTN PIIKAFSANN GRAGRGFSES NSRLLETTLN YVKGFGTQNS NYSLLAGYSY
QQFDNDGFNA SNTGYLTSEI NYNNLNLGSG TIILPGSGYV GSYRNQSKLI SFFGRASVNL
NDKYNVTATI RRDGSSKFGV NNKWGIFPSI GAGWTISNES FFPKGNSLNY LKLRAGWGQT
GNSEGIAAYN SIQLYGQSGS YYDGTISDFL PGYGITQNAN PNLKWEVLTQ SNVGLDFQLL
GGRFSGTLEY YNKLTKDMLY PYSVPADGKK YFTNIILANV GSMRNSGVEL SFGGDVIQKG
SFSWNARVVG AYNKNTIVNL KNDEFDSGTV RFNAFGGRGL SDVFASFIWP GQSLGQFNNV
PTFTGAYSAD GQPLLKAASG DTPVTDVSKA DAAAAFAAGS PLKQGNPQPF LNASFINTFR
YKGFDFYFQL RGTFGNSILN NLRSNLMIPG SILETNMLKD VTTLPKNYGV NVLSTNWLEK
GSFVRFDNWQ IGYSIPLPAS KYISNARVYV GGNNLFIITK YKGIDPELQV KGDLPNSLTQ
APNSVGIDAS GIYPKTRTFQ LGLNLTF