Gene Slin_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1938 
Symbol 
ID8725675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2345762 
End bp2348167 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content55% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003386782 
Protein GI284036852 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.17436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGTT TGAGCAGAAT AAGTTTATTG ATTGGGTTGT GGGTAGCCGC AGGGCTGGGG 
TGTCTGGCGC AGAGCAGCAC CTGTACCAGC GTGCTGAAAG GGCAGATTCT GGGGCAGGAA
AACCGCGAGC CACTCGTTGG AGCGACTCTT TACGTTCGGG AGTTGAAAAC CGGTGCCGTT
GCTGATTCAG CGGGTTATTT TCAACTCGGG CAGCTGTGTC CGGGAAATTA TACCATCGAC
TTTCAGTTTG TTGGTTATAA AGTCACGACA CAGTCAATCG TGATCAGGCA GGAACCGCTG
GTTCGCTTAA ATCCGGTTTT GCTGGTGCCC GACAATCAAA CCCTGCAGGA GGTTGTCGTA
ACGGAACACC GGTCCGAAGC CCAGCAGCTG CTTCAAACAC AGGTGAGCCT GTCGGGGGCG
GCTCTCGACC AAACCAGGGG TCAATCGCTG GGCGAGAGTT TGAAGGCACT TACCGGCTTA
TACGCCATCC AATCCGGCCC CAGTATTTCA AAGCCGGTCA TTCACGGGCT TTACAGCAAC
CGAATTATAA TTCTCAACAA TGGCATCCGG CAGGAAGATC AGCAGTGGGG GAGTGAGCAC
GCGCCCCAGG TCGACCAGTT TCTGGCGTCC CGGCTGACGG TGATCAAAGG GGCGGCCAGT
ATCCGCTACG GGTCCGACGC TATTGGCGGG GTTATTTTGG TAGAGCCCAG CGCCATGCCC
ACCCGGCCGG GCATTCAGGC CGAAGTAAAT CTGGTTGGAG CAACGAACGG GGGCATGGGT
GTGGTATCAG GTCTGGTAGA AGGGGCATTC GATAAAAAAC TGGCCGGGTT AAGCTGGCGA
TTGCAGGGAA CCCTGAAACG GTCGGGCTAC GTCAAAACAC CGAACTACTA TCTCGAAAAT
ACCAGCTACC AAGAGAATAA CTTTTCCGGT GATCTGCATT ATGACCACAA GAATTTTGGT
GTCGAGTTGT TTTACAGTCA ATTCGATACG AAAGTGGGTC TGTTTACGGG TGCGCAGGTG
GGTAGTCTGG CCGATTTATA CACGGCCATT AGCCGACCTG AACCGCTGGT TCAGCCGGTT
TTTTCCTATG CATTGAACCG CCCTTATCAG GCCGTTCAGC ATGACTTGTT TAAAGGGCGG
GCCCATCTAC ATCTGCCAAA AGGGGGCACG CTGACCGCTA CCGTTTCCCG GCAGCAAAAC
ACCCGCCGGG AGTATGACTT TGTCTCGTTC AGCGGCATAA CGACGCCCGA ATTGTATCTG
AAATTAGTTA CCCATACCGC CGATCTGGTT TGGGAGCATG CGCCGATTAA AACGGAAGCC
GGAGGCCGTG CGGGGCAATG GTCGGGAAGT GTGGGTTTTA ACGGGATCAC GCAAGGCAAC
GTGCGGCAAT ATCTGTTCCT GATTCCTAAT TTCCGGAACT ACGGTGCCGG ATTATTTGCC
ATTGAACGCT ACGCAGTAGG ACGTCTGACC CTGGAGGGCG GGTTGCGGTA TGATTACCGC
TGGTTACGGG CTTACTTCCT CGATGAAGTT ACCAAACAAA TCTATTTTAC CACCCATAAC
TGGCAGAACG CCAACGGATC GCTGGGGGTA GCTTACCAGC TCCGGCCCGA CCTGACCCTG
ACCGGCAACG TTAGCACTGC CTGGCGGGCT CCTAATGTCG CGGATTTGTA TTCCAACGGC
CTGCACCAAA GCGCGGTAGC CTACGAACGG GGTAATCCCA ACCTGCGCCC GGAACAGGCT
GTCAACAGTA ACCTGGTGCT GGCCTATGCC GGGAAGCGGC TGAGTGGGGA AATTGGCGTT
TATAGCAACC GGATCGAAAA TTACATTTAC CTCAAGCCCG ATTCGGTACC CATTGTCCGG
CAGCGGGGTG CTTTCCCGTC GTATACCTAT AACCAGGTGC GGGCCACGTT TCGGGGAGTC
GATGCTACCC TGACCTATAA ATTTACCGAT CAGCTGGCGT TCACGACAAA AAACTCGCTG
CTGTTCGCCT ATAACCAGAC CGACCACGAC TACCTCGTTT TCATCCCCGC CAATCGGTCG
GATAATACGC TGCGTTACGA TTGGGACCGG TGGGGTAAAC TTGCCAAACC CTATGTGTCC
ATGACGGGCT TGTACGTATC CCGACAAAAC CGTTCTCCCT CCGTGACTAC CCGACAGGAG
AACGGGGCCG TTATTTTCAC CGGCGATTTT GCTGCGCCAC CACCCGCTTA TTTCCTGCTG
GGGGCCGAAG TGGGGTTCCA GACTCAGCTC GGCAAGCAGC CCCTTAGTGT CATCCTGAGC
GGCACCAACC TCGCCAATGT CGCTTACCGC GATTACCTGA ACCGGTTCCG GTATTTCGCC
GATGAGCCGG GACGCAACAT CATGCTCAAA GTGAAATTAC CACTGGCGTT TCCCAAAAGG
TCTTAA
 
Protein sequence
MGGLSRISLL IGLWVAAGLG CLAQSSTCTS VLKGQILGQE NREPLVGATL YVRELKTGAV 
ADSAGYFQLG QLCPGNYTID FQFVGYKVTT QSIVIRQEPL VRLNPVLLVP DNQTLQEVVV
TEHRSEAQQL LQTQVSLSGA ALDQTRGQSL GESLKALTGL YAIQSGPSIS KPVIHGLYSN
RIIILNNGIR QEDQQWGSEH APQVDQFLAS RLTVIKGAAS IRYGSDAIGG VILVEPSAMP
TRPGIQAEVN LVGATNGGMG VVSGLVEGAF DKKLAGLSWR LQGTLKRSGY VKTPNYYLEN
TSYQENNFSG DLHYDHKNFG VELFYSQFDT KVGLFTGAQV GSLADLYTAI SRPEPLVQPV
FSYALNRPYQ AVQHDLFKGR AHLHLPKGGT LTATVSRQQN TRREYDFVSF SGITTPELYL
KLVTHTADLV WEHAPIKTEA GGRAGQWSGS VGFNGITQGN VRQYLFLIPN FRNYGAGLFA
IERYAVGRLT LEGGLRYDYR WLRAYFLDEV TKQIYFTTHN WQNANGSLGV AYQLRPDLTL
TGNVSTAWRA PNVADLYSNG LHQSAVAYER GNPNLRPEQA VNSNLVLAYA GKRLSGEIGV
YSNRIENYIY LKPDSVPIVR QRGAFPSYTY NQVRATFRGV DATLTYKFTD QLAFTTKNSL
LFAYNQTDHD YLVFIPANRS DNTLRYDWDR WGKLAKPYVS MTGLYVSRQN RSPSVTTRQE
NGAVIFTGDF AAPPPAYFLL GAEVGFQTQL GKQPLSVILS GTNLANVAYR DYLNRFRYFA
DEPGRNIMLK VKLPLAFPKR S