Gene Slin_3036 details

Gene Information

Locus tag: Slin_3036
Symbol:
ID: 8726788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3680686
End bp: 3683787
Gene Length: 3102 bp
Protein Length: 1033 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003387846
Protein GI: 284037916
COG category:
COG ID:
TIGRFAM ID:
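
The start, end, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The Python sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon. Both assumptions match the numbers in this record but are not stated by it, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the final codon is the stop codon


length = gene_length(3680686, 3683787)
print(length)                  # 3102
print(protein_length(length))  # 1033
```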


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000276569
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.556141
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTCAA AATTACGTAG AGCCATTCCC GGGTTATTAC TTTTTCTGGG CATATGGATT 
TCCGCTGTTG CATCACCTGT ATTTGCTGTC GACAGTGAAA TAACGGGCCG GGTTTCAGAC
GAGAAAGGAA ATGACGTGGT TGGTGCCTCT GTCACCATCA AAGGTACCAA TCGGGGTACT
AATACAGATG CCAGTGGCAA GTATCGAATT GTCGTTCCAA ATGGAAGTGC CGTACTTGTA
TTTTCATACA TTGGCTACAC CAAGCAGGAG GTTACCGTTG GTAACCGCTC CGTTATTGAC
GTGAAGCTGG AGCCAGGTAG CGCTGTACTC GACGAGGTCG TGGTAACGGC TTTAGGGATA
TCGAAAGAAG CCCGTAAGGT TGGTTATGCG GTTACGACGG TTAGTAGTGA AGCATTCACA
AAAGCTCGCG AAACCAACGT TGGTAATTCG CTGGTTGGCC GGGTTGCCGG TTTGACCGTA
AAGGGTACCA ACGGTGGCCC CGGCAGCACA TCTAAAATCC TGTTGCGGGG TATGCCCAGT
ATTAATTCGG GTGGTGCGCC ATTGATCGTT ATTAATGGTG TACCCATGGA CAATACCCAA
CGGGGTAGCG CCGGCGAGTG GGGTGGTGCC GATGGTGGAG ACGGTATCGG TAACCTGAAC
CCGGATGACA TCGAAACAAT GACGGTGCTG AAGGGACAAT CTGCATCGGC CCTGTACGGT
GCCCGATCAT CTAATGGTGT TATTCTGATT ACAACCAAAC GCGGTAAGAA AGGTGATTAT
GCCATTGAAT ACAACGCAAA CCTGACAGCT GATAGTCCGA TTAACTTTAC TGATTTTCAG
TACGAGTACG GACAGGGAAC AGGTGGCGTA AAACCCACTA CCATTGCCGC TGCTCAGCAA
ACGGGTCGTC AGAGCTGGGG TGCTAAACTG GATGGCTCGC AAATCACCCA GTTTGATGGT
AAGCAATATG CTTACTCGGC TCAAAAAGAT AACATCAAAA ATTTCTACCG GACAGGCACC
AACTTCACCA ATACAGTTTC GGTCACTAAA GGGGGGGATA ACGGGTCATT CCGTTTGTCA
TTGTCTAACC TCGATACCAA GTCGATTCTA CCGAACAGTG GTCTGGGCCG TAAAACGTTT
AACCTGACGG CCGACCAGAA CATCACCTCG AAACTAAGCG TGAGCCTGCT GGCAAACTAC
ATCGACGAAA AGATTACGGC AAAGCCGCAG TTGAGTGATG GTCCAATGAA TGCCAACAAT
GGTTTATTCC TGGCAACGAA CATCGATCAG CGAATTCTGG CCCCGGGTTA TAATACCACA
ACCGGACGTG AGATTATTTT TAGTGATGAT GAGTACGTAA CGAACCCGTA CTTCGTAACC
AATCAGTACG TGAACGACGT AAGCCGCAAG CGATTGATTT CGATGATTGC GACCAAGTAT
CAGTTTGCCG ACTGGATTTA CGCGCAGGGA CGGGTTGGAT ACGATAACGG CAATGACCGG
ATTTTCCGGG TTACACCCTA CGGAACGGCT TATTCGCAGG ATGCCAAAGG TGGCCTGGAC
GAGCAGTCGA ACGCCCAAAC GACTGAATTG AACATCGACG GTTTGATTAG TGTTAGTAAG
GCCATTACGC CCGACTTCTC TATTGATGCT ATCGTGGGTG GTAACATCCG CAAAAATAAC
TATGAGAAAA TCGGCATCGG CGGTGGGCCA TTCGTTCTGC CTTACCTCTA TAGCTACAAT
AACGTTGTAA ACTTTAACCG GAGCTATGGC TTTTCCAAGT CTGAAGTTCA GTCGGCTTAT
TATAGCCTCG ACTTTAGTTA TAAAAGCTTC CTGAACATAA GCACAACGGG TCGCTACGAT
GCTTATTCGA AACTGCCCAG TACGGCACGA ACAATTTTTA CGCCGTCTGT AACGGGTGCT
TTCATTTTCT CTGAGTTTGT TAAAACACCC AGCCTGAGCT TTGGTAAACT ACGGGCGTCT
TATGCAGTTA CCAGTGGTGA ACCAGCAGAC GCCTATGGAA CTAGTGTTTA TTACGGGGTA
GGAAGTGCGC TGAATGGTGT TCCTACTGGT AATTTTAGTT CCAGCTTGCC CAACTTGTTC
CTCAAACCCT TTACCAAGAG CGAAGTTGAG GTTGGTTTAG AACTTAAGTT CTTCGGTAAC
CGGTTAGGAT TCGATCTGGC CTATTTTGAT CAGAAAACAC ATAACGAAAT CCTGCCAGCT
AACTACAGCC CGGCAACAGG GTATACGAGT GGGGTAGTGG CTACCGGTTC TACCCAGAAC
CGGGGTCTCG AAGTGCTGGT AACCGGCACG CCGGTGAAAA CGGCTAAATT GGCCTGGAAT
GTTTCGTTTA ACCTGACTTC GGTTAAAAAC AAAATCCTCC AGACCGATGC CAATAACAAT
CCGCTGGGTT TGGGCTCAAA CCGTGCTACA CTGGGGAATG CGACTACTGC GTTTGTTGTG
GGTGAGTCTG GTCCGCAGAT CCGCGCGTAT GATTACAAGT ATGCCTCGAA CGGACAAATC
ATTGTCGATG CATCCGGCCT GCCGGTTCGG GGTAACCTGA TCAATATGGG TACGGTATTA
CCAACGCTCT TCGGTGGTTT AAATAACGAG TTCTCGTTCG GTAATTTCAA CCTGGCGTTC
CTGGTCGATT ACAACTACGG TAACAAGATT CTTTCGGCTA CCGAAAACTA CGCCTACCGC
CGTGGCCTGC ATAAAGCGAC TTTGGTGGGC CGTGAAGGAG GTATCACCAC GGGTGTTGTA
GAGGGCGGTG CTGCCAATAC GGTTAGCGCC ACCGCTCAGA ATTACTACAC GGCACTGGCC
AACAACGTAA CCAAAATCAG TGTGGTCGAT GGCGATTTCA TCAAATTGCG GCAGCTGACA
TTTGGCTACA ACATACCTGC CAGCGTTTTG ACAAAAGTGC CTCTGATTCG TGCGGTTAAT
ATTTCTTTCG TGGCCCGGAA CCTGTTCTAT ATCATGAAGA AAACAACCAA TATTGATCCA
GAAGCTACGT TTGGCGCTAA CCTGCGTTAC GCTGGTATTG AAGGAACGAG CCTTCCATCA
AGTCGTAACT ACGGGGTTAA CCTAAACATC CGGTTCAAGT AA
 
Protein sequence
MKSKLRRAIP GLLLFLGIWI SAVASPVFAV DSEITGRVSD EKGNDVVGAS VTIKGTNRGT 
NTDASGKYRI VVPNGSAVLV FSYIGYTKQE VTVGNRSVID VKLEPGSAVL DEVVVTALGI
SKEARKVGYA VTTVSSEAFT KARETNVGNS LVGRVAGLTV KGTNGGPGST SKILLRGMPS
INSGGAPLIV INGVPMDNTQ RGSAGEWGGA DGGDGIGNLN PDDIETMTVL KGQSASALYG
ARSSNGVILI TTKRGKKGDY AIEYNANLTA DSPINFTDFQ YEYGQGTGGV KPTTIAAAQQ
TGRQSWGAKL DGSQITQFDG KQYAYSAQKD NIKNFYRTGT NFTNTVSVTK GGDNGSFRLS
LSNLDTKSIL PNSGLGRKTF NLTADQNITS KLSVSLLANY IDEKITAKPQ LSDGPMNANN
GLFLATNIDQ RILAPGYNTT TGREIIFSDD EYVTNPYFVT NQYVNDVSRK RLISMIATKY
QFADWIYAQG RVGYDNGNDR IFRVTPYGTA YSQDAKGGLD EQSNAQTTEL NIDGLISVSK
AITPDFSIDA IVGGNIRKNN YEKIGIGGGP FVLPYLYSYN NVVNFNRSYG FSKSEVQSAY
YSLDFSYKSF LNISTTGRYD AYSKLPSTAR TIFTPSVTGA FIFSEFVKTP SLSFGKLRAS
YAVTSGEPAD AYGTSVYYGV GSALNGVPTG NFSSSLPNLF LKPFTKSEVE VGLELKFFGN
RLGFDLAYFD QKTHNEILPA NYSPATGYTS GVVATGSTQN RGLEVLVTGT PVKTAKLAWN
VSFNLTSVKN KILQTDANNN PLGLGSNRAT LGNATTAFVV GESGPQIRAY DYKYASNGQI
IVDASGLPVR GNLINMGTVL PTLFGGLNNE FSFGNFNLAF LVDYNYGNKI LSATENYAYR
RGLHKATLVG REGGITTGVV EGGAANTVSA TAQNYYTALA NNVTKISVVD GDFIKLRQLT
FGYNIPASVL TKVPLIRAVN ISFVARNLFY IMKKTTNIDP EATFGANLRY AGIEGTSLPS
SRNYGVNLNI RFK
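
Both sequences above are printed in space-separated blocks of ten characters. The Python sketch below shows one way to strip that layout, compute GC content, and translate the coding sequence. It uses the standard amino-acid assignments, which translation table 11 shares for sense codons (table 11 differs in which start codons it permits). The helper names are hypothetical, and only the first line of each sequence is used as input here.

```python
from itertools import product

# Standard codon table in TCAG order; the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq_text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and of the protein sequence in this record.
first_dna_line = "ATGAAGTCAA AATTACGTAG AGCCATTCCC GGGTTATTAC TTTTTCTGGG CATATGGATT"
first_protein_line = "MKSKLRRAIP GLLLFLGIWI"

print(translate(clean(first_dna_line)) == clean(first_protein_line))  # True
```

Passing the full gene sequence to `clean` and then `translate` should reproduce the protein sequence in the same way, since the gene starts with ATG and ends with the stop codon TAA.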