Gene Slin_4801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4801 
Symbol 
ID8728565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5855033 
End bp5858257 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003389578 
Protein GI284039648 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGA TAGCCTTTTT GTTTTATAGC TCACTATGGG CCCAGACGCC CATTCAGGGA 
ACCGTGCTCG ATGCCAAAAA AGAGCCACTC GTGGGCATAA ACGTATTGGT AAGAGGAACT
ACGCGCGGAA CGGTTACAGA CGTGAACGGC AAGTTTACCG TCAATGCCGA CCCGGCCGCT
ACGCTGATTT TCTCGGGCGT TGGGTTTGTT CGTCAGGAAG TGGCCGTTGG AAGTCGGACA
GATGTGTTGG TAACGCTGGC GGAAGATAAT GCCGTACTGA GCGAAGTGGT TGTTACAGCA
CTGGGTATTA AACGGGATAA GCGTTCGCTG GGGTATTCCA TTCAGGAACT GAACGGGCAA
GATATTGCAA CAGCTAAAGA GGCCAATGTG GCTACGAGCC TGGCCGGGAA AATGGCCGGT
GTGCAGGTGA CCCGCTCGGC CAATGGAGCC GGTGGCTCGT CGCGGGTGAT TATCCGGGGC
GCTAACTCGC TCGTGGGAAA CAGCCAGCCG CTATATGTGA TCGACGGTAT TCCGATGGAC
AACCAGAACC CAAGGGCTCC CGGCAGCTCG GGCGGCATCG ACTACGGCGA CGGTATTTCG
AACATCAATT CCGAAGACAT TGAGACGATC TCTGTCCTGA AAGGCCCTAA CGCGGCTGCG
CTATACGGGC AGCGGGGCAG TAACGGCGTA GTGCTGATCA CGACCAAGTC GGGTAAGAAC
CGCAAAGGGA TCGGGGTTAA ATACGGCATT GACTACTCGC TGGGTGATGC GCTGGTACTG
CCCGATTTCC AGGACGAATA CGGGCAGGGT CTGGATGGTA CGTTTACCAA CTTCCGTGGA
AATGACGGTA AAATCTATAC TTGGGCAGCT GCTCAGGCCG CCGGTATTCA GGGGGTTCCC
AAGATGAGCG GTGGCCGTGA CCGATTTACC CGCTCAAGCT GGGGACCGCG TATGGAGGGG
CAACCTTATG AAGACCAATG GGGCAACTTG CTGAATCTGA CCCCGCAACC CAATACCTTC
CAGAAGTTTT TCAATACCGA AAAACAGATG GTCAACAACC TGAGTCTGGA GGGAGGTAAC
GATGCCGTGA ACTACCGGGT GGCCTATTCC AACACGAATA TCAACGGCTA TGTACCCACC
AATACCCTTA ACCGGAATAA CATCAGCTTA CGGACGGTGG CTAAAGTTAC CTCGAAGTTA
GAGGCCGATG TGAAGGTTAA TTACATTGCT CAGCAGGGCG TAAACCGGCC AACCGTTTCC
GACGCAGCCG ATAACCCGGC CTACATCTTT ATCAGTCAGC CCCGGAGTAT GCCAATGGAT
ATTCTGGCCA ATTCGGCCTG GACGGCGGCT GATATTTCCA AACAACTCGG CTACGGTACA
ACGCCCTTCG TAGGCCTCGA AAAAACCTAC GCTACCAACT CGTCAACGGC GAACCCCTAC
TGGACAATGT CCCGAACCCG CAACTCGGAT GAACGCCAGC GAATTATCGG ACTGGTCAGA
CTGAGTTATC AGTTCAACGA CTGGATCCGG CTGACGGCCC GTACCGGTAC GGATTTCTAT
ACCGATCAGC GATTCCGCTA CCGCGACAAG GGTACCTACG TGACGGCTAA TAAAAACGGG
GATATTACCG AAGAGGTGAC CCGCACCCGC GAAGATAACA GCGATGTACT GCTGTCTCTT
ACGCCCAAGG TTTCGGACGA CATTTCGTTC TCGTTCAACC TGGGCGCTAA CCACCAGCGT
TATTACTCGC GCACAACGGG CAATACAGGT AATGAATTTA TTGCGCCTAA TCTGTTCATT
ATCAATAATA CGCTAACCAA TTCGTATGTC TTCGGCCTGA CGGAATCGTC CATCAATTCG
GTGTATGGGT CGGGGTCGGT TGGGTACAAG GAAATGGCGT TCATCGATTT CTCGGCCCGG
AATGACTGGT CGTCGACGCT GTCGCCCAAA AACAACTCGT TCTTTTACCC GGCCATTAGC
GGCAGCCTTA TTCTGACGGA TGCGCTGCGG TTGCAAAGTC CGACGCTGAG TTTCGTGAAG
GTAAGAGCTT CCTGGGCACA GGCGGGTAGC TCGGGCAGCC CGTACCAGCT CAACGGAAAT
TACTCGCTGG ACCAGTACAC CCAGGGCGGC ATTCCACTGG CTTCGTTTGC GTCGACCATT
CCCGATCCAA ACCTGAAAAA TGAGCTGACT ACGTCCAATG AGTTTGGGCT GGAAGCGCGG
CTGTTCAAAA ATCGGGTGGG GGTAACGGTT GCCTATTACA ATGCCAGCAC CCGAAACCAG
ATTCTGAACG TACCGCTGCC GCCGTCGAGC ACCTTCACTT CCCGACTGAT CAATGCGGGT
GAAATTCGTA ACCACGGCAT CGAACTGTCG GTCAATGCCA CGCCCGTTAA GCTGGCTTCC
GGTTTCTCCT GGGATGCCAC ATTGAACTAC TCGCATAACC GCAACGAAGT GGTTTCGCTG
GCGGAAGGGG TTTCGACCTA CATACTTGGC AGCGACCGGG GCGTGCAGGT TATAGCCACA
CCCGGCAAGC CGTTTGGTAC GATTCTGGGC AACGGGTTTC AGTGGCTTCG GGATGGCTCC
GGAAATCGGA TCATTGACCC CGCCACGGGC CTTCCGGTCA AAACAAATTC CAAGATCCTA
TACGAAATGG GTAACGCACT GCCGAAGTGG ATTGGTGGTT TCAACAACGT ATTCCGGTAC
AAAGGTCTTA CTCTATCCGG TCTGATCGAC GTTAGTCAGG GCGGCAAAGT ATACTCGCAA
AGCTTGCGCG AAGAACTAGT GTACGGCACG ATCAAAAAGA CGCTGCCGGG CCGCGATGGA
AGTTACGTTG CCGAAGGCGT TGTCGGTTCG AAATCGGCCG ATGGCACCTG GACCGGCACC
GGGCAGGCGA ATACCAAAAC GGTACGGGCG CAGGACTATT GGAACGTCGT GGCACCGGAC
AAAGACAACG TGGTAGCGGA AGAGATGCTG AACGATGCCA GCTACGTTAT CCTGCGGGAA
ATGACGCTCA ATTACAGTTT GCCGGCTAAG CTGGTGAGCC ATACGCCGTT CCGCAATATC
CGGGCTGGTG TGTATGGCCG GAATCTATTT TACTTACAAC GCAAGACAGA GGGCTTTGCA
CCGGAAGCCT CGGCATTCAA CGTCAACAAC TCGTCGCTGG GACTCGAATC GACCGCACTG
CCGTTGCTGC GGTATGTTGG GGTTAGCCTG AATGTAGAAC TGTAA
 
Protein sequence
MTMIAFLFYS SLWAQTPIQG TVLDAKKEPL VGINVLVRGT TRGTVTDVNG KFTVNADPAA 
TLIFSGVGFV RQEVAVGSRT DVLVTLAEDN AVLSEVVVTA LGIKRDKRSL GYSIQELNGQ
DIATAKEANV ATSLAGKMAG VQVTRSANGA GGSSRVIIRG ANSLVGNSQP LYVIDGIPMD
NQNPRAPGSS GGIDYGDGIS NINSEDIETI SVLKGPNAAA LYGQRGSNGV VLITTKSGKN
RKGIGVKYGI DYSLGDALVL PDFQDEYGQG LDGTFTNFRG NDGKIYTWAA AQAAGIQGVP
KMSGGRDRFT RSSWGPRMEG QPYEDQWGNL LNLTPQPNTF QKFFNTEKQM VNNLSLEGGN
DAVNYRVAYS NTNINGYVPT NTLNRNNISL RTVAKVTSKL EADVKVNYIA QQGVNRPTVS
DAADNPAYIF ISQPRSMPMD ILANSAWTAA DISKQLGYGT TPFVGLEKTY ATNSSTANPY
WTMSRTRNSD ERQRIIGLVR LSYQFNDWIR LTARTGTDFY TDQRFRYRDK GTYVTANKNG
DITEEVTRTR EDNSDVLLSL TPKVSDDISF SFNLGANHQR YYSRTTGNTG NEFIAPNLFI
INNTLTNSYV FGLTESSINS VYGSGSVGYK EMAFIDFSAR NDWSSTLSPK NNSFFYPAIS
GSLILTDALR LQSPTLSFVK VRASWAQAGS SGSPYQLNGN YSLDQYTQGG IPLASFASTI
PDPNLKNELT TSNEFGLEAR LFKNRVGVTV AYYNASTRNQ ILNVPLPPSS TFTSRLINAG
EIRNHGIELS VNATPVKLAS GFSWDATLNY SHNRNEVVSL AEGVSTYILG SDRGVQVIAT
PGKPFGTILG NGFQWLRDGS GNRIIDPATG LPVKTNSKIL YEMGNALPKW IGGFNNVFRY
KGLTLSGLID VSQGGKVYSQ SLREELVYGT IKKTLPGRDG SYVAEGVVGS KSADGTWTGT
GQANTKTVRA QDYWNVVAPD KDNVVAEEML NDASYVILRE MTLNYSLPAK LVSHTPFRNI
RAGVYGRNLF YLQRKTEGFA PEASAFNVNN SSLGLESTAL PLLRYVGVSL NVEL