Gene Slin_5655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5655 
Symbol 
ID8729429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6887603 
End bp6891097 
Gene Length3495 bp 
Protein Length1164 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003390419 
Protein GI284040489 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.268168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.858728 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAC CTCTTACTGC CATGACTAAA CAATTAATTG CGCTTGGTTC GTTGCTACTG 
ATTGGAATAC TACCGACTCG GGCACAGCTC GTGGCTTCCA GCAATCGGTA TGCCTCCTAT
CAGAAGCAGC CAACAGATAC CCGCTTGATT ACGTTGAAAA GTGCGTTAAG TGAGCTTGAA
CAGCATTACA GCGTTTCGTT TATTTATCCA ACCAACCTGG TCGACACCAA AGTCTCGATG
GTCAACCAGC GTAATCGGAA TCTCGAAACG GAACTCGCCA GTTTACTCAA CAGCACCGGC
TTAACGTACC GAAAAGTCCA GCCCCGCTTT TATGCCATTG TCCCGGAAAA GGAAAAGAGC
ACACGGCTTT TTCGTAAAAT CGAACAGATC GAAACCAAAA CTACTACCGA TCCATCGACA
GAAGCCGTAA GGTTACCCGA ACGCACCATC GACAAACTGG AGCGTATTGG ATGGTCTACT
ACATCGGCCC AGCCCTTAGC CGACATTACG GGTAAAGTAC TGGACAAAAA CGGGCAGGGA
ATTCCGGGGG TTAGTGTTGT TATTAAAGGC ACCACACGAG GCACTACGAC CAATACAGAC
GGGGAGTACA ACCTCAATGC CCCGAATAAC GCGGTGCTGG TATTCAGTTT TGTCGGTTAT
GCCTCGCAGG AAGTTGCCGT AGCCAACCGA AGCCGCATTA ACATTACCCT GCTGGACGAT
GTAAAATCCA TCAGCGAAGT GGTTGTTGTC GGTTACGGTA CCCAGAAACG GGCGTCGGTA
ACGGGTGCTA TCTCGTCGGT ATCCGCCCAG GAGGTAACAC AACTGCCCGT ACCGAGCGTG
GAGCAGGCCA TTCAGGGACG CGTGCCGGGC GTAACGGTCG TGAGCAACGG CTCCCCCGGC
GAAACACCCA TTGTGCGTAT TCGGGGCATT GGTTCCATCA ACTACGCGGC TAACCCGCTG
TACGTTATCG ACGGTTTTCC GACGAGTGAC CTAAACAACT TCGACACCCG CGACATTGAG
TCGGTCGATG TATTGAAAGA TGCCTCCTCA GCCGCCATCT ATGGCTCACG GGCGGCCAAT
GGCGTTATCA TTGTCACGAC CAAAAAAGGT AGTGGAGATG GTAAACTGCA CGTCAACTAC
GACGGCTACG TAGGCACACA AAGTGCCTGG CGTCAACTCG ATCTACTCAA CCGGGACGAG
TACATCCAGT ATGGCACTGC CCTCCGGACC AATGCCGGTC AGCCGGTGCC AACGCGGTTC
AGCAACCTGA ATCAGCCCAT TTATACTGGG GCTTCTCAAA CGTACGCTCA AACGGATACC
GACTGGCAGA AGGCCGTATT CCGGGATGCG CCCATTACGC AGCATAGCGT TCAGTTATCG
GGCGGTAACG AAAAGTCGCG TTTCTATTCA TCTGTCGGCT ATTTCAATCA ACAGGGTATC
ATGATTGGCA CGGGGTACAA GCGCGGCAAC TTCCGCATCA ACTCCGACCA TGCCATTAGC
AAGCGCTTCT CATTCGGGCA AACGCTGACC ATCTCCTACG ACGATAAGCT TAACGAAGTA
AGCGCCGGAG GCCGGACACA GGTTCAGAAC ATGATCCGCA TGACACCTTA CATGCCCGTT
GAAGACCCTA CCCTGCTCGG CGGTTACCGT GGTCCCGACG GCTCCGACGG CTCTGACCCC
CAAAACCCGG TACGGGCCGC GCTACAGGAC CAGAGCAACA CCCAGCGCAT GAAAATCCTG
GGAAGTGCTT ATGTCGATGT CAAGATCATT GACGGGCTGA CCTACCGCCT GCGCGGAGGT
ATCGACTACG TAACGGCGCG TACCTTTTCC TTCCTGCCAA TCTACAGTGA GAGTTTCAAC
GCCCGGGCGC TGGCCAACAT TTCCGACGAC CGGCTGACGT ACGCTTCTCC CCTGATCTCC
AATCAGTTGA CCTACGAAAA AACATTTGGC AAACACAGCT TCAATGTCGT TGCCGTAGCC
GAACGGCAGG CGGGCAACAG CCTTCAGATA GTTGGAACAG GCCAGGCCGC TTCAAATACG
ATTCGCGAGT TGAGCGCCGT TATTAGTACG AGTGCGGGTT TAAACGGCAC TCGCTCTCAA
AACGTGCTGC TGTCGTATCT GGGTCGCCTT AATTACGAGT ACGCGGGTAA ATACCTTCTG
GGTGCCTCGT TCCGGCGGGA TGGCTCGTCG CGGTTTGCAC CGGGTAATAA GTGGGGTAAT
TTCCCATCGG TGTCAGCGGG CTGGCGGATT AGTGAAGAAG CTTTCCTGAA AAATGTACCG
GCTATTTCGG AATTGAAGGT TCGTGCCAGC TATGGTACGA TGGGCTTCAA TGGTATCGGC
GATTACTCGT GGCAGGTGGC CGTCTCTCAG AACACGAACG CCATCATCGG GGGCGACCGC
ACGCAGGGAA CCTACTTCGA CCGTCTGGGC AACACCGACC TGCGGTGGGA AGTTACGAAG
ATGAGTAACG TCGGGGTTGA TCTGGGCTTG TTCAGCAACA GCATAACCCT CTCGGCCGAA
GTTTATCAGC GAAATACGGA CGGCCTGATT CTGAACCAGC CCATTGCGCC ATCCATCGGT
TATTCGCAAT CGCCCATTGT CAACGTGGGC AGCATGCGGA ATACGGGGGT CGAAATGCAA
CTTGGCTATA ACAAAACGAA GGGCGCTCTT CGATTCAACG CATCGGGCAA CATCAGCTTT
ATTAACAATA AAGTGCTGAG CCTCGGGCCC ACCGTGTCGC CCCTGCTCAA TGGTGCCAAC
GCCGATTACG GCGGATTCGA CATTACCCGT ACCGAAGCCG GTCAGTCAAT TCAGTACTTC
TACGGCTGGA AAGTAGCGGG TATCTTCCAG AATGCCGACG AAATCAAATC AGCTCCAACG
CAGGCCAATG CCGCGCCCGG CGACCTTCGC TTTGTCGATG CAAACGGCGA TGGCAAGATC
GACGCCAGCG ACCGGGTAAA CCTGGGCAGC TTCCTGCCAA AATTCACTTA TGGACTGAAC
CTGTCGGCCA ATTACCGCGG CTTCGATCTA TCGATGTTCT TTCAGGGTGT TCAGGGCAAC
AAAATTTACA ACGGCGTAAA AGTTCTTGAG CAGGGTATGC TCCGGTTGTT CAATGCCGGA
ACGGATGTGC TGCGGGCCTG GACACCAACC AATACGGTGA CCGACGTACC CCGTGCCGTT
GATGGCGACC CCAATGGTAA CTCGCGTACG TCAGACCGGT TCATTGAGGA TGGCTCCTAC
TTACGGCTGA AAAACCTGAG CATCGGCTAT TCAGTACCAG CGGGTGCGTT ACAGGCATTT
TCGCGCGGTA CCCTGAGCCG GGCGCGGGTC TATGTAGCAT CGACCAATCT GCTGACGTTT
ACGAAATACA CTGGCTACGA CCCTGAAGTG GGCTCTCGCA CCAGCAATAC CAACCCAACG
CTGACCAATG GTATCGACTA TGGACAGTTT CCGCAGGCAC GCACCTTCAT GGTAGGTCTT
CAACTTGGTT TTTAA
 
Protein sequence
MTQPLTAMTK QLIALGSLLL IGILPTRAQL VASSNRYASY QKQPTDTRLI TLKSALSELE 
QHYSVSFIYP TNLVDTKVSM VNQRNRNLET ELASLLNSTG LTYRKVQPRF YAIVPEKEKS
TRLFRKIEQI ETKTTTDPST EAVRLPERTI DKLERIGWST TSAQPLADIT GKVLDKNGQG
IPGVSVVIKG TTRGTTTNTD GEYNLNAPNN AVLVFSFVGY ASQEVAVANR SRINITLLDD
VKSISEVVVV GYGTQKRASV TGAISSVSAQ EVTQLPVPSV EQAIQGRVPG VTVVSNGSPG
ETPIVRIRGI GSINYAANPL YVIDGFPTSD LNNFDTRDIE SVDVLKDASS AAIYGSRAAN
GVIIVTTKKG SGDGKLHVNY DGYVGTQSAW RQLDLLNRDE YIQYGTALRT NAGQPVPTRF
SNLNQPIYTG ASQTYAQTDT DWQKAVFRDA PITQHSVQLS GGNEKSRFYS SVGYFNQQGI
MIGTGYKRGN FRINSDHAIS KRFSFGQTLT ISYDDKLNEV SAGGRTQVQN MIRMTPYMPV
EDPTLLGGYR GPDGSDGSDP QNPVRAALQD QSNTQRMKIL GSAYVDVKII DGLTYRLRGG
IDYVTARTFS FLPIYSESFN ARALANISDD RLTYASPLIS NQLTYEKTFG KHSFNVVAVA
ERQAGNSLQI VGTGQAASNT IRELSAVIST SAGLNGTRSQ NVLLSYLGRL NYEYAGKYLL
GASFRRDGSS RFAPGNKWGN FPSVSAGWRI SEEAFLKNVP AISELKVRAS YGTMGFNGIG
DYSWQVAVSQ NTNAIIGGDR TQGTYFDRLG NTDLRWEVTK MSNVGVDLGL FSNSITLSAE
VYQRNTDGLI LNQPIAPSIG YSQSPIVNVG SMRNTGVEMQ LGYNKTKGAL RFNASGNISF
INNKVLSLGP TVSPLLNGAN ADYGGFDITR TEAGQSIQYF YGWKVAGIFQ NADEIKSAPT
QANAAPGDLR FVDANGDGKI DASDRVNLGS FLPKFTYGLN LSANYRGFDL SMFFQGVQGN
KIYNGVKVLE QGMLRLFNAG TDVLRAWTPT NTVTDVPRAV DGDPNGNSRT SDRFIEDGSY
LRLKNLSIGY SVPAGALQAF SRGTLSRARV YVASTNLLTF TKYTGYDPEV GSRTSNTNPT
LTNGIDYGQF PQARTFMVGL QLGF