Gene Slin_6636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6636 
Symbol 
ID8730422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp8069267 
End bp8072245 
Gene Length2979 bp 
Protein Length992 aa 
Translation table11 
GC content56% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003391392 
Protein GI284041462 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.300178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGCA CGTTTATGTA TGTACAGACG CTGGTCTGTC CACCCGCCCG CTGGCTGCTG 
GCACTAGGGC TCTTTCTACT TAGTATAACC ATCCATGCCC AGACCACGGG CAGTCTCAGC
GGTTCCGTTA TCGATTCGGA AAACGGGCAG GGCCTGCCGG GTGTCAACGT AGTGGCCAAG
GGGTCATCGC GGGGCACTAC CACCGACGGA AACGGTCGCT ATCAGCTGAC CGGCATTCAG
CCCGGCACCG TTCTGGTATT CAGCTCCGTT GGGTACACCT CGCAGGAAGT AACCGTAGGA
AGTCAAACGA CCATCGACCT GCGCATGGTC AGCGATAACA AGAGCCTGAA CGAAGTGGTG
GTACTGGGCT ACACATCGAC CCGCCAGAAA ACCCTGACCG GCTCCGCGCA GGCCGTTAGT
GCAAAGGAAC TAAAAGACGT AACGGCCAAC AACGTGGGGC AGTTATTACA AGGGAAAGCT
GCCGGTGTAT TCGTGGGCAA TAGCTCCGGC GACCCGCGTG TGCCGCCCAA AGTGCTGGTT
CGGGGGATTG GTACCCTTAC GGCCAGTTCC AACCCGCTTT ACGTGGTCGA TGGCGTAATC
GGTGGCATTC CGAACCCCAG CGACATCGAA TCCATCACCG TGCTCAAAGA TGCGGCTTCT
ACCGCTTTGT ATGGCGCCCG GGCCTCCAAC GGGGTCATTG TTGTGACCAC CCGCCGGGGC
ACATCGGGCA AAACCCAGGT TACGGCCCGG CTGAACAAAG GCGTTGGTTA CCTGAGCCTG
GGCAATTTCA GGCTGCTTAA TGGGCAGGAG TTATATAGCC TTCAGAAAGG GGTCTACCAG
CGCGACCAGC CCAACGGCAA CGTCAATGAT TACTTGCCCA CCCCCGAAGC CAACGCCAAT
ACGAACTGGT TCGACATTGC CTTTCGACCG GCTACCAACA CCCTGGCGGA GCTTAGTGCA
TCGGGCGGCA GCGACAAAAC CCGGTTTTTC CTGAGCGGCA ACTATTATCA GGAAGAAGGT
ATTCTGAAAG GAACGGGCCT GAACCGGTTT GGCTTACGGC TTAATTTCAG CCATAACCTC
AGCGAGAAGT TCCGGGTGAG CCTGAATAGT GCCGGCACCT ATACCCGTGG GTACGACAAC
AGCAATGGCT CGCTGTACGG CGCCTACACC TACCTGCCCT ACGACGACCC GTACTACAAC
GGACAGCCGT ACAACCCGAT CACGGGTGCT AAAAAATGGT ACGGGCGCGA CAATGCCAAC
TTCGTGTACA ATCAGCCATT CAACACGTTC AAAGAAAATA CCTTTGCGGG AGATGTGCTC
TTTAAAGCCG AATACAACCT GCTGCCCTGG CTTTCGCTCT CGACAACTAA CCGCGCCCAA
ACCAATTATT ACGGCAGCGA AAGCAGTCAG GATTTGCGGG GTAATGCCGC AGCCGACGTG
CTGGGCCGGT TAACGGACTA CACCAGCCGC GACTATAACC TGCTCACCTC CAACCTGCTT
CGGTTCAGGC ACAGCTTCGG CGACGGGCAC AGTCTGGATG GACTGGCCGG TTATGAATAC
CAGACTTATT ATTTCGAATC GCTGGGGGCA ACGGGAAAGG GCATCTTCTC GGGGCTGAAT
ATTCTGGATG CGACCTCCCA GCCCGAAAGC ATCAACGGCA CCAAAACCGA TAATGCCTTT
GCGTCTTACC TGTTCCAGGC CAACTACGGC TATAAGGAAA AGTATCTGGT AACGACCTCG
TTCCGGCGGG ATGGCTCCTC AAAGTTCGGG CGCGACCGTA AATACGGCAA CTTCTACGCC
GTGGGCCTGA GTTGGATTGC CTCCAACGAA GCCTTTCTGA ACAACAACCC AACGCTGAAC
AACCTAAAAT TCCGGCTCAG CTACGGTACC ACAGGCAACG CCGACGGTAT CAATGACTAT
GCCTCGCAGG GATTGTATAA CCTCACGGGT CAGTATGCGG GAGTGCCCTC GGCCTATCCT
ACCCGCATCG AAAACCCCAA CCTGTCGTGG GAAGTGTCTA ACAACACCAA CTTCGGCGTG
GACGCTACCC TCTGGAACCG GCTAAACGTT ACCGTCGACC TCTATAACCG ACTGACGAAC
AACCTGCTGT TCAACCGGCC ATTGCAGGGA ACCAGCGGCT ATGCGTTCAT CACGGAAAAC
ATCGGAGCCG TACGCAATCA GGGACTTGAG GTGGTATTAT CGGCAGATAT CCTGCAAAAA
ACAGCCCTCA AATGGCGCAC CGAAGTGAAC GTAGGCATGA ACCGGAACGA ATTGACGGCC
CTCTATGGCG ACCGCACCTT CGTGGCCAAC GGGCAGCGTC CGTTTGCACT CGACAAGCCG
CTGAACAGTT GGTACATGCG TCGGTGGATG GGTGTCGATG CGCCCACCGG CGACCCCCTC
TGGCAAAAGG TCAATGCCGA TGGCACGACC GCCACGACCA ACAACTACAA TGAAGCCACG
CTTCAGTTCA TCGGCAGCAA CGCCAATCCG AAACTGTTTG GCGGTATTCG GCAGGTGCTC
AACTGGAAGA ACTTCGAGCT GAACGCTTTC TTCACCTATG CGGCTGGCGT GACGCTCTAC
AACGGCGACC GGAACCTATT CGACAACGAC GGGGCCTATG ACCGGTACAA CCTGATGGCC
CTGCAGGACG GCTGGAGTCG CTGGGAAAAA CCGGGCGACA TCGCCACCCA CCCGAAATAC
GTAATCGGCG GCAATAAAAA CGCCCAGCGT CCGTCGTCGC GTTTCCTCGA AAACGGCAAT
TACCTCCGGC TCCGCAACAT TAGCCTGAAC TACGACCTGC CCAAAGCATT GGCCAGCAAG
GCCCACCTCG GCAGCGTCCG GCTGACGGCT TCGGGCGATA ACCTGTTTAC GGTCACGAAA
TTCTCGGGTA TCGACCCGGA CGTGGCAGAA ACGGGTGAAG TGGGGACGAA ATATCCTTTC
AGCAAGAAAT TTGTCTTTGG CGTTCAACTC AGCTTTTAA
 
Protein sequence
MNSTFMYVQT LVCPPARWLL ALGLFLLSIT IHAQTTGSLS GSVIDSENGQ GLPGVNVVAK 
GSSRGTTTDG NGRYQLTGIQ PGTVLVFSSV GYTSQEVTVG SQTTIDLRMV SDNKSLNEVV
VLGYTSTRQK TLTGSAQAVS AKELKDVTAN NVGQLLQGKA AGVFVGNSSG DPRVPPKVLV
RGIGTLTASS NPLYVVDGVI GGIPNPSDIE SITVLKDAAS TALYGARASN GVIVVTTRRG
TSGKTQVTAR LNKGVGYLSL GNFRLLNGQE LYSLQKGVYQ RDQPNGNVND YLPTPEANAN
TNWFDIAFRP ATNTLAELSA SGGSDKTRFF LSGNYYQEEG ILKGTGLNRF GLRLNFSHNL
SEKFRVSLNS AGTYTRGYDN SNGSLYGAYT YLPYDDPYYN GQPYNPITGA KKWYGRDNAN
FVYNQPFNTF KENTFAGDVL FKAEYNLLPW LSLSTTNRAQ TNYYGSESSQ DLRGNAAADV
LGRLTDYTSR DYNLLTSNLL RFRHSFGDGH SLDGLAGYEY QTYYFESLGA TGKGIFSGLN
ILDATSQPES INGTKTDNAF ASYLFQANYG YKEKYLVTTS FRRDGSSKFG RDRKYGNFYA
VGLSWIASNE AFLNNNPTLN NLKFRLSYGT TGNADGINDY ASQGLYNLTG QYAGVPSAYP
TRIENPNLSW EVSNNTNFGV DATLWNRLNV TVDLYNRLTN NLLFNRPLQG TSGYAFITEN
IGAVRNQGLE VVLSADILQK TALKWRTEVN VGMNRNELTA LYGDRTFVAN GQRPFALDKP
LNSWYMRRWM GVDAPTGDPL WQKVNADGTT ATTNNYNEAT LQFIGSNANP KLFGGIRQVL
NWKNFELNAF FTYAAGVTLY NGDRNLFDND GAYDRYNLMA LQDGWSRWEK PGDIATHPKY
VIGGNKNAQR PSSRFLENGN YLRLRNISLN YDLPKALASK AHLGSVRLTA SGDNLFTVTK
FSGIDPDVAE TGEVGTKYPF SKKFVFGVQL SF