Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3978 |
Symbol | |
ID | 8727736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4778305 |
End bp | 4781484 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003388767 |
Protein GI | 284038837 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000935862 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAT ACTTACTTAG TTGTTTTATG CTGGTTATGA TCGCGACGGG ACCGCTCTGG GCGCAAACCC GACAATTGAC GGGTGTGCTA CGCGACGAGC AGGGCCAGAC CATATCCGGC GCAAACGTGG TCGTTAAAGG GACTACGCGC GGTACTACAA CGGATGCCGC TGGTGAATTT CGCCTGTCAA TCCCTGCCGA AAACACGATC CTGACAATTT CATCAGTCGG GTATACGGCA AAGGACGTAC CTGTATCATC GAGTCAGACA CAACTCAGCG TAACGCTGGC GACCGATGAC CGGCAACTGG GCGAAGTGGT TGTTACCGCT CTGGGTATCA AGCGCGAAGC GAAAGCTCTC AGCTACGCTA CCCAGATGAT TAAACCCGCC CAGATCAACG AAGTACGCGA CGGTAACGTG CTGAACACCT TACAGGGCAA AATTGCCGGT GCCTACATAA CCCAGGGTTC CGGCGGACCG GGCACGGGAT CACGAATTGT GCTGCGGGGA AACCGCTCCA TTCAGGGAAC GAATAATGCC CTGATGGTGG TCGACGGCGT TCCAATCAAC AACAGCACCT TTGGGCAGGC CACCAGCGAC TTTGGCAGTG TGGCCAACTC AGATGGAGCC TCGAACATCA ACCCCGACGA CATCGAGAAT GTGACCGTAT TACGGGGTGC GTCGGCGGCT GCCCTCTACG GCAGTCAGGC TGCCAACGGG GTAATCCTGA TCACAACGAA ACGCGGAAAG TCGGGCCGGG TATCGGTCGA TATAAACTCG GGCGTTTCAA TCGATAAACC CTTTGCGCTG CCAATGGTAC AAAACCAGTT TGGACAGGGC GTTGGCGGAA AGCTGGACCC TGCCGTTGGG GCCAGCTGGG GCGCCCCCAT GACGGGACAG TCGTACACAA ACTACCTCGG CAATCCTGAT ACGTACTCAG CACAGCCCAA CAACATTCGT GATTTTTTCC GCACGGCGGT CAGTTTAAAT AACTCCATTG GCATTACGGG AGGCTCAGAG CGGTCGCAGA CGTATCTGTC GTACACCAAT AACTCATTGC AGGGAACAGT GCCGGGCAAT GACCTGACCC GTCACACCAT CAACCTGCGG TTGTCGAACC AGATCAGCTC GAAGCTATCG ACCGATGCCA AGGTAACGTA CATCAATCAG TCTGTAGTGA ACAAGCCCCG GACGGGTGAG GAAAACGCGC CGGTCATTGA CCTCTACCAG ATTCCCCGTA ACGTAAGCCT GACCACGGCA CAAAACTACG CAGCGCCCAA CTCGTTCGGT CTGCCTACGC CAACGGCCTG GCCGTCGACG CTGTCGTCGA TCTACCAGAA TCCCTACTGG ATGACCAATC AGACGGCCAT TAACCAGTAC CGGGACCGCA TCATCGGCTT CGTGCTGGCG AAGTACCAGT TAACTGATTT CCTGAGCATT CAGGGCCGGG CCAACCTCGA TAAGTATTTC GACAAAAATG AAGAAAACTA TAGCCAGGGC ACAATTCTAT GGGCCAACCA GGCGGGTGGT AAATTCTCCC GAAACAACAT CGTAAATACC CAAAGCTGGT ATGACCTGTT GATTGAAGGA CGGAATAAAA TCGGGACTGA CCTTACGCTC GACTATCAGG CGGGGGCCAT TATTCAGGAT ACCCGCTACC AATCGACCAA CTCCCTGGCC GACGGTCTCA ATGTACCGAA CCGGTTTAAC CTGAACTTTG GTACGAACCA GACACTGGGC GATGATTTCT CGCGGATTCA GACCCAATCG CTGTTCGGGC AGGCATCGCT GGCATGGCGG GACGCTATTT TTATCAATGC CAGTTTGCGT AATGACTGGT CATCGACCTT GCCAAAGCCT TATTCGTTCC AGTATCCATC CGTCGGCGCA TCGGTGGTTT TGTCTGATCT GCTGAAACTT TCGGGGCCGC TGTCATTCCT GAAAATTAAT GGATCGTTCG CGCAGGTGGG TAACGGAGCC GATCCGTATC TGTTGCAAAC CAATTACTCG TACAGCCAGG GTGCCGGTTC CGGATTCATT AGCCGGGATG GGACACAGGC CATTGGTAAC CTGAAGCCAG AGATCACCAA AAGTCTGGAA CTTGGCGTCG ACGCCCGTTT TCTCAGCAAC CGTATTGGTG CAACGATTAC GGCCTACAAA ACCAATTCGA TCAACCAGTT ATTGAAACTG GGACTGGCAC CCGCTTCGGG ATTCAGTGAC CAGTACATCA ACGCGGGCGA TATCCGCAAC ATGGGTCTTG AGGTAGTAAT CAATGGAACA GCGATCAAAA CCGACCGGTT GACCTGGGAT CTGACGCTGA ACATGGGCCT GAACCGGAAT AAAATCGTTA GCCTGTCGCC CGATATTAAA ACGGCGTTCC TGTCGGGCGG TTATGGCCGG TCAGCATCGC CGATTGTACA GGAAGGAGGC TCTTACGGTG ATATCGTATC GTACCGCTGG GCGAAAAATG CCAACGGCCA ATACCTGATT GGTTCGCAAA CACCCAGCGG AACGGTGTCC GAAGCATCGG TTGTATCGAC TGGCTTGCCC GTGGCTACCA AAGAGCAGGA ATACATCGGC AACTTCAACC CCAAAATGCT CCTGGGATTC ACCAACACAT TTACGTTCAA AGGCTTTTCG CTCCGCTTTC TGGTCGACGC CCGTTTAGGT GGCATAGCCG TATCGGGTAC TGAAATGAAC CTGGCGTTCA GCGGCATTCC GGAAGTAACG GCGCTGAATC GGGGTGGTGG CTGGGTATTG CCGGGCGTTA CGGCTGGCGT TGCCGGAGCC GATGGAACAA CCTTGATCGG AGCCGGCAAG ACGAACGCAC AGGCCATTAC GGCCGAACAG TTCTGGCAAA CGGTATCGGG TAAACGCTAC GGCTGGGGTG AGTTCTTCGC GTACGATGCG ACCAACGTGC GCCTTCGGGA AATTTCGATC GGTTACGGCA TTCCGGTACC GTCGAATTTC TTTATCAAGT CGGCTCGCCT GTCGTTCGTA GCCCGCAACC TGTTCTGGAT TTACCGGGGT AGTTCGCTGC TGGACATTCC CGGTATTGGC AAGCGGAAGA TGTGGTTCGA CCCCGATGTA AATATCGGCA ACGGCAACTT CCAGGGCGTC GAATACGGAA CCCTCCCATC AAACCGGAGC CTTGGCCTGA ACCTGAAACT TTCTTTTTAA
|
Protein sequence | MLKYLLSCFM LVMIATGPLW AQTRQLTGVL RDEQGQTISG ANVVVKGTTR GTTTDAAGEF RLSIPAENTI LTISSVGYTA KDVPVSSSQT QLSVTLATDD RQLGEVVVTA LGIKREAKAL SYATQMIKPA QINEVRDGNV LNTLQGKIAG AYITQGSGGP GTGSRIVLRG NRSIQGTNNA LMVVDGVPIN NSTFGQATSD FGSVANSDGA SNINPDDIEN VTVLRGASAA ALYGSQAANG VILITTKRGK SGRVSVDINS GVSIDKPFAL PMVQNQFGQG VGGKLDPAVG ASWGAPMTGQ SYTNYLGNPD TYSAQPNNIR DFFRTAVSLN NSIGITGGSE RSQTYLSYTN NSLQGTVPGN DLTRHTINLR LSNQISSKLS TDAKVTYINQ SVVNKPRTGE ENAPVIDLYQ IPRNVSLTTA QNYAAPNSFG LPTPTAWPST LSSIYQNPYW MTNQTAINQY RDRIIGFVLA KYQLTDFLSI QGRANLDKYF DKNEENYSQG TILWANQAGG KFSRNNIVNT QSWYDLLIEG RNKIGTDLTL DYQAGAIIQD TRYQSTNSLA DGLNVPNRFN LNFGTNQTLG DDFSRIQTQS LFGQASLAWR DAIFINASLR NDWSSTLPKP YSFQYPSVGA SVVLSDLLKL SGPLSFLKIN GSFAQVGNGA DPYLLQTNYS YSQGAGSGFI SRDGTQAIGN LKPEITKSLE LGVDARFLSN RIGATITAYK TNSINQLLKL GLAPASGFSD QYINAGDIRN MGLEVVINGT AIKTDRLTWD LTLNMGLNRN KIVSLSPDIK TAFLSGGYGR SASPIVQEGG SYGDIVSYRW AKNANGQYLI GSQTPSGTVS EASVVSTGLP VATKEQEYIG NFNPKMLLGF TNTFTFKGFS LRFLVDARLG GIAVSGTEMN LAFSGIPEVT ALNRGGGWVL PGVTAGVAGA DGTTLIGAGK TNAQAITAEQ FWQTVSGKRY GWGEFFAYDA TNVRLREISI GYGIPVPSNF FIKSARLSFV ARNLFWIYRG SSLLDIPGIG KRKMWFDPDV NIGNGNFQGV EYGTLPSNRS LGLNLKLSF
|
| |