Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1220 |
Symbol | |
ID | 8724953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 1482890 |
End bp | 1486303 |
Gene Length | 3414 bp |
Protein Length | 1137 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003386069 |
Protein GI | 284036139 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCTTC TACGTCTACC TAAAATCAGT CTGGTCCTGT TGGGGTTGCT GTGTCAGCAA CTGGCTACGG CCCAGGCTAT TGTCTTCGCC CGCCAACAGC GGAAAGCGAA TAACCTGGCG CAAATCGCTC CGGTCGAGAG CCAGAAACTT AAAGAAGTGC TGACGGATAT GAGCCGCCAG TTTCAGGTTA GTATTCTATT TGAGGAAGCA ACCGTTAAGG GAATCACCGT GCCGGTCGAT GCTCGTCCGG GCACTGGAAA GCTGGAAAAG CAACTTCAGT CCCTGCTAAA ACCGTATGGG CTGATGGCCC AGAAGAAGGG TGAACAGGCG TATTATGTCA TCAAAATCCC GGCTAAGGAG AGCGCGACCT CGGTCAGAAT GATCCAATCC GAAAACGGAA TGGTGCCCGC TATTGAGAAT AGCGTATCGG CGCTTCAGCC GTTGGCACCC ACATTGACCG AAAAAGTAAC GGCCGACATT CGGGTCACCG GCCGCGTAAC CAGCGAAAAA GGTGAAGGGT TACCCGGCGT TAACGTCGTC ATTAAAGGGG CCATCCGGGG TACGAACACT GACGCCGACG GACGCTATCA ACTAAATGTG CCCGATGCGA ACACAACGCT GGTATTCAGC TTTGTTGGCT ACGCTACGCA GGAAGCGCTG GTAGGTAACC GGACAACGCT GAACATTCAG CTACAGCCCG ATAACAAATC GCTTAACGAA GTGGTGGTAG TGGGCTACGG TACGCAGTCG CGGAAAAACC TGACCAGTGC AGTGAGCACC ATTAAGCCCG ACGAACTGAA CCGGGGGGCT ATCAGCGACG TAGGCCAGTT GTTGCAGGGT AAAGTGCCGG GCCTGAACAT CTCGGCCAAT GGTGACCCAA ACGCACCAGC CGCCGTAATT CTGCGGGGAG CGTCTACGAT TAACAGTTCG CAGGGACCAT TTTATGTGAT TGACGGGGTG CCGGGCGCTG ATATTTCCAT TATTGCCCCC GACGATATTG CGTCGATTGA TGTGCTGAAA GATGCCGCTG CTACGGCTAT TTACGGTAAC CGGGCTGCCA ACGGTGTTAT CATGGTAACG ACGAAGCGCG GTAAAAAAGG CCAGATGCAG ATTACCTATA GTGGCTATGC CGGTATTGAA AAAGTATCGA GCAAGCTCAA TATGATGAAT GCCAGCCAGC TTCGCGACTT CCTGACGAAG AACGGGCAGT CGTTCTCGCC AAACGATGAT AAAGGAGTGG ATACCGACTG GCAAGCCGCC GTTCAGCGCA GCACGGCCAT TTCGCATAAC CACAATATCT CGATCAGCGG GGGCACTGAA CATAGCACCT ACAGTGCCAG TATCAACTAC CTCGATAAGC AGGGGATTTT GCAGAGTAGC TCGCTGAATC GGGTTATTGC CCGTCTGGCT GTTGAGCAAA TGGCCTTTAA CGACAAACTG AAGTTGGGAC TGAACGTGAC CAACTCCAGT AGCAATGCCA ACAACACGCC CCTGCGCAAC AACGTCCTGA ACCAGATGGT GAATCACCTG CCGGTTTCGC CCGTAACCAA TCCAGACGGT ACCTACTTCG AGAACTTCCA GAACACCGGC TACTTCAACC CGGTGGCGAT GATCAATTAT GCCAAGGACA ATACAAAGTA TAACAACCTG GTTGGTTCTC TGTTTGCCCA GGTGAAACTG CCGTTTGGTC TTTCGTACGA TCTTAACCTG TCGTACCAGA GCAATACGTC GCTGCATGGT GAGTCGTACG CCAGCTATTA CACGCAGTAC AACAGCGCCA ACTTCTATAA CTACCCCGAT CCGCCACTCG TACACAGCCT GCTCAACTTC GGTACCAACG GGTCGGCGCT GCGGAATACT TACCAGACCA CACGTAAGGT GCTGGAAACC TTTTTTACCT GGAACAAGGA ATTTGGCGAC CACTCTGTGA ATGCCGTTCT GGGCTATTCG TGGCAGGGTA ATGTTTCGGG CGATGGTTTC CAGACGTCGA CAACCAACTT CCCCGTCGAT AACATTGGCT ACAACAACTT CGCGCTGAGT AACCCCTATG CAGTTTCGTC GTACCGGATC AACTTTGGCC CTGACGGTAT CTACCAGGAA ACGCGGCTGA TTTCTGATTT TGCCCGGTTG AACTATAATT ACAAAAATAA ATACCTGCTG CAGGGCTCGA TCCGACGCGA CGGTAGCTCG GTGTTTGGTA AAAACAACCA ATGGGGATAT TTCCCGGCAG CGGGTGTCGC GTGGCGCATC GACCAGGAGA AGTTTATGCA AAACCAGAAC CTGTTCAGCG ACCTGAAATT CCGGGCCAGC TACGGAGTAA CGGGTAACTC GTCGGGCTTC AATGCCTACA CGGCGCAGTT TATCTCGGGT AGCCTGGGTA CATATTATTA CAACGGTATC CAGACGGCCG CTTATGGCCC TACGCAAGCC GCTAACCCCG ATCTGCACTG GGAAAAAACG GCAACGGCTA ACATCGGCCT TGATTTTACC ATCCTGAAAG GCAAGTTGAG CGGTACCGTT GAGTGGTATA ATAAAGAAAC CACCGGTATG ATCTATGCCT ACCGGGTCAA TCCGGTGCTT GTACCGGCCG GTAGCATCAT TGCCAACGGT GGCAGCATGA GCAACAAAGG TGTTGAGGTT AGCCTGAACG CGACACCGGT GCAAGCGGGT AAATTCAGCT GGACAACGGG CTTGAACCTG GCGCACAACA GCAACCGGAT CAATAGCTTA ACGAATCCGC TGTTTGTAGG TGGCGACTCT GTCAGGACCA CGCAGCCCGA AGGGGCCGGA CAAACGGGCA GCACCCTGCA AATCCTGAAA GCGGGTATGC CGCTAGGACA GTTCTTCTCG CTTGAGTATG CCGGTAAGAA CGACAAAGGA GTGTCGCAGT ACGTGAGTCG AAACGGTTCC CTCACCACGA CACCGGTTAT TGGTACGGAC TACAAATACC TGGGTAGTCC ACAACCCAAA CTGCTGGTTG GCTGGACTAA CACCCTACGC TACGGAAACG TTGATCTGAA CGTCTTTTTC CGCGGAGTCT TCGGCAACAA AATCTTCAAT GCCACCCGCG CCGATTTGTT CCGGCCAAGT ACAGCCCAGT TTACGAATAT TCTGGTCGAT GCCGCCGATG AAAAAGCAAC CGACGTTAAC TCGTTCAAAT ACTCAAGCCG CTACATAGAA GATGGCAGCT ACGTTCGACT CGACAACGCC ACGCTGGGCT ACACCCTCAA AAATCTGGGT CAGTACATCC GCAACGTGCG TATTTATACA TCGGTCAACA ATGCGTTCGT GATCACCGGT TACAAAGGAA TTGACCCCGA AATCAATCAG GGCGGCCTGG CTCCGGGTAT CGAAGCCTAC AATTTTTATC CAAAGACCCG CACATTCCTC CTGGGTGTAA ACGTGTCATT TTAA
|
Protein sequence | MPLLRLPKIS LVLLGLLCQQ LATAQAIVFA RQQRKANNLA QIAPVESQKL KEVLTDMSRQ FQVSILFEEA TVKGITVPVD ARPGTGKLEK QLQSLLKPYG LMAQKKGEQA YYVIKIPAKE SATSVRMIQS ENGMVPAIEN SVSALQPLAP TLTEKVTADI RVTGRVTSEK GEGLPGVNVV IKGAIRGTNT DADGRYQLNV PDANTTLVFS FVGYATQEAL VGNRTTLNIQ LQPDNKSLNE VVVVGYGTQS RKNLTSAVST IKPDELNRGA ISDVGQLLQG KVPGLNISAN GDPNAPAAVI LRGASTINSS QGPFYVIDGV PGADISIIAP DDIASIDVLK DAAATAIYGN RAANGVIMVT TKRGKKGQMQ ITYSGYAGIE KVSSKLNMMN ASQLRDFLTK NGQSFSPNDD KGVDTDWQAA VQRSTAISHN HNISISGGTE HSTYSASINY LDKQGILQSS SLNRVIARLA VEQMAFNDKL KLGLNVTNSS SNANNTPLRN NVLNQMVNHL PVSPVTNPDG TYFENFQNTG YFNPVAMINY AKDNTKYNNL VGSLFAQVKL PFGLSYDLNL SYQSNTSLHG ESYASYYTQY NSANFYNYPD PPLVHSLLNF GTNGSALRNT YQTTRKVLET FFTWNKEFGD HSVNAVLGYS WQGNVSGDGF QTSTTNFPVD NIGYNNFALS NPYAVSSYRI NFGPDGIYQE TRLISDFARL NYNYKNKYLL QGSIRRDGSS VFGKNNQWGY FPAAGVAWRI DQEKFMQNQN LFSDLKFRAS YGVTGNSSGF NAYTAQFISG SLGTYYYNGI QTAAYGPTQA ANPDLHWEKT ATANIGLDFT ILKGKLSGTV EWYNKETTGM IYAYRVNPVL VPAGSIIANG GSMSNKGVEV SLNATPVQAG KFSWTTGLNL AHNSNRINSL TNPLFVGGDS VRTTQPEGAG QTGSTLQILK AGMPLGQFFS LEYAGKNDKG VSQYVSRNGS LTTTPVIGTD YKYLGSPQPK LLVGWTNTLR YGNVDLNVFF RGVFGNKIFN ATRADLFRPS TAQFTNILVD AADEKATDVN SFKYSSRYIE DGSYVRLDNA TLGYTLKNLG QYIRNVRIYT SVNNAFVITG YKGIDPEINQ GGLAPGIEAY NFYPKTRTFL LGVNVSF
|
| |