Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3040 |
Symbol | |
ID | 8726792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 3690422 |
End bp | 3693745 |
Gene Length | 3324 bp |
Protein Length | 1107 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003387850 |
Protein GI | 284037920 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0955373 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAAG GTTTACAAAG GCTTATTATC TTATTGTGGG TAATAAGTAC CCCCGTATTC GCTCAGACGA TATCGGGTAG GGTCACGGCT GGTACCGATG GACAGCCATT ACCGGGTGTT TCCATTCTGG TAAAAGGAAC AACGTCTGGT ACCATAACGG ATACGGATGG GAAGTACAGC CTTGCTGCGG CAAAGAATAA AGTACTTGTA TTTTCCTTTA TCGGTTACAA GAGCAAGGAA GTTGTTATCG ACAACAAAAC GACGGTCGAC GTTACGCTGG ACGAAGATGC ATCCGTGATC AATGAAGTAG TTGTCACCGC CTTAGGCATT CCCAAAGCAG AGCGTGCACT GGGTTATGCT ACGGCGGTTG TCAAAAATGA TGCGCTGATC AAAACTGCGA CACCAAACTT TGCCACGGCG CTTTATGGTA AAGCTCCTGG TGTAACAATC AATGCAACAC CGGGTGGAGC CACTAGTGGC GTAAGCATCA GCATTCGCGG GTTAAGTTCG ATAACCGGCA ACACACAGCC GCTCATCGTG ATGGATGGTA TCCCTATCCG GAATGGCGAA GCCCGCAATA CTGACTACTG GGGCGACCAG CGGATTCGGG GTAACGGCCT TCTGGACCTC AACCCTGCCG ACATCGAAAA CATCTCGATT CTGAAAGGGG CATCTGCGGC AGCTTTGTAC GGTTCGGAAG CTGTAAACGG GGTTGTACTG GTGACTACTA AAACCGGAAA AGGCCGCAAA GGGCTGGGCG TTGATTTCAG CGCGAGCTAC AGTGCCGATA AAATTGCTTA CCTGCCACGT TACCAGAATG TTAGAGGCCC TGGCTATTTC CAGAACTACG CCAATGGGGG GCAGGATGCC AATGGTTTCA TTTCATACGA TACCGATGGA GATGGCAAAG GGGATACCCG CGGTCTGTTG GGTGCTACGG TTAACTTCGG CCCCAAGTTC GATGGACAAC CGGTTATGGC CTTTGATGGC GTCATTCGGC CTTATGTGGC ATCCAATAAC AGTTATGCCA ACTTGTTCCA GAACGCAAAT AGCGCGAACA TCAACCTGGC TGTTTCCAAA GCAACGGATA ATTCGACGAT TCGCTTTTCG TACACGCGGC AGGATAATGG CATGATTAGC TATGCGGCAA AAAACGAAAA GAATATTATG AACCTGAATG CCAGCTTTAG TCTCAACAAA AAGCTGACAA CGGATCTGAT GGTCAACTAC GTAAACCAAT ACACGCACAA CCGGCCCTTT AAAGTGGATC GTATGATCAA CAACTTCTCG GGGATGATGA ACCGGTTCGA ATCAGCCGAT TGGTACTTCA ATAAATACCA GACCAGCCAG GGCTATAAGT ATGTAACCGG TACCAACCAA AGCCTGACTC CCAAAGAGAA CATCATTCGT AACGGATTCA AAGGCGATAT TGGTGATTAC GTTTGGAGCA CCCGTGCCAA CACCTATGAT GAATACAGCA ACCGGGTTAT TGCCAGCATC ACGCAGCATT GGCAAATTCT GGACAACCTG AAGCTCCGAG GCCGGATTGG TACTGACCTT ACATCTGAGC GGCTCGAAGA CAAGCAGCGG AGTTCTATTC CTCTGGCCTT TGGTTACTCG GGTTACTTTG CCATGAACAA CAACCTGTAC AGTAATGTCT ATGGTGATGT GTTGCTTACC TATACCAAAA AGCTGAATCC GGACGTAACC GTGATGGCAT CGGGAGGCTA TACCGCCAAC AAGATGCTGA ATACGTACGT GGGCCGGTCA ACTAACGGAG GACTGAGCAC CGAGAATTTC TTCGATATCT CCGCGTCGGT GAATACACCG AACGGCAGCA ACAGCCGCGA CAAGTCTATC CGGGATGCAT TCCTGGGTAC GGTGAACTTT GATTACAAAA ATTTCTTCTT TATCGAAGGT ACCTTACGTC GTGACCGCAC GTCTACACTC GCACCGGGTA ACAACGCCTT CGTGTATCCG TCTTTGAACT CCAGCCTTGT ATTCAGTGAT CTGTTCCGGT TACCGGCGGT TATCGACTAC GCCAAGCTGA GAGGTTCGTG GGGTATTGTG GGTAACTACC CAACCATCTA TAGCGCCAAT AATGCCTATA ACCAGGGTAA CCTGAGCATC CAGCAAACCG GTGGCAGCTC GGTATTGTAT ACCAACATCA GCAGCGACTA TGGCAACGAC AAGATCCGTC CCGAGCAGAA ACATGAGTTT GAGTTCGGCC TGGAAGCCAA GCTGTTCAAG AACCGGCTGG GTGTAGACCT GTCGTATTAC AACGCCCAGA TTGTTGACCA GATTCTGCCG TTAACGATTG CCGCTACATC CGGTGCCAAG TCGATCCTGG CCAACATCGG TACATTGAGA AACCAGGGTG TCGAACTGGC CCTTAACTTT TCCGCCCTAA AAAGCGCGGA CCCTAACGGT CTGAACTGGG ACGTTACGTT GAATCTGGCT AAAAACAGCA ACAAAGTAGA GAAGTTGACC AACAACTCAA CCGAGCTGCT GCACGCCGAT TATGATGGCA ATGCCGCTCA GCTTCGTTCG GTGGTTGGCC AGCCAATGGG CGATATTTAT GTGCATGGTA TTCTCAAAAA TGCCGATGGA CGCAATGTCG TTGGGCCGAA TGGTATCTAC CAACTCGATG GTGCCAACTG GATAAAGGCT GGTAACGCCA TGCCAAAACT GACGGGTGGC TTGCTGAACA ACATAGGCTA CAAAGGTTTC AATCTGGATG TGGTTGTTGA CTTCCGGTAT GGTGGCTCTA TTATGCCAAC GGGTATCAAC TGGTTGACAT CGCGCGGGCT GACTGAGGAG AGCCTCACTG CTATGGACGC CGAGCACGGC GGGTTGCGTT ACTACAAAGA TGCCAACGGT AAAGGCATTG CAACTACAGG TTCTGCCGGG CCAAACGGTG AAGTGGTGTA TAACGACGGT ATGTTAATGG ATGGCGTACT GCCAACCGGC GAAGCTAATA CCAACATTAT CTCTCAGGCT GTGTATTACA ATAACACCTA CAACTGGGGT GGACCGCAGT ACAGCAGCTC GCGTTATGAG CTGTACGTAA AGGAAAATAC GTACATAAAA ATGAGAGAGA TCTCGCTGGG CTATCGGATT CCGGCCAGTA TTACCCGTAA GATTGGTACC CAGAACCTGA CCCTGTCGGT ATTTGGTCGT AACCTGTTCT TCATCTACAG AACTATTAAG GATCTGGACG CCGAACAAAC CAATTCGAGT ACACGCTGGG CCGAAAACAT CAATAACGCT GGTAACAACC CGTCGTTCCG CACCATGGGG GTAATGCTAC GCGCCAGCTT CTAA
|
Protein sequence | MVKGLQRLII LLWVISTPVF AQTISGRVTA GTDGQPLPGV SILVKGTTSG TITDTDGKYS LAAAKNKVLV FSFIGYKSKE VVIDNKTTVD VTLDEDASVI NEVVVTALGI PKAERALGYA TAVVKNDALI KTATPNFATA LYGKAPGVTI NATPGGATSG VSISIRGLSS ITGNTQPLIV MDGIPIRNGE ARNTDYWGDQ RIRGNGLLDL NPADIENISI LKGASAAALY GSEAVNGVVL VTTKTGKGRK GLGVDFSASY SADKIAYLPR YQNVRGPGYF QNYANGGQDA NGFISYDTDG DGKGDTRGLL GATVNFGPKF DGQPVMAFDG VIRPYVASNN SYANLFQNAN SANINLAVSK ATDNSTIRFS YTRQDNGMIS YAAKNEKNIM NLNASFSLNK KLTTDLMVNY VNQYTHNRPF KVDRMINNFS GMMNRFESAD WYFNKYQTSQ GYKYVTGTNQ SLTPKENIIR NGFKGDIGDY VWSTRANTYD EYSNRVIASI TQHWQILDNL KLRGRIGTDL TSERLEDKQR SSIPLAFGYS GYFAMNNNLY SNVYGDVLLT YTKKLNPDVT VMASGGYTAN KMLNTYVGRS TNGGLSTENF FDISASVNTP NGSNSRDKSI RDAFLGTVNF DYKNFFFIEG TLRRDRTSTL APGNNAFVYP SLNSSLVFSD LFRLPAVIDY AKLRGSWGIV GNYPTIYSAN NAYNQGNLSI QQTGGSSVLY TNISSDYGND KIRPEQKHEF EFGLEAKLFK NRLGVDLSYY NAQIVDQILP LTIAATSGAK SILANIGTLR NQGVELALNF SALKSADPNG LNWDVTLNLA KNSNKVEKLT NNSTELLHAD YDGNAAQLRS VVGQPMGDIY VHGILKNADG RNVVGPNGIY QLDGANWIKA GNAMPKLTGG LLNNIGYKGF NLDVVVDFRY GGSIMPTGIN WLTSRGLTEE SLTAMDAEHG GLRYYKDANG KGIATTGSAG PNGEVVYNDG MLMDGVLPTG EANTNIISQA VYYNNTYNWG GPQYSSSRYE LYVKENTYIK MREISLGYRI PASITRKIGT QNLTLSVFGR NLFFIYRTIK DLDAEQTNSS TRWAENINNA GNNPSFRTMG VMLRASF
|
| |