Gene Information |
Locus tag | Slin_0553 |
Symbol | |
ID | 8724281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 681252 |
End bp | 684362 |
Gene Length | 3111 bp |
Protein Length | 1036 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003385416 |
Protein GI | 284035486 |
COG category | |
COG ID | |
TIGRFAM ID | |
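The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the final codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp rows of this record.
gene_len, protein_len = cds_lengths(681252, 684362)
print(gene_len, protein_len)  # 3111 1036
```

Under these assumptions the result agrees with the Gene Length (3111 bp) and Protein Length (1036 aa) rows.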
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAACT ATTTACTCAA TAGGCTTCAA CAGTCGATAC CCTACTGTTG GTTAGCTCTG CTGTTGACGA TCGGCATTGC CAACGGGCAG ACAACCACTT ACTCGTTTAG CGGGCGGGTA CTGGATGAGA AAAATACGGG CCTCCCCGGC GCTACGGTCG TCCTGAAAAA CAATAACAAG ACGGGCACTA CCACCGACGC CAACGGTAAG TTCACGATCA GCATGCCCAC GGGCGGTGGC ACACTGGTGG TATCGGCCAT TGGTTACCTG GCCAAAGAAG TTGCCGTTAC GAGCGAAACC ACCATCGACG TACCGATGGC TCCCGACGTA AAGACGCTCA ACGAAGTGGT GGTAGTTGGT TACGGAACTC AAAAGAAGGA AAACCTGACA GGGGCCGTGG CGGCCATTAC CATCGATGAT AAAATATCGA GCAGGTCGCT GTCAAACGTC TCCTCGGCGC TGTCGGGCCT GATTCCGGGT CTGGCCGTTC AGCAGTCGAC GGGGCAGGCG GGGCGCAGCG GGGCCGCGCT AGTGATCCGG GGGCTGGGAA CGGTCAATAA CTCGGGGCCG CTGATTGTCG TAGATGGTAT TCCCGATGTC GACATCAACC GGATCGACAT GAACGACGTT GCCAGTATCT CCGTCCTGAA AGATGCGGCC TCGGCTTCCG TTTACGGGTC AAGGGCAGCT AACGGCGTTG TGTTGATCAC CACCAAAAAC GGCTCGCAGA ACAAGAAACC GGTCATCAGT TACACGGGCA CGTACGGCCT TTCAGAACCG ACCAATTTCT ATAACTACTT CGATGACTAC GCCCGTTCGC TTACCATGCA CCTGCGGGCC TCGGGGGCGG GGGCATCGTC GACCACCTTC CGGTATGGCA CCGTGGAGGA CTGGCTGTCG AAGAGCATGA TCGACCCGAT CAAATACCCC AGCACCAACT GGTGGGATGT GGTGCTGCGC GATAAGGGCC GGATTCAGAC ACATAACCTG TCGGCAGCGG GTGGCAATGA GCGTTCCAAT TTTTACCTGT CGGCGGGGAT ATATGATGAG TTGGGTATCC TGATCAATCA CGATTACAAG CGATACAACA CCCGATTTAA CCTGGACTAC AAACTGAGCG ACCATATTAA AGTCGGCATT CGGATGGATG GACAATGGTC GAAGCAGACC TACGCCAACT CCGAAGGGCT GATTACCTAT ACCGGAACCG GGGGCTACGA CATTCGCTAT GCCGTGGCCG GTATTCTGCC GCAAAACCCG CTTACTGGTC AGTATGGGGG TGCAATGGCC TATGGCGAAG ATGCCCTGGC GTATAATATG CTGGCGGCCA TGAACGTCAA CCATAACCTA CGCGACCGCC AGGAAGCCAA CGGTAATTTA TACGGCGAGT GGACGCCCAT CACGGGCTTG ACCATCCGCG GTGATTATGG CCTGCGGTAT TACAATCAAT TTACAAAAAG CTACGCCGAC CCCTCGGATG TCTTTAACTT CCAGACGAAC CAGATTTCAC GCAATCTCGT ATCCAGCAGC GCTGGTATTA GCAACGCCAT CAATTCGGGC TATAAAACGC TGCTTCAGGG CCGGGTAACG TACAACAAAA CGCTCTTCGG CAACCATCAG CTGAGTCTGT TGGGGGCTTA TACGGAAGAA TACTGGTTCA ACCGAAACCT GTCGGCCAGT CGCCTGGAGC GCATCAACCC GCTCCTGAGT GAAATCGACG CGGCTCTTAC CACGACGCAG GCTGCCGGGG GTAACTCCGA CGCCGAAGGG TTGCGGTCGG GCATCGGTCG ACTCAATTAC 
GTCGTCAACG ATAAATACCT GTTTGAAGTG AACGCCCGCT ACGATGGGTC CAGCAAGTTT CTGCCGGGAT TTCAGTACGG CTTTTTTCCG TCGGCATCGG CAGGCTGGCG TTTCTCGGAA GAGCCTTTCT TCAAGCGGTT CAGTTCCGTG GTATCGTCGG GTAAAGTCCG GGCTTCCATT GGTAAGCTGG GCAACAACTC GGGCGTGGGG CGGTACGAAC AGCGCGATAT TTTCAACCTG ACCAATTATA TCCTGAACGG GAAAATCACC AAAGGCTTTA GTTCGGCCAA AATTATCAAC GAGGATTTCT CCTGGGAAGA AACCAACGTA ACCAACCTTG GGCTGGACCT GGCTTTCTTC GGCGGTCGCC TGACAACCGA TATTGATTAC TACAACAAGC TGACTTCCGG GATGATCCGC CCGTCGTCGC TGTCGACTTT CCTGACCGGC TACAACGCCC CGCGTGTGAA CATCGGCAAG CTCCGGAATA CGGGTGTAGA AGTCAATGTC ACCTACCGGG CCAAGGTTCG TGATGCTAAC GTTGGAGCAA CGCTCAACAT GGCCTTTAAC CAAAATAAAC TGCTGGAGTG GAATGAGTTT CTGAGCAAAG GGTACACCTA CCTGAACCTG CCTTATCACT TCGCTTACAG CCGCGTGGCC ACGGGCATTG CCCAGAGCTG GGAGGATATT GCCAACGCAC CCTATCAGGG ACAGTACTTT TCGCCGGGCG ATATTCTGTA TAAAGACCTC AATGGCGACG GTCAGGTGAA CGACGAAGAC CGTAAAGCCG AACCCAAATT TAACCGGGAT CAGCCTACGG GTACCTACGG CCTCAATCTG TTTGCCAACT GGCGGGGTTT CGATGTCAGC GTACTCTGGC AGGCTGCCAC CGGCCGGAAA GATTTCTGGC TGGAGCCGTT CAACAACGTC AACATTCCGG CCGCACGCAA CGCCTTTCAG GACTTTTTAT GGAACGATAC CTGGAGCCTC GACAACCGGC TGGCGTCGCT GCCCCGGCTT GTTACGGGTT CGGGTGGTAA CAACCAGGCC GAATCGACCT TCTGGCTCGA CAACTTTGGT TACCTGCGGC TTAAGAATAT TCAGTTGGGC TACAACATTC CCACCAAATA TATCAGCCGG TTGGGATTGA GTAAGGTCCG TATCTACGGA ACATCCGAAA ACCTGCTGAC CTTCACAAAA TACCGCGGTG TCGACCCGGA AAAGAGCACG AGTGTGTCGG GAGCCGATAA TAATGACGAC CCGTTTCCGC TGCTCAAATC CTATTCGTTC GGTCTTAACC TCAGCTTTTA A
|
Protein sequence | MPNYLLNRLQ QSIPYCWLAL LLTIGIANGQ TTTYSFSGRV LDEKNTGLPG ATVVLKNNNK TGTTTDANGK FTISMPTGGG TLVVSAIGYL AKEVAVTSET TIDVPMAPDV KTLNEVVVVG YGTQKKENLT GAVAAITIDD KISSRSLSNV SSALSGLIPG LAVQQSTGQA GRSGAALVIR GLGTVNNSGP LIVVDGIPDV DINRIDMNDV ASISVLKDAA SASVYGSRAA NGVVLITTKN GSQNKKPVIS YTGTYGLSEP TNFYNYFDDY ARSLTMHLRA SGAGASSTTF RYGTVEDWLS KSMIDPIKYP STNWWDVVLR DKGRIQTHNL SAAGGNERSN FYLSAGIYDE LGILINHDYK RYNTRFNLDY KLSDHIKVGI RMDGQWSKQT YANSEGLITY TGTGGYDIRY AVAGILPQNP LTGQYGGAMA YGEDALAYNM LAAMNVNHNL RDRQEANGNL YGEWTPITGL TIRGDYGLRY YNQFTKSYAD PSDVFNFQTN QISRNLVSSS AGISNAINSG YKTLLQGRVT YNKTLFGNHQ LSLLGAYTEE YWFNRNLSAS RLERINPLLS EIDAALTTTQ AAGGNSDAEG LRSGIGRLNY VVNDKYLFEV NARYDGSSKF LPGFQYGFFP SASAGWRFSE EPFFKRFSSV VSSGKVRASI GKLGNNSGVG RYEQRDIFNL TNYILNGKIT KGFSSAKIIN EDFSWEETNV TNLGLDLAFF GGRLTTDIDY YNKLTSGMIR PSSLSTFLTG YNAPRVNIGK LRNTGVEVNV TYRAKVRDAN VGATLNMAFN QNKLLEWNEF LSKGYTYLNL PYHFAYSRVA TGIAQSWEDI ANAPYQGQYF SPGDILYKDL NGDGQVNDED RKAEPKFNRD QPTGTYGLNL FANWRGFDVS VLWQAATGRK DFWLEPFNNV NIPAARNAFQ DFLWNDTWSL DNRLASLPRL VTGSGGNNQA ESTFWLDNFG YLRLKNIQLG YNIPTKYISR LGLSKVRIYG TSENLLTFTK YRGVDPEKST SVSGADNNDD PFPLLKSYSF GLNLSF
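The GC content row can in principle be recomputed from the Gene sequence field. The sketch below is one way to do that, assuming the sequence is pasted as text in the space-separated 10-base blocks shown above. The helper name `gc_fraction` is hypothetical, and the example call uses only the first block of the gene sequence, not the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters (such as the block separators
    used in this record) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First 10-base block of the gene sequence only, as a small illustration.
print(gc_fraction("ATGCCAAACT"))  # 0.4
```

To compare against the GC content row, pass the full Gene sequence string and round the result to a whole percentage.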