Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0962 |
Symbol | |
ID | 8724692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 1170301 |
End bp | 1173483 |
Gene Length | 3183 bp |
Protein Length | 1060 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003385812 |
Protein GI | 284035882 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000104088 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.00653561 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCTATGG CCATGGTGAT TGCAGAGTCA TTCGCAAGTT CTGCAAAACA ACCTAATGGG GGCCTTTCTA AAGACGGGCG GTCTGTTAGT GCGCTGGTAG CTATCAACGC TCCGGATCGT TTGGCAGAGG CCGCGACGGA CATTACCGTG TCAGGTAAAG TAATTGATGA GAAAGGTGAT GGGTTGCCCG GCGTAAGTGT TGTTATAAAA GGGTCAACAC AAGGGACAAC TACAGACGGA ACAGGAAGCT TTAAAATTTC CGTTCCCAAC GCCAATTCAA CGCTGGTTTT TAGCTTTGTC GGGTATGCGC GAAAAGAAGC CGTTGTGGGT GGCCAAACCA CACTTACCGT AACACTAACG CCCGATGACC AAACCTTAAA TGAAGTTGTG GTAGTTGGTT ATGGTAGCCA GTTAAAAAAG GAAATAACGG GGGCTGTTCA GACAGTAAGT GCCGCAGAAA TCAAAGATCT TCCTGTTTCC CAGATTGGCC AGAAATTACA GGGCCGGCTG GCGGGTGTTC AAATCAACCA GGCTACTGGT AAGCCGGGTC AGGGAATAAG CATCCGTATT CGCGGTCAGG TATCCGTTTC GGCGGGTAGC GACCCGCTTT ATGTAGTGGA TGGTTTCCCC ATAACGGGAA ATATTGCCCA GCTTAACCCC GACGAAATCG AAGATCTTTC TGTGTTGAAA GACGCTGCTT CGACCTCGCT GTACGGTTCG CGGGCGGCCA ACGGAGTTGT GCTGATTACC ACTAAAAAAG GTAAGCCCGG TCAGACAAAT ATTAGCTTCA GCGCGTTTGC AGGTGTCCAG AAAGTTCCCA TGCGAGGTCG CGTGAAGATG CTGGATGCCG TTCAGTTTGC CCAGTTCAAA AAGGAATATT ACGAAGATCA GGGGCAAGCT GTGCCGGTTG AGTTCCAGAA TCCGTCGCAG TACGAAGGTA AAAACAACGA CTGGTATGAT GCTTTGCTGC GGCAGGCACC TCTTCAGAGC TACAATCTGA GTATCTCTAA CAATACAGGT AAGGCTAATA CATCGTTGGT TGCCGGTATC TTCAATCAGG ATGGGGTTGT ACTGAACAAT AAATACAAAC GGTATTCACT GCGCCTGAAC TCAAACTATA ACCTGTCTGA CCGCGTAACG ATCGGCTTTA ACGTGGCCCC TTCGTACGTG TACGACAATA CCCCCCGGAC GGATGGTGAC CGGGGAACCG GAATTCTGTT CAACGCCCTG CACACCTGGC CAGTAATGCC CATTTATGCC GCCAATGGTG AACTGACCAA GTTCAATACT TTTCCGGGTA GTACGGGTAA TATTTTCCAG TATCCTAACT GGGTGCGGGC CGCTAATGAA CTGGTCAATG AAACCAAGAA CACAAACCTG CTGGCAAATG CGTATGTACA ATATCGGCCG ATTACTGGGT TAACGTTACG GTCGACGATG AATATCGAGT ATCAGAACTC TAAGTTCTTC TTCTTTAACC CGTCGACGGC TACGAGTGCC ATCAACGTGC CAATCCCAAC AACGGCCGTT TCTATTCGGC AGGGGCTGGA GAACACATCC TGGCTGAACG AAAACCTGGC TACCTATACC CGCAGCTTTA ACGATCATAA CTTTGAGTTG CTGGCTGGTT TTACCAACCA GTGGTATCGG CAGGAGTTCA ACCGTATTCA GGCCGATACG TATGCCGATG ATCGGCTTCC TACCATTCAG GGAGCACTCA ACATCAACCG CGGTGGTACA AACAACGGTA TCAACCAGTG GGCATTAACC TCCTATCTGT CTCGTCTGAC CTATAATTAC AAAGGAAAAT ACCTGTTTAC GGCAGCTGTC CGGTCTGATG GCTCGTCCCG ATTTGGCGCG AACAATCAGT ACGGTACATT CCCGTCGGCT TCGGTAGGTT GGGTACTTTC TGATGAAAAC TTCATGAAAA CAGTTATGCC TGTTTCGTTT GCTAAGGTTC GGGCTAGCTA CGGCGTAATT GGTAACAATA ACATTGGTAA CTACACCTCC TACGCCCTGG TGAACAACAC TACAAACGCC GTGTTCGGTA GCACGGTTGC TACGGGAGCG GTCGTTCGAT CGTTGGCTAA TCCAAATCTG GGCTGGGAAA CGACCAAACA ATTTGATATA GGGCTTGACC TGGGTCTGTT GAACGACCGT ATCCAGTTCA TTTACGATTT CTACACGAAG CGGACAACCA ATCTGCTTTA CGCTGTACAA ATTCCGCAGG AGTCGGGTTT CACAAACTTC AATGACAACA TTGGCGAAAT CAAGTTCTGG GGCCATGAAT TCTCGCTGAC AACCAAAAAC ACGACGGGTC GGCTGAAATG GAATACCAAT GCGAACATCT CGTTCAACCG CAATCTGGTT GTGGCGCTGG CTCCGGGTAT TGACCGGGTG TATGGTTCGT TCCACATTAC GCAGGTGGGT AAGCCCTTTG GCCAGTTCTA TGGTCTGATC AAAGAAGGAT ACTATCAGAG TGCTGAAGAA CTCCGATCAT CACCGATCAT TCCGGGCCGT TCAGCCATTG GCACCATCAA AATGAAAGAT GTGAACGGCG ACGGTGTGAT TACCTACGGT GGTGATGCCG ATGATCGCAC CATAATTGGT AGCCCATTCC CAAAATTCAC CTACGGTATT ACCAACGACC TGAAATACGG CAACTTTGAT TTCTCAATTA CGGGCTCTGG TTCGTATGGC AATCAGTTAT GGGTTCGCCA TTTGTACAGC ACCGCCAACC TGGACGCTGT ATTTAACATG GTTGAGGGTG TAAAGGACCG TTTCCGCGTT CAAAACGTCG TGACGAATGG TGTTGGTGTG GCTACTAAAG TAATAACACC AGGAGCCGGT CAGTTTGGCG CAACCAACAA TGGCGGGAAC TTTACGGGTA TTGAGCGTGA CTGGAACAGT ACGCAATTCC TGGCTGATGC CTCTTTCTTT ACCATCAAAA ATATAACGCT TGGCTATAAC ATTGGAGCAG TCAATAAGCT CTTCAAGTCG GCACGTTTAT ATGCTTCGGC TCAGCAGGTC TATATATTCA CGAAATACTG GGGTGGTCCA AACCCGGAGA CCAGCGGCAA CGGAGCTGGC GATGGTGACG GTGGTAACCT AAGCCAGGGA GTTGACTTCT CGAACTATCC GGTTCCACGT ACCTACACAC TTGGCGTTAA CCTGAACTTT TAA
|
Protein sequence | MAMAMVIAES FASSAKQPNG GLSKDGRSVS ALVAINAPDR LAEAATDITV SGKVIDEKGD GLPGVSVVIK GSTQGTTTDG TGSFKISVPN ANSTLVFSFV GYARKEAVVG GQTTLTVTLT PDDQTLNEVV VVGYGSQLKK EITGAVQTVS AAEIKDLPVS QIGQKLQGRL AGVQINQATG KPGQGISIRI RGQVSVSAGS DPLYVVDGFP ITGNIAQLNP DEIEDLSVLK DAASTSLYGS RAANGVVLIT TKKGKPGQTN ISFSAFAGVQ KVPMRGRVKM LDAVQFAQFK KEYYEDQGQA VPVEFQNPSQ YEGKNNDWYD ALLRQAPLQS YNLSISNNTG KANTSLVAGI FNQDGVVLNN KYKRYSLRLN SNYNLSDRVT IGFNVAPSYV YDNTPRTDGD RGTGILFNAL HTWPVMPIYA ANGELTKFNT FPGSTGNIFQ YPNWVRAANE LVNETKNTNL LANAYVQYRP ITGLTLRSTM NIEYQNSKFF FFNPSTATSA INVPIPTTAV SIRQGLENTS WLNENLATYT RSFNDHNFEL LAGFTNQWYR QEFNRIQADT YADDRLPTIQ GALNINRGGT NNGINQWALT SYLSRLTYNY KGKYLFTAAV RSDGSSRFGA NNQYGTFPSA SVGWVLSDEN FMKTVMPVSF AKVRASYGVI GNNNIGNYTS YALVNNTTNA VFGSTVATGA VVRSLANPNL GWETTKQFDI GLDLGLLNDR IQFIYDFYTK RTTNLLYAVQ IPQESGFTNF NDNIGEIKFW GHEFSLTTKN TTGRLKWNTN ANISFNRNLV VALAPGIDRV YGSFHITQVG KPFGQFYGLI KEGYYQSAEE LRSSPIIPGR SAIGTIKMKD VNGDGVITYG GDADDRTIIG SPFPKFTYGI TNDLKYGNFD FSITGSGSYG NQLWVRHLYS TANLDAVFNM VEGVKDRFRV QNVVTNGVGV ATKVITPGAG QFGATNNGGN FTGIERDWNS TQFLADASFF TIKNITLGYN IGAVNKLFKS ARLYASAQQV YIFTKYWGGP NPETSGNGAG DGDGGNLSQG VDFSNYPVPR TYTLGVNLNF
|
| |