Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3036 |
Symbol | |
ID | 8726788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 3680686 |
End bp | 3683787 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003387846 |
Protein GI | 284037916 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000276569 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.556141 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCAA AATTACGTAG AGCCATTCCC GGGTTATTAC TTTTTCTGGG CATATGGATT TCCGCTGTTG CATCACCTGT ATTTGCTGTC GACAGTGAAA TAACGGGCCG GGTTTCAGAC GAGAAAGGAA ATGACGTGGT TGGTGCCTCT GTCACCATCA AAGGTACCAA TCGGGGTACT AATACAGATG CCAGTGGCAA GTATCGAATT GTCGTTCCAA ATGGAAGTGC CGTACTTGTA TTTTCATACA TTGGCTACAC CAAGCAGGAG GTTACCGTTG GTAACCGCTC CGTTATTGAC GTGAAGCTGG AGCCAGGTAG CGCTGTACTC GACGAGGTCG TGGTAACGGC TTTAGGGATA TCGAAAGAAG CCCGTAAGGT TGGTTATGCG GTTACGACGG TTAGTAGTGA AGCATTCACA AAAGCTCGCG AAACCAACGT TGGTAATTCG CTGGTTGGCC GGGTTGCCGG TTTGACCGTA AAGGGTACCA ACGGTGGCCC CGGCAGCACA TCTAAAATCC TGTTGCGGGG TATGCCCAGT ATTAATTCGG GTGGTGCGCC ATTGATCGTT ATTAATGGTG TACCCATGGA CAATACCCAA CGGGGTAGCG CCGGCGAGTG GGGTGGTGCC GATGGTGGAG ACGGTATCGG TAACCTGAAC CCGGATGACA TCGAAACAAT GACGGTGCTG AAGGGACAAT CTGCATCGGC CCTGTACGGT GCCCGATCAT CTAATGGTGT TATTCTGATT ACAACCAAAC GCGGTAAGAA AGGTGATTAT GCCATTGAAT ACAACGCAAA CCTGACAGCT GATAGTCCGA TTAACTTTAC TGATTTTCAG TACGAGTACG GACAGGGAAC AGGTGGCGTA AAACCCACTA CCATTGCCGC TGCTCAGCAA ACGGGTCGTC AGAGCTGGGG TGCTAAACTG GATGGCTCGC AAATCACCCA GTTTGATGGT AAGCAATATG CTTACTCGGC TCAAAAAGAT AACATCAAAA ATTTCTACCG GACAGGCACC AACTTCACCA ATACAGTTTC GGTCACTAAA GGGGGGGATA ACGGGTCATT CCGTTTGTCA TTGTCTAACC TCGATACCAA GTCGATTCTA CCGAACAGTG GTCTGGGCCG TAAAACGTTT AACCTGACGG CCGACCAGAA CATCACCTCG AAACTAAGCG TGAGCCTGCT GGCAAACTAC ATCGACGAAA AGATTACGGC AAAGCCGCAG TTGAGTGATG GTCCAATGAA TGCCAACAAT GGTTTATTCC TGGCAACGAA CATCGATCAG CGAATTCTGG CCCCGGGTTA TAATACCACA ACCGGACGTG AGATTATTTT TAGTGATGAT GAGTACGTAA CGAACCCGTA CTTCGTAACC AATCAGTACG TGAACGACGT AAGCCGCAAG CGATTGATTT CGATGATTGC GACCAAGTAT CAGTTTGCCG ACTGGATTTA CGCGCAGGGA CGGGTTGGAT ACGATAACGG CAATGACCGG ATTTTCCGGG TTACACCCTA CGGAACGGCT TATTCGCAGG ATGCCAAAGG TGGCCTGGAC GAGCAGTCGA ACGCCCAAAC GACTGAATTG AACATCGACG GTTTGATTAG TGTTAGTAAG GCCATTACGC CCGACTTCTC TATTGATGCT ATCGTGGGTG GTAACATCCG CAAAAATAAC TATGAGAAAA TCGGCATCGG CGGTGGGCCA TTCGTTCTGC CTTACCTCTA TAGCTACAAT AACGTTGTAA ACTTTAACCG GAGCTATGGC TTTTCCAAGT CTGAAGTTCA GTCGGCTTAT TATAGCCTCG ACTTTAGTTA TAAAAGCTTC CTGAACATAA GCACAACGGG TCGCTACGAT GCTTATTCGA AACTGCCCAG TACGGCACGA ACAATTTTTA CGCCGTCTGT AACGGGTGCT TTCATTTTCT CTGAGTTTGT TAAAACACCC AGCCTGAGCT TTGGTAAACT ACGGGCGTCT TATGCAGTTA CCAGTGGTGA ACCAGCAGAC GCCTATGGAA CTAGTGTTTA TTACGGGGTA GGAAGTGCGC TGAATGGTGT TCCTACTGGT AATTTTAGTT CCAGCTTGCC CAACTTGTTC CTCAAACCCT TTACCAAGAG CGAAGTTGAG GTTGGTTTAG AACTTAAGTT CTTCGGTAAC CGGTTAGGAT TCGATCTGGC CTATTTTGAT CAGAAAACAC ATAACGAAAT CCTGCCAGCT AACTACAGCC CGGCAACAGG GTATACGAGT GGGGTAGTGG CTACCGGTTC TACCCAGAAC CGGGGTCTCG AAGTGCTGGT AACCGGCACG CCGGTGAAAA CGGCTAAATT GGCCTGGAAT GTTTCGTTTA ACCTGACTTC GGTTAAAAAC AAAATCCTCC AGACCGATGC CAATAACAAT CCGCTGGGTT TGGGCTCAAA CCGTGCTACA CTGGGGAATG CGACTACTGC GTTTGTTGTG GGTGAGTCTG GTCCGCAGAT CCGCGCGTAT GATTACAAGT ATGCCTCGAA CGGACAAATC ATTGTCGATG CATCCGGCCT GCCGGTTCGG GGTAACCTGA TCAATATGGG TACGGTATTA CCAACGCTCT TCGGTGGTTT AAATAACGAG TTCTCGTTCG GTAATTTCAA CCTGGCGTTC CTGGTCGATT ACAACTACGG TAACAAGATT CTTTCGGCTA CCGAAAACTA CGCCTACCGC CGTGGCCTGC ATAAAGCGAC TTTGGTGGGC CGTGAAGGAG GTATCACCAC GGGTGTTGTA GAGGGCGGTG CTGCCAATAC GGTTAGCGCC ACCGCTCAGA ATTACTACAC GGCACTGGCC AACAACGTAA CCAAAATCAG TGTGGTCGAT GGCGATTTCA TCAAATTGCG GCAGCTGACA TTTGGCTACA ACATACCTGC CAGCGTTTTG ACAAAAGTGC CTCTGATTCG TGCGGTTAAT ATTTCTTTCG TGGCCCGGAA CCTGTTCTAT ATCATGAAGA AAACAACCAA TATTGATCCA GAAGCTACGT TTGGCGCTAA CCTGCGTTAC GCTGGTATTG AAGGAACGAG CCTTCCATCA AGTCGTAACT ACGGGGTTAA CCTAAACATC CGGTTCAAGT AA
|
Protein sequence | MKSKLRRAIP GLLLFLGIWI SAVASPVFAV DSEITGRVSD EKGNDVVGAS VTIKGTNRGT NTDASGKYRI VVPNGSAVLV FSYIGYTKQE VTVGNRSVID VKLEPGSAVL DEVVVTALGI SKEARKVGYA VTTVSSEAFT KARETNVGNS LVGRVAGLTV KGTNGGPGST SKILLRGMPS INSGGAPLIV INGVPMDNTQ RGSAGEWGGA DGGDGIGNLN PDDIETMTVL KGQSASALYG ARSSNGVILI TTKRGKKGDY AIEYNANLTA DSPINFTDFQ YEYGQGTGGV KPTTIAAAQQ TGRQSWGAKL DGSQITQFDG KQYAYSAQKD NIKNFYRTGT NFTNTVSVTK GGDNGSFRLS LSNLDTKSIL PNSGLGRKTF NLTADQNITS KLSVSLLANY IDEKITAKPQ LSDGPMNANN GLFLATNIDQ RILAPGYNTT TGREIIFSDD EYVTNPYFVT NQYVNDVSRK RLISMIATKY QFADWIYAQG RVGYDNGNDR IFRVTPYGTA YSQDAKGGLD EQSNAQTTEL NIDGLISVSK AITPDFSIDA IVGGNIRKNN YEKIGIGGGP FVLPYLYSYN NVVNFNRSYG FSKSEVQSAY YSLDFSYKSF LNISTTGRYD AYSKLPSTAR TIFTPSVTGA FIFSEFVKTP SLSFGKLRAS YAVTSGEPAD AYGTSVYYGV GSALNGVPTG NFSSSLPNLF LKPFTKSEVE VGLELKFFGN RLGFDLAYFD QKTHNEILPA NYSPATGYTS GVVATGSTQN RGLEVLVTGT PVKTAKLAWN VSFNLTSVKN KILQTDANNN PLGLGSNRAT LGNATTAFVV GESGPQIRAY DYKYASNGQI IVDASGLPVR GNLINMGTVL PTLFGGLNNE FSFGNFNLAF LVDYNYGNKI LSATENYAYR RGLHKATLVG REGGITTGVV EGGAANTVSA TAQNYYTALA NNVTKISVVD GDFIKLRQLT FGYNIPASVL TKVPLIRAVN ISFVARNLFY IMKKTTNIDP EATFGANLRY AGIEGTSLPS SRNYGVNLNI RFK
|
| |