Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4810 |
Symbol | |
ID | 8728574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5865385 |
End bp | 5868606 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003389587 |
Protein GI | 284039657 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0674411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.790047 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACACA ATTTACGAAC GAGGATTAGC GCCTGTATTC TAGCAATATG GGTCACTTTC CTGATTAGCC ATGCCGCGTT GGCACAGGAT CGACGCGTTA CGGGCCGGGT GGTAGCGGCA AAGGACCAGC AGCCTATTCC GGGGGTAACA ATCTTGGTTA GAAATACTCA GTTAGGTACT ACGACCGATG CTAACGGTTC GTTTACACTT AACGTACCAG CCAACTCTAC GCTGGTATTT AGTGCCATTG GTTTTGCCGG GCAGTCACTG GCCATCGGCA ACCAGACCCA GTTAACAATA ACGCTTCAGG AAGCCGAGCA AAATCTGGGC GAAGTAGTCG TAACAGCGCT GGGTATCAAA AAAGAAGCCA AACGGCTGGG CTATGCTACC GCCATTGTCA ATCCTGAGCA GGTAACCACT AACCGTACGG TTAACTTCAT TAACGCCTTA CAGGGTAAAA TTGCGGGTGT TAACATCAGC AGCCTGGGTA CGGGTGCCGC CGGAACGAGT AAGATCCGTA TTCGGGGTCA GTCGTCCTTC TCGGGGCAGA ACAGCCCGCT TATCGTAGTA AACGGTGTGC CAATTGACAA CACCAACTTC GGTCAGAATA ACGGGAACAC CGGTGGCGAT AACTCCATCG GCAACCGCGA CCGCAACTAC TCCGACGGCG GTGACGGTCT TTCGTCCATC AACCCGGATG ATATTGAGGG AATGACGGTG CTGAAAGGCG GTACGGCTGC GGCTCTGTAC GGCTCCCGCG CCAAAGACGG TGCCATCCTG ATCACGACAA AAACCAAAGG TACCGGTCAG GGTATTGGTG TAACGTTCAA CAGCAACTTC ACTACAGACC GCCCGCTTGA TTTTACTGAT TACCAGTATG AGTACGGACA GGGTGAATAT GGAGTGCGGC CAACAGCGGC CAACCCAACA TCGGGCGTAT GGAGCTTCGG GGAGAAGTTT GCCGGTCAGA CGCAAGTGCT GTTTGGGGGT GTAACGGTGC CCTATGCGCC AGTTCGTAAC CGGATCAACA CGTTCTACCG GGATGGGTCG ACATGGACCA ATTCGATTTC GGTATCGTCG GGCAGTGAAA AAGGCGGGTT CAACTTGTCT ATCGCTAACC TGGACAACAA AGGCATCACG CGCAACAACA CCTTTAACCG GAAGACGATG AACCTTGGTT TCAGCTACAA CCTGTCGCCA CGGTTAACCG TTACGGGTAC ACTCAACTAC TCCAACGAGT ACAACAAAAA CCCGCCCCAA ATTGCCCAGC AGGACAACAG TACGCCAACG GTAATTTACA CACTGGCCAA CTCCATGCCG CTGGACGTGC TGGAAGCCAA CCAGATCAAC CCGGCTACGG GCAACGAGTT CGTGTATTCG CGCTTCATGA ACCGCACGAA TCCGTACTTT GTCCTCAACA ACAAGTTTGA GAACATTCGC CGGGATCGCC TGTTCGGTAA CCTTACGGCC CGCTATAACG TGACCGACTG GCTGTACGTG CAGGGACGGG TTGGGCAGGA TTACTGGTCG CGCGATCAGG ATTATAACTT CCCAACGGGG CAGGCCTCTC TGGCAGCAGC ACCAGCCGGT TTTGTAAACG GAGCCTATGT ACAGGAAGCC CGTCGTTTTC GCGAAATCAA CGCCGACTTC CTGATTGGTG CCAACCACAA GTTTGGCGCG TTCGGTGTCG ATCTTACCGT TGGCGGCAAC CAGTTGTACC GTCGAAGCGA CCTCAACAGC GTACAGGTAA CCGACTTCAT TGTGCGGGGC TTATACGTAC CGCAGAACGG ACGGGTGAAA GACCCCATCT ATGGTCTGAG CGAACGGAAA GTTAATTCGC TTTACACAGC AGCCGAATTT TCGTTCAAAG ACGTCCTCTT TCTGAACGGT ACGCTACGTA ACGACTGGTT CTCTACCCTT GCGCCAGCTA ACCGCAGCAT TCTGTACCCA TCGTTAACAG GTAGCTTCGT CTTTTCGCAG GCCTTCGACA ACCTGCCCTC TTTCATAAAC TTCGGTAAGA TCCGGGCTGC ATATGCCGAG GTCGGCAGCG ACGGCGACGT AGCACCTTAC TCGAACAACC TGTTCTATTC GGTCAATGCC AACCTGTTCC CGAACCCCGC AGGTCTGGGC CAGCCGGTCG GTAACATTAC ATCCAGTACC GTGCCAAGCT CCACCCTCAA ACCCAGCCGT ACCGCCGAAA CCGAAGTGGG TCTGGAGTTG AAGTTGTTCA ACAACCGGGT TGGCCTGGAC ATGGCCGTGT ATCGCAAAAT TACCAGCGAC CAGATTGTAC AGGCCCAGTC GTCCGATGCG TCGGGGTACA CCTCTACTCT GATTAACAGT GGACAGAGCC AGAATCAGGG CATTGAAGTA TTACTGAATC TCGCACCTAT TCGCACTAAG GATTTTTCCT GGGACATTAC GCTGAACGGG GCTTACAACA AGACCAAACT ACTCCGGCTA CTCACCGACG ACGACGGCTC GCCCGAGAGA GATTATAACA AAGACAAACA GGCCGAGCAG ATTGTGGTCG GTACGGGTAT TTACGTGGGT GATCTTCGGC AGGTAGTTGG CCAGGAACTG GGTCAGTTAT ATAGTTTCGG CTACCAGCGC GACGCACAGG GACGTATCAT TCACGGGGGC GATGGTCTGC CCGTTCGGAC ACCGGCGCCT ATTTCGTTCG GCTCAGCTCT CCCTAAGTAT ACGGGCGGTA TCACCAACAC GTTCAACTAT AAAGGCGTTA ATCTCTCGTT CCTGATCGAC TTCAAGCTCG GTGGCAAGAT GATCTCGGGT ACCAACCTGA ACGCTTTCCG TCATGGATTA CAGAAAGAAA CACTGGTAGG CCGGGGCGAA GCCGACAACA AAATGGTGGG TGTTGGCGTG AACGATAAAG GTGAGGTAAA CGCCGTTCGG GCGTTCGTGC AGGACTACTA CTCGGTAGGT CGTTCCAAAA GCCTGGGCGA GCAGGTAGTA TATGACGCCG GTCTGTGGAA ACTCCGCCAG ATCAGCCTTG GCTACGACTT CACCAAGATG CTGCCTAAAA GCCTGTTCAT TAAAGGTATT CGGTTGAGTG CGGTAGCTAA CAACGTGGCC ATCATCAAAA AATGGGTACC CAACATCGAC CCCGAGCAGT TTGGCTTTAG CTCCGACAAC CTGATCGGTC TGGAATCAAC CGGCTTACCG ACAACGCGCA GCATTGGCTT TAACCTGAAT GTTAAATTCT AA
|
Protein sequence | MGHNLRTRIS ACILAIWVTF LISHAALAQD RRVTGRVVAA KDQQPIPGVT ILVRNTQLGT TTDANGSFTL NVPANSTLVF SAIGFAGQSL AIGNQTQLTI TLQEAEQNLG EVVVTALGIK KEAKRLGYAT AIVNPEQVTT NRTVNFINAL QGKIAGVNIS SLGTGAAGTS KIRIRGQSSF SGQNSPLIVV NGVPIDNTNF GQNNGNTGGD NSIGNRDRNY SDGGDGLSSI NPDDIEGMTV LKGGTAAALY GSRAKDGAIL ITTKTKGTGQ GIGVTFNSNF TTDRPLDFTD YQYEYGQGEY GVRPTAANPT SGVWSFGEKF AGQTQVLFGG VTVPYAPVRN RINTFYRDGS TWTNSISVSS GSEKGGFNLS IANLDNKGIT RNNTFNRKTM NLGFSYNLSP RLTVTGTLNY SNEYNKNPPQ IAQQDNSTPT VIYTLANSMP LDVLEANQIN PATGNEFVYS RFMNRTNPYF VLNNKFENIR RDRLFGNLTA RYNVTDWLYV QGRVGQDYWS RDQDYNFPTG QASLAAAPAG FVNGAYVQEA RRFREINADF LIGANHKFGA FGVDLTVGGN QLYRRSDLNS VQVTDFIVRG LYVPQNGRVK DPIYGLSERK VNSLYTAAEF SFKDVLFLNG TLRNDWFSTL APANRSILYP SLTGSFVFSQ AFDNLPSFIN FGKIRAAYAE VGSDGDVAPY SNNLFYSVNA NLFPNPAGLG QPVGNITSST VPSSTLKPSR TAETEVGLEL KLFNNRVGLD MAVYRKITSD QIVQAQSSDA SGYTSTLINS GQSQNQGIEV LLNLAPIRTK DFSWDITLNG AYNKTKLLRL LTDDDGSPER DYNKDKQAEQ IVVGTGIYVG DLRQVVGQEL GQLYSFGYQR DAQGRIIHGG DGLPVRTPAP ISFGSALPKY TGGITNTFNY KGVNLSFLID FKLGGKMISG TNLNAFRHGL QKETLVGRGE ADNKMVGVGV NDKGEVNAVR AFVQDYYSVG RSKSLGEQVV YDAGLWKLRQ ISLGYDFTKM LPKSLFIKGI RLSAVANNVA IIKKWVPNID PEQFGFSSDN LIGLESTGLP TTRSIGFNLN VKF
|
| |