Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0801 |
Symbol | |
ID | 8724532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 969997 |
End bp | 973347 |
Gene Length | 3351 bp |
Protein Length | 1116 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003385663 |
Protein GI | 284035733 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.383306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.353517 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAAA AATCTACTTT GTACTGCCAC TGCCGGGTGC ACATTGGCCA ATTCAGGTCA GCGTTCGGTC TGTTGCTTCT AACCTGCCTG TTGATGGGAG GTTCTGCTTT TGCCCAAAAT CGGGTCGTTA GCGGGAAGGT AACGGATGCT AAAACCAATG GCCTGCCGGG TGTTAGTATC ATCATCAAAG GAACCACAAC CGGAACCACA ACCGATGCGA ACGGCGATTA TTCGCTCAGC GTACCGTCTG CTGAAGCGAC CTTGACATAT TCCTACATTG GGTTTGATGC ACAGTCGAAA ACGATCGGTA GTCAGTCGGT TATCAACATA ACACTTGTTG AAAATACGGC GCAGCTCAAC GAGGTTATTG TAACGGCTCT GGGTATCAGG AAAGAAGCCC GAACAATTGG CTATACCACG CAGGATGTAG CGGGCGATCA GCTCGTGAAG GCGCGTGAGC CAAACCCGGT TAACTCGCTG ACCGGTAAAA TCGCCGGTCT GACGGTCGGG CCTTCGGCCG AGATGCTGTC GAAACCCAAG CTCCTGCTGC GGGGTAACAG CGATCTGTTA TTTGTCGTCG ATGGTGTTCC CATCAACTCG GATACCTGGA ACGTGTCGGC CGATGACATT GAAACGTACA CCGTCTTGAA AGGTCCTAAC GCAGCTGCTC TTTATGGCTT CCGGGGGCAG AACGGAGCGA TCATGATTAC GACCAAGAAG GGGACGAAAG ACAAACGCAA AATTGCTGTC GACTTCAACA CGAGTACCAT GTTTGAATCG GGTTTTCTGG CTTTGCCCGA CCGTCAGAGT GAATATGGAT ACGGGAATAA CTTCAAGTAT GCCTATGGCA ATAAGCTCTA TGACGAAGAC GGTGGCTACC GCCGGACAAA CCTGTGGGGC CCTCGTTTCG AAGGACAAAA TGTGCCGCAA TACAATAGCC CGGTAAATCC AACAACGGGT ATCCGTCAGG GAACGCCCTG GCTAAATGTT GGTAAGGACA ACTTCAGGAA TTTTGTACAG ACCGGTATCA TTTCGACCAA TAACGTTTCG GTATCGTCGA GTGGTGAGAA GTATGACCTG CGGATGTCGG TATCGAACAA CTACCAGCGG GGTATCTACC CAAACACACG GCTGAACATT ACCAATTTCA ACCTGACCAC GGGTATCAAT TTTACCGACA GGCTGCGGTT TGATGGTAGC TTGAATACGA ATATCCAGGC ATCGCCAAAT ATTCCAGAAT ATAGCTCAGG TCCCGAGAGT TATGTGTACG CCTTTCAAGT ATATGGTTCC AGTAGCTGGG ACCTCGCCGA TATGCGCGAT TATTACAAAG GGCCACAGGG TAAGCAGGGT GTGCAGCAGT ATTACGCGGA ATATGGCCGG GAGAATAACC CTTATTTCGT TGCTTATGAA TGGCTACGTG AGCATCGCAA AACGGATATT TACGGCTATA CCCGGTTGAG CTACAAAATC AATGATTTCC TGAACCTATC TCTCCGGACA CAGATAACCA CCTGGAATCA GCTGCGGACC GAAAAATTGC CCTATTCCAT GATCACTTAC AAATCACCTG ATTTGCGGCA GGGTGATTAT CGCGAAGATC GTCGGAACAT GCTCGAAAAC AATACGGACC TGCTGCTGAC CTTCAACAAG GACGTAGCCA AAGATTTTCA TATTAACGCA TCGGCCGGTG CCAACGCGCG CACGTTTACT TACAATTCGA ACTGGACCAC AACCGACTTC CTGATTGTGC CGGGTGTGTA TGCGTTTACC AACTCGAAAA ACCCTGTTCG GGCCTACAGC TTCCGCTCGG ATATGCGGGT TCTGAGCGCC TACGCAACCA GTGACTTTAC CTACAAAAAC CTGGTGACAC TGGGTGTAAC GGGCCGGTTT GATAAACTCT CGACTTTGCC AAAAGAGAAC AACACGTATT TCTATCCGTC TGTGGCCCTT AGTACAGTCG TATCAGACTA TGTGAAAATT CCGGAAGCGA TTTCATTCCT GAAACTACGA GGTTCCTATG CCAACGTGCG TGGTGGGTTG ACGCAGTCGG AAATTGGTAC GGCCTACCGG GCGGTAACCG GTAGCGGTAC CGATGCCTTA ATAGGCTACG GTACCGACCT GACCTCCTCG TATGACGGTC CAAGCTACGC CAACCAGAAC ACCTACAGCA TCTCAACGGG CCTTTATAAC AACACCCCAA TGGCCAACTA TTCGGGAACG CTGGCCAACA AATCGCTGAA GGCGTATACC GTTAGCTCGT ATGAGTTTGG TTTTGATGCC AAATTTTTAG GCAATCGACT GGGCTTTGAC CTGACTCATT TTACCGCCGT GAACGGTCCA CAGATTTTTG CCTTACCGGT GCCAAGTTCA ACCGGATTCT ACAATGAAAA CGTAAACGGT CTGGTAACGA AACGGGACGG CTGGGAAGTG TCGGTAACGG GATCGGCCCT TAAAAATCCA AACGGTCTGA ACTGGGATGT GTTGGCAAAC TGGTCGACGT TTAAAGAGCG GCTGAAAGAG ATATACGGTA ACGAAACAAG TATATATCTC AGCGGCCCTG ACCACGTCTT TACCATCGGT GACCGGCTTG ACGGCTATTA TAGCTACAAT TTCCTGCGCG ATCCAAACGG TAATATCATT AACTCAGCTA CCGGACAGCC ACTAACACGT CCTTCTGGAA CGAACACCAA GCAGCTACTG GGCTATACGA ACCCTGATTT TGTGTGGTCG CTCAATAACC GCTTCAGTTA TAAAAATTTC AACTTCAGTT TCCAGTTCGA CGGCCGTGTG GGCGGTGTTA TTCGCGATCA GGTATATGCC TATGCCATGA ACGCGGGTAA CCAAAAAGAT CTGGTAACCG GAGCCTTTGG CGAAGCTCGT TTGAAGGAAT GGCAGAGTAC AAATACTAGC ACCGTAGCGG CAACCCCCGC TTACGTTGGC CCAGGTGTGG TGACAACGGG TCAGGTTAAG TTCGACGGTC AGGGCAACAT CAGTAACATG AGTGAGTTAA CCTTCTCGCC GAACACCAAA GCGGTAACGG TGCAGTCATA TGCTCAGGGT GTTTATAACA GCGGTATCGA AGAATCCTAT ATGGTTAGCA AAACATACGC TAAACTACGG GAGGTCATCA TTGGCTACAC GGTGCCCGTT ACGGTGTTAC CCCGGTTTAT TCGGGCGGCT TCGGTATCCG TAGTAGGTCG TAACCTGCTC TATTTCGCTC AGCGTAAGGA TTTCGACCTG GACCAGTTCC CGGAAGGCTA CAACGCCACA TCTAACTCCA CCCTGCGTAA CCCTGGTTTG CAGTCGTCGA CGTTACGCCG ATTTGGCGTG AATCTAAATC TGACATTCTA A
|
Protein sequence | MKQKSTLYCH CRVHIGQFRS AFGLLLLTCL LMGGSAFAQN RVVSGKVTDA KTNGLPGVSI IIKGTTTGTT TDANGDYSLS VPSAEATLTY SYIGFDAQSK TIGSQSVINI TLVENTAQLN EVIVTALGIR KEARTIGYTT QDVAGDQLVK AREPNPVNSL TGKIAGLTVG PSAEMLSKPK LLLRGNSDLL FVVDGVPINS DTWNVSADDI ETYTVLKGPN AAALYGFRGQ NGAIMITTKK GTKDKRKIAV DFNTSTMFES GFLALPDRQS EYGYGNNFKY AYGNKLYDED GGYRRTNLWG PRFEGQNVPQ YNSPVNPTTG IRQGTPWLNV GKDNFRNFVQ TGIISTNNVS VSSSGEKYDL RMSVSNNYQR GIYPNTRLNI TNFNLTTGIN FTDRLRFDGS LNTNIQASPN IPEYSSGPES YVYAFQVYGS SSWDLADMRD YYKGPQGKQG VQQYYAEYGR ENNPYFVAYE WLREHRKTDI YGYTRLSYKI NDFLNLSLRT QITTWNQLRT EKLPYSMITY KSPDLRQGDY REDRRNMLEN NTDLLLTFNK DVAKDFHINA SAGANARTFT YNSNWTTTDF LIVPGVYAFT NSKNPVRAYS FRSDMRVLSA YATSDFTYKN LVTLGVTGRF DKLSTLPKEN NTYFYPSVAL STVVSDYVKI PEAISFLKLR GSYANVRGGL TQSEIGTAYR AVTGSGTDAL IGYGTDLTSS YDGPSYANQN TYSISTGLYN NTPMANYSGT LANKSLKAYT VSSYEFGFDA KFLGNRLGFD LTHFTAVNGP QIFALPVPSS TGFYNENVNG LVTKRDGWEV SVTGSALKNP NGLNWDVLAN WSTFKERLKE IYGNETSIYL SGPDHVFTIG DRLDGYYSYN FLRDPNGNII NSATGQPLTR PSGTNTKQLL GYTNPDFVWS LNNRFSYKNF NFSFQFDGRV GGVIRDQVYA YAMNAGNQKD LVTGAFGEAR LKEWQSTNTS TVAATPAYVG PGVVTTGQVK FDGQGNISNM SELTFSPNTK AVTVQSYAQG VYNSGIEESY MVSKTYAKLR EVIIGYTVPV TVLPRFIRAA SVSVVGRNLL YFAQRKDFDL DQFPEGYNAT SNSTLRNPGL QSSTLRRFGV NLNLTF
|
| |