Gene Information |
Locus tag | Slin_4319 |
Symbol | |
ID | 8728079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5229225 |
End bp | 5232440 |
Gene Length | 3216 bp |
Protein Length | 1071 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003389100 |
Protein GI | 284039170 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
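The coordinate and length fields above are mutually consistent, which a short sketch can make explicit. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 5229225
END_BP = 5232440
GENE_LENGTH_BP = 3216
PROTEIN_LENGTH_AA = 1071


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("gene length (bp):", span_length(START_BP, END_BP))
    print("protein length (aa):", protein_length(GENE_LENGTH_BP))
```

Under these assumptions the computed values match the Gene Length and Protein Length fields listed in the table.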
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0290506 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.156669 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGAAT ATATACGTAG CGGTCTTCGA CTGACCGTTT GGCTGACGTT CCTGGCGAAT GCAGCAGCGC CCATACAGGC ACAGCAATTG GCTGCTGCCC TGACCAGGCA AAAATCGTCG GCAGCACTGG CCGCCAGCAC GAGCGCGTCG GTTCAGCTAG TAACAGGCCG CGTAACCGAC GAAACCGGCA CGGGCTTGCC GGGTGCCAAC GTGACCGTAA AAGGCACTAC CACCGGAACC GCCACTGACG AGAAAGGGCA GTACCGAATC AGTGTTCCAA CGCCCAATGC GGTGCTGGTC TTTAGTTCGG TTGGCTACCT GAAGCAGGAG GTCAGCGTAG GAAACCGGAC AACGGTAGAT ATTCAGATGC GCGTTGACAA CCAGAGTTTG AGCGAGGTTG TCGTGATCGG GTACGGCGAG CAGTCGCGGA AAACGCTCTC CACGGCCATT GCCAAAGTAG AGGGCAAAAA TATTGGTATA CAGCCCGTCA GTACACCCGG TGAGGCTCTG GCCGGTCTGG CGGCTGGTGT GCAGGTGCAG TCTGACCGGG GGAGTACGCC GGGCGCACCA CCCACTATCC GTATTCGGGG AGTTGGTTCG CTGAGTACGG GCAGCACACC GCTGTATGTC GTTGACGGCT ATCCGCTACA GGACCCCGCC CAGTTTGCGC TCATTAATCC AACGGACATC GAGTCGATGG AAATTCTGAA AGATGCGGCT TCGGCAGCTA TTTACGGGTC ACGGGCGGCC AACGGGGTTG TTATTGTAAC GACCAAGCGA GGCAAAGCGG GCAAAACCAG TTTGAATGTA TCCATCTACA CAGGCATTCA GCAGCTGGCC AAGAAAGTAC AGCTCCTGAA TCGGGATCAG TACATAGAGA ACGCCATTTA TGCCTCCAGA CTCAAGAATA TACCTTACCC AAAAGTATTT GATACCAAAC CCGACAGTTT GCCCGATACT GACTGGCAGG ATGCTATTTT CCGGCAGGCG GCCATCAGTA ACTACCAGAT TTCGGCCACG GGCGGCACCG ATAAGGTTCG TTTTGCTGTC TCGGGCGGGT ATTTCAAGCA GGACGGTATC CTGAAAGGTT CAGCCTACGA ACGGTATAAC CTGCGCTTTA ACCTGGATGC CGACTTGAGT CCTAAACTCA AATTAGGGGT GTCGATGGCG CCTTCCTACA GCAGTCAGTT TCAGCAGCAG GCGGCCGGGC AGTTCAACGG ATCGAACGGT ACCGAAACCA GCGGCACCCG GTCGTTACCC AGTGCCATTA TTTCGGCCAT CGACATGCCG CCAACCATTC CGGTGTATAC GCCCAATGGC GATTACGCGC AGACCTTCAA CGGCAACACG AACCCCAATG GTACCAATTT TTACCAGACC AACCTCTATA ACCCGCTGGC CGTTCTGGAG CTTAGCCGCA ACAACCTGAA AGGCTATCGG CTGTTCGGCA ATGGCTTTCT GGAATGGCAA CCGATTGCCA ACCTGCGGCT GAAAACAACG CTGGGGTCAA CGCTGAGTAT TTTCGATCAG TCGGCCTATA TCCCGGCCAA TCTGGCCAAC GAATCGGCTC CCCGCGCCAA CTCCACGAAC CCGGTGCTGG GTCAGATATT CGCCCGCGAG TCGCAGACGG TGACGCTGGA CTGGCTCTGG GAAAATACCG CTACCTACAA CAAGACGTTT GGTAACCACA ACTTCTCGCT GCTGGCCCTG TACTCGCTCC AGAAATTACA GGCTAAAAAC ACGGCTACGT CGGGCCGGTC GGGTAGCTAC ACGACCAGTC TGCTGGATAA CCCGCTGGCC 
TCGCCCGACC GGATTGGTGA GCTGAACTAC GATCAGAACG CCTTTCTGTC ATTGGGCGGA CGGATCACCT ACGACTTCAA AAGTAAATAC ATCTTTTCGG CCGCCATTCG CCGGGATGCG TCGTCGCGCT TCGGGCCAAA CAACCGCTTT GCTACGTTCC CATCCATCTC GGGAGCCTGG CGGATCAGCG AAGAGAAGTT CTGGTCGGGC CTGAAGAACA GCATCAGCGA ATTCAAAATC AGGGCCAGCT ATGGCGAAAC GGGCAATGCC AATATTGGTA GTTTCAACTG GACAAACAGC GTACAGGGCC GGAACTACAG CTTCAATCAG GCGCGGACCT TCGGCTATGC CCAGACCGGC TTTGCCAACT ACGACCTGAC CTGGGAGAAA AACGTGCAGA CCGACCTGGG CCTCGAAATG GGTTTCCTGA ACGACCGATT CACCCTGGGC CTCGACTATT ACAATCGACT GACAACAGGT ATGCTGTTCC AGAAAGATTT GCCGGGCATT GTGGGCTACG CTACTAATTT CCGAACCAAC ATCGGCAGCC TGCGCAACCG GGGGCTTGAA CTCTCAGCCC GGGCGAACCT CACTGTAGGT GCTGTTCGCT GGACGATAGA CGGCAATATT TCGGGCAACC GCAGCAAGGT GATGGATCTC GGCGGTCCTT CGTCACTGCC AACGGTAGCG GCTATTTTTG GCTGGAATAA CGTCTATCAG GTTCGCGTGG GCGACCCGCT GGGCAATATG TATGGCTATC AGGTGGTGGG TATCTTCAAA AATGCCGATG ACCTCAGCAA GAACGCCCAG TTCACAACGG GCGACAAAGT GGGGAACTGG ATGATTCGGG ATCAGAATGG CGATAATAAA ATCGACGAGA ACGACCGGGT GTATGTCGGC AAAGGCGTAC CCAGCTATAT CTGGGGGATG ACCCACAGCT TTCAGTACAA AAACTTTGAC CTGAGCGTCA TTCTTCAGGG TGTACAGGGC GTCAATGTCA TCAATGGAAA CCTGCGGCAC ATCTGGGCAA ACCAGGTGTT CAACACCATT CCGCTTTACT TCCGGAACCA GTTCGATCCG GCCAACCCGA CGCAAAACAC CGACTTCCCG GCGGCTGGTG CGGGGGGTAT TCACCCCGGC AACAACCTCA CCGACCGGTT GCTTTTCGAC GGTTCGTTTG TCCGCATTCG TAACCTCACC TTCGGCTACT CCGTACCGAC GGTTTTCCTG AACAAGATCA AGCTACAGTC GGCGCGCATC TACGTGACGG GGCAAAATCT GTTCACCTTC ACCAGCTATC CCTGGTACAA TCCGGAGACT AACACCGTGC CCGATTCACC CGTGCAGATT GGCGTCGATC AGGGTACCTA CCCACTGGCA CGTACCTACA CCATTGGCCT AAATATCGGC TTCTAA
|
Protein sequence | MREYIRSGLR LTVWLTFLAN AAAPIQAQQL AAALTRQKSS AALAASTSAS VQLVTGRVTD ETGTGLPGAN VTVKGTTTGT ATDEKGQYRI SVPTPNAVLV FSSVGYLKQE VSVGNRTTVD IQMRVDNQSL SEVVVIGYGE QSRKTLSTAI AKVEGKNIGI QPVSTPGEAL AGLAAGVQVQ SDRGSTPGAP PTIRIRGVGS LSTGSTPLYV VDGYPLQDPA QFALINPTDI ESMEILKDAA SAAIYGSRAA NGVVIVTTKR GKAGKTSLNV SIYTGIQQLA KKVQLLNRDQ YIENAIYASR LKNIPYPKVF DTKPDSLPDT DWQDAIFRQA AISNYQISAT GGTDKVRFAV SGGYFKQDGI LKGSAYERYN LRFNLDADLS PKLKLGVSMA PSYSSQFQQQ AAGQFNGSNG TETSGTRSLP SAIISAIDMP PTIPVYTPNG DYAQTFNGNT NPNGTNFYQT NLYNPLAVLE LSRNNLKGYR LFGNGFLEWQ PIANLRLKTT LGSTLSIFDQ SAYIPANLAN ESAPRANSTN PVLGQIFARE SQTVTLDWLW ENTATYNKTF GNHNFSLLAL YSLQKLQAKN TATSGRSGSY TTSLLDNPLA SPDRIGELNY DQNAFLSLGG RITYDFKSKY IFSAAIRRDA SSRFGPNNRF ATFPSISGAW RISEEKFWSG LKNSISEFKI RASYGETGNA NIGSFNWTNS VQGRNYSFNQ ARTFGYAQTG FANYDLTWEK NVQTDLGLEM GFLNDRFTLG LDYYNRLTTG MLFQKDLPGI VGYATNFRTN IGSLRNRGLE LSARANLTVG AVRWTIDGNI SGNRSKVMDL GGPSSLPTVA AIFGWNNVYQ VRVGDPLGNM YGYQVVGIFK NADDLSKNAQ FTTGDKVGNW MIRDQNGDNK IDENDRVYVG KGVPSYIWGM THSFQYKNFD LSVILQGVQG VNVINGNLRH IWANQVFNTI PLYFRNQFDP ANPTQNTDFP AAGAGGIHPG NNLTDRLLFD GSFVRIRNLT FGYSVPTVFL NKIKLQSARI YVTGQNLFTF TSYPWYNPET NTVPDSPVQI GVDQGTYPLA RTYTIGLNIG F
|
| |
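The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to remove that spacing and compute the length and GC fraction of a nucleotide string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


if __name__ == "__main__":
    # First two display blocks of the gene sequence, used only as sample input.
    fragment = clean_sequence("ATGCGTGAAT ATATACGTAG")
    print("bases:", len(fragment))
    print("starts with ATG:", fragment.startswith("ATG"))
    print("GC fraction of fragment:", gc_fraction(fragment))
```

Applying the same two functions to the complete gene sequence should give a value comparable to the GC content field in the Gene Information table, though that field may have been rounded or computed differently.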