Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4723 |
Symbol | |
ID | 8728487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5752104 |
End bp | 5755343 |
Gene Length | 3240 bp |
Protein Length | 1079 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | NHL repeat containing protein |
Protein accession | YP_003389500 |
Protein GI | 284039570 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.645184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCTAC TGAAACCAAT TCAAATGCTA AAATTTTACC CGCCAACCGG CCTCGATAGG TCGGCGACTA ACTCGTTTCG TCCTGACGCC AGTTGGGTCA GCCGACTAAC CTGCTTAGTC GTATTTCTTT TACTGAGCCT GAGCAGCCCA TTGCTAGCGC AATATAATTC CAACGCCGTA ACAGTTGCCG GTACCGGTAC CGCTGGTTCA GCCGCTAATC AGTTAGATAG TCCCTTCGGT ATTTATGTTG ACGAAGCCGG AAGCATGTAT GTAGCTGATT ACAACAACCA CCGCGTCCAG AAATGGGCCT CGGGGGCTAC CTCCGGAACT ACTGTAGCCG GTACCGGTAC CGCTGGTTCA GCCGCTAATC AGTTAAATCA TCCTCTTGGT GTGTATGTAG ACGGAGCGGG AGCCATCTAT GTGTCTGACA CCGATAACAA TCGCGTCCAG AAATGGGCCT CGGGAGCTAC CTCCGGAACT ACAGTGGCCG GTACCGGTAC CGCCGGTTCA GCTGCGAATC AGTTGAATTA TCCTATAGGT ATCTATGTAG ACGGAGCGGG AGCCACCTAT GTGGCTGATG CTTCTAACTC ACGCATTCAG AAATGGGCTG CGGGAGCCAC ATCCGGTACT ACCGTCGCCG GGGGGAATGG ACAAGGCTCA GCTGCCAATC AGCTTTGGAG TGCAGCGGGT GTGTATATAG ACGGAGCAGG AGCCATTTAT GTGGCTGACG GCGGAAACAA CCGCATCCAG AAATGGGCCT CAGGGGCCAC ATCCGGTACC ACCGTGGCCG GTACCGGAGT ATATGGACCC GCCAGCAATC AACTATCATA CCCATTTGCT GTGTATGTTG ATGGAGCGGG CACGATGTAT GTGAGCGACC AGCAATCTCA CCGCATTCAG AAATGGACTG CGGGGGCGAC CTCGGGTACC ACCGTAGCCG GGGGGTACGG CAATGATGCT CTGGTGCCTT ATCAGTTAAA TTACCCAAGA GGTATTTATC TTGACAGGGC CGGAGCCATC TATGTAGCCG ACCAACGCAA CAACCGCATC CAGAAGTTTA GTCGCATCAA TCCACTCGTC AACCCTTCCC TGACCATCGC CACAACGAGT CAGACCAGTT GTACAGGCGC ATCGGTGAGC TTCACCGCCA CTTCCACCAA TGGCGGCACC AGCCCCGCTT ACCAGTGGAA AAAGAACGGC TCCAATGTGG GCACCAGTGA CGCTACCTAT ACCGATGCGG CCCTGACTAG TGGCGATGTC ATTAACTGCG TGCTCACCAG CAATGATCAA TGGGCCTCCC CTACCACGGC CACCAGCAAT AGCCTGACGA TGACGGTCAA CCCGCTACTT ACCCCGGCCC TCACCCTTGC CATTACCACC GGCAGTCAGA CCAGCTATGC GGGCACCGCC ATCACGTTTA CCGCTACGCC TATCAATGGC GGTACCAACC CCAGATACCA GTGGAGAAAG AACGGCACCA ACGTGGGCAC CAACAGTGCC ACCTACACTG ATTTCGCCCT GGCCAACAAT GACCTGATCA GCTGTATACT TGTCAGCAAC GTCACCTGCT ATACCACGCC CACCGCTGAC AGCAATAGCC TGAAGATGAC CGTCATCGCT CTGGTAACGC CTACCCTCAC CATCGCCACC GGTAGCCAGA CCAATTGCGC AGGCAAATCA GTCAGTTTCA CCGCTACGCC TACCAATGGC GGTAGCAGCC CCGCCTACCA GTGGAAAAAG AACGGCTCCA ACGTGGGCAC CAACAGTGCC ACCTATACCG ATGCCGCCCT GGCCAACAAT GACGTGATCA GTTGTGTGCT CACCAGCAAT GCCCCCGGAA CTACGACCAG CACGGCCACC AGCAACAGCC TGACGATGAC GGTCAACCCG CTACTTACCC CGGCCCTCAC CCTTGCCATT ACCACCGGCA GTCAGACCAG CTATGCGGGC ACCGCCATCA CGTTCACCGC TATGCCTACC AATGGCGGTA CCAACCCGGC CTACCAGTGG AAAAAGAACG GCTCCAACGT GGGCACCAAC ACGGCGACCT ACACTGATTT CGCCCTGGCC AACAATGATG TTATCAGTTG TGTGCTCACC AGCAACGTCA CCTGCCCCAC CACGCCCACC GCTACCAGCA ATGACCTGAC CATGACCGTC ATCGCTCTGG TAACGCCCAC CCTCACCATC GCCACCACCA GTAGCCAGAC CAGTTGTGCA GGTACATCAG TCAGTTTCAC CGCTACGCCT ACCAATGGCG GTAGCAGCCC CGCCTACCAG TGGAAAAAGA ACGGCTCCAA CGTGGGCACT AACACGGCGA CCTACACCGA TGCCGCCCTG GCCAACAGTG ACGTGATCAG CTGTGTGCTC ACCAGCAATG CCCCCGGAAC TACGACCAGC ACGGCCACCA GCAATAGCCT GACGATGACG GTCAACGCCC GGCCCGATGC GCCCGCCCTG ACCCCCGCCA GCTCTAGTCT GGCAGCGACT CTGACACCCC TCTCGCTGAC GAGCTTTGCA CTGGCAACCA CCGGCAATAG CCTCCACTTC TTCCAAGCCG GAGGTAGTGA ACTCAGCCCC CCTACCGTCA GTATCGCCAC TGCCGGGGTT ATGAGCTTTT CGGTCGGCCA GACCAACAAC GCCAGCGGCT GCAAGAGTTT ACTCACGCCA TTGAGTCTGA CCATCACGGC CACCCCCACC AGCCAGACCG TTTGCCGCAG TAGCAACGCC ACTCTGAACG TCACTCTGGT GGGAACCGCC TTCCAGTGGT ACAAAAACGG TACTACCACA GCCAACAAAC TCACCGAGCT GACCAGTGCC CAGCGCGGTA CGACCACCGC CACCCTGACA CTGGTCAATT TGCAAACCAC CGCCGACTAC TACTGCAAAA TCACTACTTC CACCGGCGTT CAGACCGTGG GGCCCCTGAA GGTGAGTGTC AACTTTGGCT GTTCGGCCCG GCCTGCGGCC GAGGAAGCAG ACTTGCAACT ATTGGTACTG GTCAGGCCAA ACCCTATCGT AGACGGCCAC CTGCGGGCCC TGGTGAAGGG GGCTCAGGGG CAAGCCCTGA ACGTAGCCCT CTACAGTCTG CAAGGGGAGT TGGTGAACCA GCAGGTCTGG CCCTCGGCAC CCGCCGAAGT CAATCTGGAT TGGGACATCA GCCAGCGAAC CACGGGAGTG TTACTCTTGC GGGCCCAGAC CCCAACTCAA CAGCAAACCA TCAAGATTAT CCAGAATTAA
|
Protein sequence | MYLLKPIQML KFYPPTGLDR SATNSFRPDA SWVSRLTCLV VFLLLSLSSP LLAQYNSNAV TVAGTGTAGS AANQLDSPFG IYVDEAGSMY VADYNNHRVQ KWASGATSGT TVAGTGTAGS AANQLNHPLG VYVDGAGAIY VSDTDNNRVQ KWASGATSGT TVAGTGTAGS AANQLNYPIG IYVDGAGATY VADASNSRIQ KWAAGATSGT TVAGGNGQGS AANQLWSAAG VYIDGAGAIY VADGGNNRIQ KWASGATSGT TVAGTGVYGP ASNQLSYPFA VYVDGAGTMY VSDQQSHRIQ KWTAGATSGT TVAGGYGNDA LVPYQLNYPR GIYLDRAGAI YVADQRNNRI QKFSRINPLV NPSLTIATTS QTSCTGASVS FTATSTNGGT SPAYQWKKNG SNVGTSDATY TDAALTSGDV INCVLTSNDQ WASPTTATSN SLTMTVNPLL TPALTLAITT GSQTSYAGTA ITFTATPING GTNPRYQWRK NGTNVGTNSA TYTDFALANN DLISCILVSN VTCYTTPTAD SNSLKMTVIA LVTPTLTIAT GSQTNCAGKS VSFTATPTNG GSSPAYQWKK NGSNVGTNSA TYTDAALANN DVISCVLTSN APGTTTSTAT SNSLTMTVNP LLTPALTLAI TTGSQTSYAG TAITFTAMPT NGGTNPAYQW KKNGSNVGTN TATYTDFALA NNDVISCVLT SNVTCPTTPT ATSNDLTMTV IALVTPTLTI ATTSSQTSCA GTSVSFTATP TNGGSSPAYQ WKKNGSNVGT NTATYTDAAL ANSDVISCVL TSNAPGTTTS TATSNSLTMT VNARPDAPAL TPASSSLAAT LTPLSLTSFA LATTGNSLHF FQAGGSELSP PTVSIATAGV MSFSVGQTNN ASGCKSLLTP LSLTITATPT SQTVCRSSNA TLNVTLVGTA FQWYKNGTTT ANKLTELTSA QRGTTTATLT LVNLQTTADY YCKITTSTGV QTVGPLKVSV NFGCSARPAA EEADLQLLVL VRPNPIVDGH LRALVKGAQG QALNVALYSL QGELVNQQVW PSAPAEVNLD WDISQRTTGV LLLRAQTPTQ QQTIKIIQN
|
| |