Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1642 |
Symbol | |
ID | 8725377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 1975065 |
End bp | 1977041 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | type 3a cellulose-binding domain protein |
Protein accession | YP_003386488 |
Protein GI | 284036558 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.117524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCAT TTTTACGCTC ACTGAGCCTC GCATGGCTCT GGCTTACGGC CGTCTTCCTG TTGCACGGAG CTATCGCGCA ACCTGTCCAG CAAGGTGACC CAACACCGCA AATCACCGTC ACGCCTTCAT CACTGACCAT TCTGAATTAC AGTGAGTATG AAGGTCAGTT TCCGGCCAGT TACACTGTTT CGGCCAGCGG GTTAACAAAT GATCTGGTCA TTACTGCTCC GCCTTACTTC CTGCTCAGCG CCCGGGGAAA TACAGGGCCG AGCCTTTCCC TGCCGATTGT CAATGGGGAA GTATCACCAA CACAGGTTTC GGTAATACTG CTCGCGTCGT CACCCGGTAC GTTTACCGGT GTCGTTACCA ATGTCAGTGG CTCCGCAACA GCGACTGTGG CGGTGAGTGG AACCGCCATT TCGGAGTCGG TGAGTGTAAG CCCCAGTACT CTGAATTCGT TTACGACAAC GGCCGGGCAG CCTTCAGCTG TTCAATCCTA TACCGTTACC AGTCGTGGAG GTCTGTCTGT TGTTGTCAAT GCTCCAGCCG GATTCGAGAT TCGTACGGGC AGCGCAGCGT TTGGCTCATC GCTGGTGATT GGCCCGAGTT TGTCGTATAA GAATACACAG GTCGATGTAC GCTTGGTTGG AACAACGCCC GGAACTGTTT CGGGGGTTAT AGCCAATGAT ACCTACTACC ACTCGGCTCA CCTTACGTAT CCGGTAGCAG TGAGTGGCGT GGTTACACCG GTTACTGCAT CTGCTTCACT GAGCGTGCTG CACCGCGATG CTGATTATGG CAATCGAACG GATCAGCTTA TCCGACCTTA CCTTGAGCTT GGTAATGAAG GCACTACGGC CATTCCGTAT AGCCAGATTA CCCTGCGGTA TTGGTTTACC TCCGAGGGGG GCTCACCACC TACCGATTTG CAGGTGTACT ACGCACAGAT GGGAACCCGT TACGTCAGGA TGAAGTATGT GCCGCTTGCA GAGCCGCGCC AGGGGGCATT CGGTTATGTT GAGTACAGTT TCGACGCATC GGCGGGGAGC TTAGCCGCCG GGAGTCGGTC GGGTCCGATT GAGAACGGTA TCCTGAAGCA GGATCGGTCA GCCTTCAACG AGTCTGACGA TTATTCGTAT GCTACTCCAA CTACGTTTAC GCGTAATACG CATGTAACGG CCTACCTGAA TGGGCGGCTT ATCTGGGGGG AAGAACCCGC CCCTGCACTG GTATTGCGGC AGGTTAAGGT TTATTCGGCC GCAAAAAACA GTGATATCAC CAGCAGTATC AGTACCGTTC TTGAGGTGCG TAATACAGGG AATGTGGCAA TTCCTTTGCA GGATTTGACG GTACGCTATT GGTTTACGTC CGAAACGAGC CAGCTGCTCA ATAGCTATAT TGATTATGCA CAAATAGGTG CTCAGACCAT TAAGCACAAC GTTGTTCGAC TGGCACAGCC AGTGTCGGGT GCCGATAGCT ACCTTGAACT GAGCTTTTCG GCTGGAGCCG CTGGCCTGGC ACCGCTGAGT AGTACAGGGC AAATTCTGTT TCGGCTGGTA AAGCCTGACT TCTCGTTGCT GAACCAGGTA AATGATTATT CTCACGGTCC TGTAAACCTG ACCGAAAACC CCCACATAAC CGTCTATCTA CAGGGAAATC TGATTTATGG TACCGAGCCG CCGGGTGGCA TGGGACGTAT GGGTGTACCG GATGAAAACA AGTTATTACA GGTCACGCTA TTGGGCAATC CGGTACAGAA TGAACAGTTG ATTCTGGAAG CGCGTGGTGC CCAGGGTCTT CCTTTGGTTC TGCAACTGGT TGACCGTCAG GGCGTACAGG TATTCGGGAA GGAGGTAAGC GAGGCCGCCG ACGTGGAGCG GCAGCAGCTG GAAATGAGTC GGCAACCGGC GGGGGTATAC CTATTACGCA TACGTACACC GAATCAGGAG CGGGTACTTA AAGTAATCAA GCCATAA
|
Protein sequence | MSSFLRSLSL AWLWLTAVFL LHGAIAQPVQ QGDPTPQITV TPSSLTILNY SEYEGQFPAS YTVSASGLTN DLVITAPPYF LLSARGNTGP SLSLPIVNGE VSPTQVSVIL LASSPGTFTG VVTNVSGSAT ATVAVSGTAI SESVSVSPST LNSFTTTAGQ PSAVQSYTVT SRGGLSVVVN APAGFEIRTG SAAFGSSLVI GPSLSYKNTQ VDVRLVGTTP GTVSGVIAND TYYHSAHLTY PVAVSGVVTP VTASASLSVL HRDADYGNRT DQLIRPYLEL GNEGTTAIPY SQITLRYWFT SEGGSPPTDL QVYYAQMGTR YVRMKYVPLA EPRQGAFGYV EYSFDASAGS LAAGSRSGPI ENGILKQDRS AFNESDDYSY ATPTTFTRNT HVTAYLNGRL IWGEEPAPAL VLRQVKVYSA AKNSDITSSI STVLEVRNTG NVAIPLQDLT VRYWFTSETS QLLNSYIDYA QIGAQTIKHN VVRLAQPVSG ADSYLELSFS AGAAGLAPLS STGQILFRLV KPDFSLLNQV NDYSHGPVNL TENPHITVYL QGNLIYGTEP PGGMGRMGVP DENKLLQVTL LGNPVQNEQL ILEARGAQGL PLVLQLVDRQ GVQVFGKEVS EAADVERQQL EMSRQPAGVY LLRIRTPNQE RVLKVIKP
|
| |