Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4161 |
Symbol | |
ID | 8727920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5009296 |
End bp | 5010651 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | |
Product | glycoside hydrolase family 1 |
Protein accession | YP_003388947 |
Protein GI | 284039017 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.167873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATC AAAAGTCAAT TTGGGAAACG CTGAATAAAC CTTTACAAAA CAAATTAGCC TTATGGGGAG GTCTGGAATG TACCGTTAAC CGGGTAGGTG ACGTGTATCA GGATCAGATC GTACGCAGTG GGCACCATGA TCGGCTAAGC GATCTGGATT TAATCGCCGA CTTAGGTATC CGTGCCCTGC GCTACCCCAT TTTGTGGGAA CGTACAGCAC CCGATCACCC CGATCAACCG GACTGGTCGT GGCCCGATGA ACGGCTCAAC CGGCTGCGTC AGTTAGATAT CCGGCCCATT GTCGGGCTGG TGCATCACGG CTGCGGACCC CGCTATGCCA CGTACGACAC ACCGGCTTTT GAAGACGGGC TGGCCCGCTT TGCCCGTCAG GTGGCCGAAC GGTACCCCTG GATCGATGCT TATACGCCCA TCAACGAACC GCTGACAACC GCCCGTTTCG GAGGCTTGTA CGGACTCTGG TATCCGCACG GCCGGTCTAA TCAGCTATTC GTCGATCTGT TGCTGCGCGA GTGCCGGGCT ACCATCCGGG TTATGGCCGA AATCAGGGCC GTACAGCCCA ATGCCCAACT TATTCAGACC GATGATCTGG GAAAAACTCA CAGCACCCCG GTTCTCAGCT ACCAGGCCGA ACTGGAGAAC GAACGGCGGT GGCTGGGCTG GGATTTGCTC TGCGGACATG TAACGCCCCA CCACCCATTG TGGGCGTACT TGCGGGAGTC GGGAGCATCC GAAGCCGATT TGTGGTATCT CATCGAGCAT GCCTGCCCGC CATCGGTTAT AGGCGTTAAC CACTACGTCA CCAGCGAGCG GTATATAGAT CACCGTATTC ATCACTATCC AACTCATTTA CACGGCGGCA ATGGTTATCA TCGATACGCC GATACGGAGG TGGTACGGGC CGCCCCGGAG CAGCGCACCG GCTTAGCTAC CTTGCTACAG GAAGCGTGGC AGCGGTATGC ACTACCCATC GTTGTGACAG AGGCCCATCT GGGCGATCGG CCCGAAGAGC AGATGCGCTG GCTGGGCGAA CTCTGGCAAC AGGCGCAACA GGCTCAGGAT GCCGGTGCCG ATGTCCGGGC CGTGTCGGTC TGGGCCATCA TGGGCCTGTA TGACTGGCAT TGTCTCCTGA CCCGGCGGGA AGATCGACAC GAGCCGGGGG TATTCAACGT CAGCAGCGGC ATCCCGGAAC CCACTGAGCT GGCAACCATG CTGAAACGCC TGACCGCTGG TGAACCGATA GGGTCGTTAG TGCCGCCCGG ACCGGGTTGG TGGCAGACGC CCGACGCCCA AATCTACCAT GCCTCGGAGC CTTTAGTCAG TAATTCAGTC GAGTAA
|
Protein sequence | MSNQKSIWET LNKPLQNKLA LWGGLECTVN RVGDVYQDQI VRSGHHDRLS DLDLIADLGI RALRYPILWE RTAPDHPDQP DWSWPDERLN RLRQLDIRPI VGLVHHGCGP RYATYDTPAF EDGLARFARQ VAERYPWIDA YTPINEPLTT ARFGGLYGLW YPHGRSNQLF VDLLLRECRA TIRVMAEIRA VQPNAQLIQT DDLGKTHSTP VLSYQAELEN ERRWLGWDLL CGHVTPHHPL WAYLRESGAS EADLWYLIEH ACPPSVIGVN HYVTSERYID HRIHHYPTHL HGGNGYHRYA DTEVVRAAPE QRTGLATLLQ EAWQRYALPI VVTEAHLGDR PEEQMRWLGE LWQQAQQAQD AGADVRAVSV WAIMGLYDWH CLLTRREDRH EPGVFNVSSG IPEPTELATM LKRLTAGEPI GSLVPPGPGW WQTPDAQIYH ASEPLVSNSV E
|
| |