Gene Information |
Locus tag | Slin_6637 |
Symbol | |
ID | 8730423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 8072294 |
End bp | 8073601 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | |
Product | Alpha-N-acetylgalactosaminidase |
Protein accession | YP_003391393 |
Protein GI | 284041463 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
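The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The names `span_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_length_bp // 3 - 1


# Values copied from the record for Slin_6637.
gene_len = span_length(8072294, 8073601)
assert gene_len == 1308               # matches "Gene Length"
assert protein_length(gene_len) == 435  # matches "Protein Length"
```

Under these assumptions the coordinates, the gene length, and the protein length in the record agree with one another.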
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCGC TGTACACGAC CGCTCAGGCG GGCAGTCGGC CTTCGGCTTT ATCGAAAGTA CGGCTGGCAT TCATTGGCGT TGGTCTGCGG GGGCGCAACC ACCTTCAGCA GGCCCTTTAC CGGTCCGATG TCGAGGTGAC CGCCCTCTGC GACATTTCGG CCGATGCCAT CGCCAAATCC AACGCCATGA TTGAAAAGGC AGGCCGTAAA ACTCCGCCCG CCTATACCAA AGGCGACGAA GCCTTTCTGG ACATGCTGAA ACGGGACGAT ATCGACGGTG TCGTGATTGC CACCCCCTGG GAATGGCATA CACCCATGGC CGTGGCGTCC ATGAAAGCGG GTAAATACAC CGGACTCGAA GTATCGGCCA CGGTGACGCT AAAGGAGTCG TGGGATCTGG TCAATACGTA CGAGAAGACG AAGTCGCACT GCATGCTGCT GGAGAACGTC TGCTACCGGC GCGATGTCAT GGCCATTCTT AACATGGTAC GTCAGGGTAT GTTCGGCGAG ATGACCTACG CGCACTGCGG CTATGAACAC GACCTGCGGA ATATCAAATT CAACGACGGT ACCGCCCGTG GCGTCGGCGC TGAATTCGGC GAAAAAGGGT TCTCGGAGGC CCATTGGCGC ACCCAGCACT CCGTCGACCG GAATGGCGAT CTCTACCCCA CGCACGGTCT GGGGCCGGTG GCCCACTGGC TCAACATCAA CCGGGGCAAC CAGTTTGTCC GGCTGACATC GATGGCTACC AAAAGCCGGG GTCTGCACAA GTATGTTGTG GACAAAGGCG GGGCCAACCA CCCCAATGCC AAAGTGAACT TCAAACTGGG CGACGTAGTG ACCACCATGG TCGAGTGCGC CAACGGAGAG AACATCGTTA TCATCCACGA CACCAACTCG CCCCGCCCCT ACTCACTGGG ATTCCGGGCG CAGGGTACGC AGGGAATCTG GATGGACGAC GGCGACACCA TTTATCTGGA GGGCGTCAGT CCGAAGCCGC ACCAGTGGGA ATCGTTCAGC CCGTACCAGG AAAAATATGA TCACCCGCTC TGGAAACAGC ATGTCGAAAC GGCACAGAAT GCCGGACACG GCGGCATCGA CTTTTTCGTC CTGCGCGGCT TCATCGAATC CATCAAAAAT CAGGTAGCCC CGCCCATCGA CGTCTACGAT GCGGCTGCCT GGAGCGCCAT CAGCCCGCTG TCGGAGCAGA GCATTGCAGG CGGCAGTAAA GCCGTCGACA TCCCCGATTT CACCCGTGGC AAATGGAAGA CCAACAAGCC CATCTTTGGC CTTACCGACG TGTACTGA
|
Protein sequence | MPSLYTTAQA GSRPSALSKV RLAFIGVGLR GRNHLQQALY RSDVEVTALC DISADAIAKS NAMIEKAGRK TPPAYTKGDE AFLDMLKRDD IDGVVIATPW EWHTPMAVAS MKAGKYTGLE VSATVTLKES WDLVNTYEKT KSHCMLLENV CYRRDVMAIL NMVRQGMFGE MTYAHCGYEH DLRNIKFNDG TARGVGAEFG EKGFSEAHWR TQHSVDRNGD LYPTHGLGPV AHWLNINRGN QFVRLTSMAT KSRGLHKYVV DKGGANHPNA KVNFKLGDVV TTMVECANGE NIVIIHDTNS PRPYSLGFRA QGTQGIWMDD GDTIYLEGVS PKPHQWESFS PYQEKYDHPL WKQHVETAQN AGHGGIDFFV LRGFIESIKN QVAPPIDVYD AAAWSAISPL SEQSIAGGSK AVDIPDFTRG KWKTNKPIFG LTDVY
|
| |
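The sequences above are displayed in space-separated blocks of ten characters, so they need normalising before any analysis. The following is a minimal sketch of that step, with a simple GC calculation and a stop-codon check for translation table 11 (stop codons TAA, TAG, TGA). The helper names `normalise`, `gc_percent`, and `ends_with_stop` are hypothetical. Whether the database computes its GC content field in exactly this way is not stated in the record, so treat the result as an approximation to compare against, not a reproduction.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalise(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among all characters of the normalised sequence."""
    s = normalise(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    s = normalise(seq)
    return len(s) % 3 == 0 and s[-3:] in STOP_CODONS


# Usage with a short placeholder; paste the full "Gene sequence" value instead.
example = "ATGCCTTCGC TGTACTGA"
print(len(normalise(example)), round(gc_percent(example), 1), ends_with_stop(example))
```

Applied to the full gene sequence, `len(normalise(...))` can be compared with the Gene Length field and `gc_percent(...)` with the GC content field.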