Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3893 |
Symbol | |
ID | 8727651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 4669658 |
End bp | 4671940 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_003388682 |
Protein GI | 284038752 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.470709 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAATCA AACCGATCAT TTTCATCGTC GGGCTCAGTC TGCTGATGGC CGGTATAGGT CAATCGCAAA CGCCCGATCC ATACCCCATT ATCCCGTACC CGACTTCGCT GGTACCGGGT CAGGGGCAGT TTGTTATCAC CCAAAATACC GCGCTGGTTG TCCAGGATGG TCGTTTCCAG AGCGAGGCCG GTCAGTTACA GCAATTGCTG AAACCGGTTT TGGGCAAGCC ACTGCCTGCC CGTGGAGGCA ACGCGCAGAT CGTGCTCAAT TACGACCCAT CCATCACGTC ACCGGAAGGC TACCAGCTTA CCATCGCCCC CCAGCGCATA ACCCTGGCGG CCAGAGAGCC GGTGGGCATG TTTCGGGCCA TTCAGACCAT CCGGCAACTG CTGCCCGTGA GCCTTGAGCA GAAAAAAACA ACCGGCCCAC TGACCCTCCC GGCGGTGCAG ATCCGCGACC AGCCCGCCTA TGCCTGGCGG GGCATGCACC TCGATGTGTC GCGGCACTTT TTCTCGATGG ATTACCTGCA TAAATTCGTC GATCTGCTGG CTCTTTACAA GTTTAATAAA TTTCACCTGC ACCTCACCGA CGACCAGGGC TGGCGGCTGG AGATAAAGGC GTACCCCAAG CTAACCAGCG AAGGAGCCTG GCGGACGTTC AATAACCAGG ATTCTGTCGT GCTGAAACGG GCCACCACAA ATCCCGACTT CGACCTGCCG AAGCAATACC TCCGGCAGAA AGACGGGAAG ACGCAATATG GCGGGTTTTA TACTCAGAAC CAGATGCGCG AACTCATCGC GTACGCAGCC GCCCGCCATA TCGAAATCAT TCCGGAAATC GACATGCCCG GCCATTTGAC GGCGGCCATC AAAGCGTATC CATTTCTAAG CTGTACGGGT CAGGAAGGCT GGGGCAAAAC GTTTTCGGTG CCCATCTGCC CCTGCAACGA ACCCACCTAC ACGTTTACCG AAACCGTACT GAGCGAAGTA GCCGCGCTGT TCCCGAGCCA GTACATCCAC ATCGGGGCCG ATGAAGTGGA GAAATCGACC TGGGCGCAGT CGACGGCCTG TCAGGCGTTG ATGAAGCGGG AGGGAATCAA AAGCGTGGAG GAACTGCAAA GTTATTTCGT ACACCGCACC GAAAAGTTTC TGCTGTCGAA AGGGAAAAAA CTCATGGTCT GGGACGACGC TCTGGAGGGC GGACTGGCAC CCTCGGCCAC GGTCATGTAC TGGCGGAGCT GGGTGACCGA TGCGCCCGTG AAAGCTGTTC GGAATGGCAA CCCGGTGGTT ATGACGCCCG TTAACACCCT TTATTTCGAC GTGTTGCCCG ACAAAAATTC GCTCGCAAAT GTCTATCAAT TCAATCCCGT TCCAACCGGG CTGACACCCG CCGAAGCGAC ATCCATTCTG GGCGCACAGG CCAACACCTG GACGGAATAT ATTCCCTCCG AAAATCGGGT CGATTACATG GTGATGCCCC GCATGACGGC CCTGGCCGAA CGGCTGTGGA CCAATCAGAA TCAGTATGAC ACTTACCGGC AGCGCCTTAC CCGGCACTAC CCGCGTCTGG ATGCGCTGGG GGTTCACTAC CGCGTGCCCG ATCTGAGCGG CTTTGCCGAA GAGAATGTGT TTACGGACCA AACGGCCCTG CGCATCCGCA AACCGACGGA TAATCTGGTT GTTCGCTACA CCATCGATGG GAGCTTGCCG AAGGCGGCTT CGGCGCTACT TCCCGAGTCG TTGCCGATCA GCCAGCCAAC GACGGTAAAG CTGGCGGCTT TTACGAATAG CGGCTTGCAG GGCGATGTGT ATACGCTTCG GTATCAACAG CAATCGCTTG CTGAACCGGT GGGAGTATCG TCCGTCGGGG CCGGATTGAT GAGTACCTAT GTGAAGGGAC AGTTTAAAAA TGTAGCCGCC ATGCTAAAGG CACCGGCTTC CGATTCGGTG GTGGTGAATC AGGTAAAGGT GCCGGAAATG GCCGGAGCGG GTAGTTTCGG CGTTCGGTTT CGGGGTTACA TCAGCGTTCC CGCCACGGGT ATTTACAGCT TTTTCCTGGT AGCCGACGAT GGGGGTGTGC TGCACATTGC CAACCGGACG GTTATCGACA ACGACGGTAA CCACGGCCCC ATTGAAAAAA GCGGTCAGGT GGCCCTCAAA CGGGGTACGC ATCCTTTCGC CCTCGACTTC ATCGAAGCGG GTGGCGGGTA TACCCTAAAG CTGCTGTACA GCCGCGACGG TAGTGATCCA CAACCCGTAC CCGCCGACTG GCTGGGGCAT TGA
|
Protein sequence | MQIKPIIFIV GLSLLMAGIG QSQTPDPYPI IPYPTSLVPG QGQFVITQNT ALVVQDGRFQ SEAGQLQQLL KPVLGKPLPA RGGNAQIVLN YDPSITSPEG YQLTIAPQRI TLAAREPVGM FRAIQTIRQL LPVSLEQKKT TGPLTLPAVQ IRDQPAYAWR GMHLDVSRHF FSMDYLHKFV DLLALYKFNK FHLHLTDDQG WRLEIKAYPK LTSEGAWRTF NNQDSVVLKR ATTNPDFDLP KQYLRQKDGK TQYGGFYTQN QMRELIAYAA ARHIEIIPEI DMPGHLTAAI KAYPFLSCTG QEGWGKTFSV PICPCNEPTY TFTETVLSEV AALFPSQYIH IGADEVEKST WAQSTACQAL MKREGIKSVE ELQSYFVHRT EKFLLSKGKK LMVWDDALEG GLAPSATVMY WRSWVTDAPV KAVRNGNPVV MTPVNTLYFD VLPDKNSLAN VYQFNPVPTG LTPAEATSIL GAQANTWTEY IPSENRVDYM VMPRMTALAE RLWTNQNQYD TYRQRLTRHY PRLDALGVHY RVPDLSGFAE ENVFTDQTAL RIRKPTDNLV VRYTIDGSLP KAASALLPES LPISQPTTVK LAAFTNSGLQ GDVYTLRYQQ QSLAEPVGVS SVGAGLMSTY VKGQFKNVAA MLKAPASDSV VVNQVKVPEM AGAGSFGVRF RGYISVPATG IYSFFLVADD GGVLHIANRT VIDNDGNHGP IEKSGQVALK RGTHPFALDF IEAGGGYTLK LLYSRDGSDP QPVPADWLGH
|
| |