Gene Information |
Locus tag | Slin_5392 |
Symbol | |
ID | 8729158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 6552060 |
End bp | 6554921 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_003390158 |
Protein GI | 284040228 |
COG category | |
COG ID | |
TIGRFAM ID | |
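The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length counts one terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Slin_5392.
length = gene_length(6552060, 6554921)
print(length, protein_length(length))  # prints: 2862 953
```

Under these assumptions the computed values agree with the Gene Length (2862 bp) and Protein Length (953 aa) fields of the record.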
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGACA GTGGCCAGCA GTGGGTAGAC AGCGTCTTCC ATACGCTCAC CCCCGAGCAG AAAATCGGTC AGTTCTTCAT GGTGGCGACC TTCTCAAATC GTCACGATAA TCATTATCAA TACATTGAAC ATCTCATTCA GACAAATCAT ATTGGTGGCC TGATCTTCTT CCAGGGCGGC CCTTACCGGC AGGCTATTCT GACAAACCGC TACCAGGCAG CGTCGAAGGT TCCTTTACTC ATTGGTATTG ACGGCGAATG GGGACTGGGA ATGCGGCTGG ACAGCACCAT GAATTTCCCC AAGCAAATGT CGCTGGGAGC TATTCGGAAC AACGAACTGA TCTACCGGAT GGGTGCCGAA ATTGGACGGC AGTGCCAGCG GCTGGGTATT CATATCAATT TTGCACCGGT ATCCGACATT AACAGCAACC CGGCCAATCC GGTCATTGGT GTTCGGTCGT TTGGGGAGTC TAAAGAGAAT GTTGCCCTGA AAGCGTCAGC TTACATGCGG GGATTGCAGC AGACCCGCGT CATTGCCACC GCCAAGCATT TTCCGGGCCA CGGCGATACC AACGCCGATT CGCACCACAC GCTGCCCACC GTTAGCCGGT CGTCGGAGCA AATGCGGGAC ATTGACCTGT ATCCCTTTCG CAAGCTTATT GCCGACAGTT TGATGGGCGT TGTTACCGGT CACCTGCACG TACCGGTGAT GGACAATACG CCCGCGCTGG CCGCTACGTT GTCGGAAAAG ATTGTTACTG AACTGCTCAA GAAAGAGCTG GGCTTCCGGG GGCTGGTCTT TACCGATGCT ATGAACATGG GCGGCATCAG CCGGTCGCCG AAGGCGATGG ACGTTAATCT CCGCGCCCTG ATTGCCGGCA ACGATATTTT GCTTTACCCG GAAAACGTTC GGGAGGCAAC CCTGAACATA CTGAATGCCG TTCAGCAGGG CGTTATCACA CAGGAATTTA TCGACGAAAA AGTCAAGAAG ATCCTTCGCG CTAAATACTG GGCTGGTTTA CATCACTATA AACCCATTAG TCTGGCGGGC CTGTCAGCAG AACTTAACTC ACCCGAAGCG CAGCTCCTGA AACAGGAACT TTGCGAACAA TCGGTAACCG TTATCGACAA CAAGAAAGAC CTGTTGCCTC TAAAGCAGCT GGATACGCTC AAGCTGGCAT CGGTAGCGAT TGGGGCCGAG CCGGGGAATG TATTTCAAAA AACGCTGACT CAATATGCGC CTTTCCAAAC ACTGGCCTAC CCGGAGAAGC CCGTCTCGGA AGCCGATCTT TCCCATATTG TAGCGCAACT CACCGAAGCC AACACGGTTG TGGTTAGCTT TCACCGGATG AGTGAGTCGG CCCTTCGGAA ATACGGTATT ACGAAACCCT CCCTCGACTT AATTACCCGG CTGAAGCAGC AAGGCAAAAA AGTTATTGTA ACGGCCTTTG GTTCGCCTTA TAGCCTGCCT CAGTTCGCTG CAGCCGACGC TCTGATTTGC GCTTATCAGG AGCTTGACGA TATGCAGCGG GTGGTTCCCC AGGTTTTGTT TGGCGGACTG GGGGCAAAAG GCATGCTGCC CATCTCGACG GGTGATCTGA AGGTTGGTCT TGGGCATACG CTCAATCCGG AAGGGCGGTT ATCGACAGGT TCACCGGAGA GTGTTGGCAT GAAAACGACA GTGCTGAACC AGATCGACGC CATTGCTCAG GGAGCCGTAA AGAATCACGT GGTACCCGGT TGCGAGATTC TGGTTGCCCG GAAAGGTAAA ATCGTTTATA GCAAAAACTT TGGTGCGTTA 
ACCTACGCGG CTGGTGCCGA AAAAGTGACC GATGAGACAT TGTATGACCT GGCCTCGCTG ACAAAAGTGC TGGCTACGCT ACAGTCGGTC ATGATTTTGT ACGACCGCAA ACAAATCGAT CTGACGCAGA AAGCATCGCT GTATCTGCCG GAACTGCGCA ATACGAACAA GCAAAACATA ACCCTTCAGG ACTTACTCTG GCACCAGTCG GGCATGGTTT CTTTTTACCC AACTACCTGG GATCGGACAC GGCTACCCGG CGGGGGGCTG AAAGCCGAGT ACTACGGAGC TGTTCGGGAT AGCCTGCATA CACTTCAGAT TGCGCCAACC CTCTGGGGGG TTCCGGCGCT GAAAGATTCT GTATGGAAGT GGGTTGTTCA ATCGCCGATG TCGCGCAAAA CGGACGAGTC GGGAAAACCG GCGTTCGTTT ACAGCGACCT GAATTTTTTG ACGCTGCAGA AAATCGTTGA GCGTGTCAGC AAGCAACCAC TGGACAAGTT CGTTACCGAA AATGTGTATA AGCCGTTAGG GCTTCACCAG CTTGGGTTCA CGCCCCTGCA ACGGTTGCCA AACCCGCAAT GTGCCCCCAC CGAACAGGAT ACGTACTACA GAAACCAGCT TCTGGTAGGT ACGGTGCACG ATCAGATGGC AGCCGTACAG GGCGGGATTT CCGGCCATGC CGGGCTGTTT GGCAATGCCC GTGACATTGC CACGCTGTTA CAGATGAACT TACAAAAAGG CGTGTATGGC GACGAGCGGA TTTTTCAGCC GATGACGGTG CCTTATTTTA CACAAACCCT AAGCAATCGT AGCCACCGTG CGCTGGGCTG GGACAAGCCC AATCCTGAAA GTGCCAGTAG CGTCTATATG GCGCAGCAGG CTTCGGCCCG CTCATTCGGC CATACGGGTT TTACCGGTAA TGTGGTTTGG GTCGATCCTG ATCAGGACTT GATATTTGTT TTTCTTTCGA ATCGTATCTA CCCAACAGCC GGAAACAATT CTATCAATAC AACAAAGCTC CGTCGACGCA TCCACGAAGT TATTTATAGT GCCATTCAGT AA
|
Protein sequence | MSDSGQQWVD SVFHTLTPEQ KIGQFFMVAT FSNRHDNHYQ YIEHLIQTNH IGGLIFFQGG PYRQAILTNR YQAASKVPLL IGIDGEWGLG MRLDSTMNFP KQMSLGAIRN NELIYRMGAE IGRQCQRLGI HINFAPVSDI NSNPANPVIG VRSFGESKEN VALKASAYMR GLQQTRVIAT AKHFPGHGDT NADSHHTLPT VSRSSEQMRD IDLYPFRKLI ADSLMGVVTG HLHVPVMDNT PALAATLSEK IVTELLKKEL GFRGLVFTDA MNMGGISRSP KAMDVNLRAL IAGNDILLYP ENVREATLNI LNAVQQGVIT QEFIDEKVKK ILRAKYWAGL HHYKPISLAG LSAELNSPEA QLLKQELCEQ SVTVIDNKKD LLPLKQLDTL KLASVAIGAE PGNVFQKTLT QYAPFQTLAY PEKPVSEADL SHIVAQLTEA NTVVVSFHRM SESALRKYGI TKPSLDLITR LKQQGKKVIV TAFGSPYSLP QFAAADALIC AYQELDDMQR VVPQVLFGGL GAKGMLPIST GDLKVGLGHT LNPEGRLSTG SPESVGMKTT VLNQIDAIAQ GAVKNHVVPG CEILVARKGK IVYSKNFGAL TYAAGAEKVT DETLYDLASL TKVLATLQSV MILYDRKQID LTQKASLYLP ELRNTNKQNI TLQDLLWHQS GMVSFYPTTW DRTRLPGGGL KAEYYGAVRD SLHTLQIAPT LWGVPALKDS VWKWVVQSPM SRKTDESGKP AFVYSDLNFL TLQKIVERVS KQPLDKFVTE NVYKPLGLHQ LGFTPLQRLP NPQCAPTEQD TYYRNQLLVG TVHDQMAAVQ GGISGHAGLF GNARDIATLL QMNLQKGVYG DERIFQPMTV PYFTQTLSNR SHRALGWDKP NPESASSVYM AQQASARSFG HTGFTGNVVW VDPDQDLIFV FLSNRIYPTA GNNSINTTKL RRRIHEVIYS AIQ
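The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. The sketch below, in plain Python, assumes the sequence contains only the bases A, C, G and T, written in space-separated blocks of ten as in this record. It uses the codon-to-amino-acid assignments that translation table 11 shares with the standard code, and it does not handle alternative start codons because this gene begins with ATG. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a + strand CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGTCTGACA GTGGCCAGCA GTGGGTAGAC"))  # prints: MSDSGQQWVD
```

The ten residues produced from the first thirty bases match the first block of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.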