Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_1177 |
Symbol | |
ID | 5752904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | + |
Start bp | 1394521 |
End bp | 1397319 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641287447 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001553613 |
Protein GI | 160874297 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0322091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA CACTCTCAGC AAGCGCCATT TTACTGGCTT TAGGATTAGC GGCCTGTTCC GAGGCGCCAG TGACGCCTTC CTCTGCGTCA GCCAATAAGG CTGTGACTCA AACACAGCAA GCTCAGGACG CATTAACTCA AGCGCAGTTA CAACAACTCG GGGATTCGTT AGTCGTAACT TACCGTGTTG TGACCAATGT GCCAGACGAT AAGTGCCTAA AAGATAAAGC CGATGGTCGC TGTTTTGTCG CTCAAATTGA TTTCGTGCCC GGTATGGATT TAGCCAGCCA GGATTGGGAA ATTTATTTAA GCCAAATGCG CCCAGTGCAA TCGGTTGAAA GCGGTGAGTT GACGATAACG CACGTTAAAG GCGATTTACA TCGTATTACG CCAACGGCCG CTTTTAAGGG ATTTGTTAAG GGCGAGAAAA AGCAACTGAC ATTCCGTGGT GAGCCGTGGC AATTGGCTGA AACCGACGCT ATGCCAAACT ATTACCTCGC AGCAAACGGA TTAGAACCCG TCATTATCGC CAGCACTCAA GTGGGTAAAG ATGCCGAAAC GGGTTTAGAG ACGCGTCCGT ATGTTGAAGG GTTTACGAAT TTTAAAACCC AATATCAACG CACTGAAACG GATAAACTCG CGCCAGCCGA TGCCGCGCAA TTATTTAAAG CCAATCAAAG CGTGAGTGAA GATGCCGCCT TGGCTGTGGA TACCATTATC CCAACGCCAC AAAAGGTGAA GATCCATTCA ACTGATACAC CAGTATCCCT TGCTGGCGGT ATTAAGGTCG AATATCAGCA AGATAACCCA GAGCAAAAGA GTCAAGTTGC GGCAGCGGTG GCACGTTTAG CTCGCCTGGG TGTGGCTGAA TCTGCGAGTG GCATTACGGT GAAATTTGCC AGCCAAAAAG GGGATGAGGG CAGTTATCGC TTAGATATTC AGCCAACTGA AATAGTGATC GCTGCCAATG ATGCGGCGGG CTTCTCTTAC GGTTTGTCAT CATTAGCAAG CCTTGTGGAT GTGAATTCGC TGAAAGTGAA TGCCATGACC ATTGAAGACA GCCCACGTTA TCCGTTCCGT GGTATGCACA TTGATGTGGC ACGTAATTTC CACAGCAAAC AATTGTTGCT GGATTTGCTC GACCAAATGG CGGCTTATAA GCTCAATAAA CTGCATTTAC ACATGGCTGA CGATGAAGGT TGGCGTTTAG AAATCGATGG TCTGCCTGAA TTAACCGACA TTGGCAGTAA ACGTTGCCAT GATTTAGATG AAAATTCCTG CTTATTACCG CAGTTAGGCA GTGGGCCGTT TGGCGATACT AGTGTTAATG GCTATTACAC CAAACAAGAC TATATCGACA TAGTGAAATA TGCCGAGGCG CGTCAAATTC AAGTGATCCC ATCGATGGAT ATGCCGGGTC ATAGCCGCGC CGCGATTAAA GCGATGGATG CGCGTTTTCG TCGATTACAG TCTGAAGACG TAGAGCTTCA AAACACGGGC AAAGTGCAGG TTGTCCCTGT GCAGGAAACG GTAAATCAAG CTAGCACAGC GGTAAAAGCG TTAGATGCGG GCGCTCGTCA ATTGCAAGCT GAAGGGAAAA CCACTGCAGC TGAGCAATAT TTGTTATCCG ATGCAAACGA CAAAACGGTT TATTCTTCAG TGCAATATTA CGACGACAAT ACGCTTAACG TGTGTATGGA GTCGACCTAT CAGTTTGTCG ATAAGGTAAT TGATGAAATT GCCAAGTTAC ACCAAGCCGC GGGTCAACCA TTGACGCGTT ACCACATTGG TGCCGATGAA ACGGCGGGAG CTTGGAAACA ATCACCCCAG TGTTTAGCAT TTGTCGCCAA TAACGATAAA GGCGTTAAGT CGATAGATGA CTTAGGCGCT TATTTTATCG AGCGAATTTC GACTATGTTA GCGGCCAAAG GTATTGAAGC TGCGGGGTGG AGCGATGGCA TGAGCCATGT TCGCCCTGAG AATATGCCCG CTAAAGTGCA GTCTAATATT TGGGATGTGA TTGCCCATAA AGGCTATGAG CGCGCCAATC AGCAGGCGAA TCTTGGTTGG GAAACCGTGC TCTCTAATCC TGAGGTGTTG TACTTTGATT TCCCCTATGA GGCCGATCCT AAGGAGCATG GCTATTACTG GGCGAGTCGC GCGACCAACA GCCATAAAGT CTTTAGCTTT ATGCCAGATA ACTTGGCCGC CAATGCCGAG CAATGGACCG ATATTCAAAA TCAGCCGTTC GAAGCGGACG ACAGAATAAA ACTTGATGAG GCGGGTAAAA AAATCTCGGG TCCAAGGGAG AAAGGCAAAC CTTTTACTGG CTTGCAAGGG CAGGTTTGGA GTGAGTCGAT TCGCAGTGAC GATACAATTG AATACATGGT GTTCCCGCGT TTGTTGATGT TGGCTGAGCG TGCTTGGCAT CAAGCCGCAT GGGAAGTGCC TTATCAATAT CAAGGCGCTA TATATAATCA AAGCTCGCGC GCTTTTACTT TAGCCATGCG TGATGCACAG GCAAAGGATT GGCAACAACT CGCGAATACT TTAGGCCACA AAGAGTTTAT TAAGTTAGAT AAAGCGGGGA TTGATTATCG CGTACCGACG GTCGGCGCCG AGATCCGTGA AGGTAAACTG TTTGCTAATA TCGCCTATCC CGGACTCAAA CTGGAATACC GAAGCCTAAA TGGCCAATGG CAAGCTTATC AAGCGGGGCA GGCGGTCACT GCACCGATTG AGGTCCGTGC CATAGCTGCC GATGGTATTC GTAAAGGCCG AAGTTTAATC GTTAACTAA
|
Protein sequence | MNKTLSASAI LLALGLAACS EAPVTPSSAS ANKAVTQTQQ AQDALTQAQL QQLGDSLVVT YRVVTNVPDD KCLKDKADGR CFVAQIDFVP GMDLASQDWE IYLSQMRPVQ SVESGELTIT HVKGDLHRIT PTAAFKGFVK GEKKQLTFRG EPWQLAETDA MPNYYLAANG LEPVIIASTQ VGKDAETGLE TRPYVEGFTN FKTQYQRTET DKLAPADAAQ LFKANQSVSE DAALAVDTII PTPQKVKIHS TDTPVSLAGG IKVEYQQDNP EQKSQVAAAV ARLARLGVAE SASGITVKFA SQKGDEGSYR LDIQPTEIVI AANDAAGFSY GLSSLASLVD VNSLKVNAMT IEDSPRYPFR GMHIDVARNF HSKQLLLDLL DQMAAYKLNK LHLHMADDEG WRLEIDGLPE LTDIGSKRCH DLDENSCLLP QLGSGPFGDT SVNGYYTKQD YIDIVKYAEA RQIQVIPSMD MPGHSRAAIK AMDARFRRLQ SEDVELQNTG KVQVVPVQET VNQASTAVKA LDAGARQLQA EGKTTAAEQY LLSDANDKTV YSSVQYYDDN TLNVCMESTY QFVDKVIDEI AKLHQAAGQP LTRYHIGADE TAGAWKQSPQ CLAFVANNDK GVKSIDDLGA YFIERISTML AAKGIEAAGW SDGMSHVRPE NMPAKVQSNI WDVIAHKGYE RANQQANLGW ETVLSNPEVL YFDFPYEADP KEHGYYWASR ATNSHKVFSF MPDNLAANAE QWTDIQNQPF EADDRIKLDE AGKKISGPRE KGKPFTGLQG QVWSESIRSD DTIEYMVFPR LLMLAERAWH QAAWEVPYQY QGAIYNQSSR AFTLAMRDAQ AKDWQQLANT LGHKEFIKLD KAGIDYRVPT VGAEIREGKL FANIAYPGLK LEYRSLNGQW QAYQAGQAVT APIEVRAIAA DGIRKGRSLI VN
|
| |