Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_2949 |
Symbol | |
ID | 7089017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 3475747 |
End bp | 3478488 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643461834 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_002358858 |
Protein GI | 217974107 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0525073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAGGAC TAATAAAACA AAGAGGGATG ATTAAGTGGC AGGGGATTAA GACGGCCTTA ATCAGCGGCG TCATGGCCGT GCTTCCCGCG ATGCCAGTGC TTTCAATACT TGCGATGCCA TCAGTACAGG CAGCCAACGC CGTGCAGGAG GCAAATCATG TCGCCCTGAA TCAATTGGCC GATAAACTCC AATTAAATTA TCAGTTAACC GCCAGTTACC TTGAAACCTG CCCCGCAAAA GAAACCCAAT GCTATCGCTC TGCCATTGAG CTAACCCTGC CCACTCAGCT ATCTAGCGAC AAATTATCCA GTGAAGAGTT AAATCACGGC GACTGGCAAA TCTACTTCAG CCAATTATCG CCCGTTTATT TTGTCGATGC GGGTGACTTT AACATTGAGC ATATCAATGG CGATTTACAC AAAATCACTC CTAAAGCCAG CTTTAAGGGC TTCGTCGCCG ATCAGGTTTA TCGTATTGAG TTTTATAGCC AAGGTAGCCA AATCACCCGT TCCGAGTTTA TACCAAACTT CTTTGTCGCT ATGGGTGATA AAGCGCAGCT TATAAAGAGC ACTCAAACCC AGCGCAATAA AGAAACGGGG TTGCCCGAGC AAGACTACCT CGCACCTTTT AATACCGAAA AACAATTCTT ACTGGCAAAA AACGATGCAA CCCCCTTTGC CGATGGCCAG TATTTAGCCG AGGAATATCA AGCCTTAGCC GCAGACAAAG CACCTGACAT TAGCGGCAAA CTTATCCCCA CGCCACTTCA CAGCCAAAGC TTAGACTCGG CGCCTTTATC CCTTGCCAGA GGTTATCGCC TCACGGCTGC GACTGTAGAT TTTAAGGGAC GACAGGCGGC GTTGGATTAC CTCGATTCTT TAGGCTTTTC GGCCAAGAAC AAAGGCACGC CGCTGTTATT ACAGCTCGAT GCCCAAGCTC CTAAAGTCGC CAGCACTGAA GCCCATGATA GTGAAATTAA AAGTAACGAA GCCTATCGGT TAACCATAAC GGCTGAGCAA ATCAGTATTC GAGCGGGCAA TGAAGCGGGA CTCTTCTACG GTCTACAAAG CCTCGCAGGG TTGATCAGTT TAAGTGACGA TCAACTGGTC GCCATCGAGA TCCAAGACCA GCCTCGCTAC GCCTTTCGCG GCCTACACAT CGATTTAGCC CGTAACTTCC ACTCGCTAGA TTTTATTAAA CGCATCATTC CACAGCTCGC CGCCTATAAA ATCAATAAGC TGCATTTGCA CTTAGCCGAC GATGAAGGCT GGCGTTTAGC CATACCCGGC TTACCTGAAC TAACTGATGT GGGCGCAAAA CGCTGTTTCG ATTTAACCGA GCAAACCTGT TTATTGCCAC AACTAGGCAG CGGAATTGCT GATGCTAATC CCGTCGATGG CTACTTAACC GTGGCGCAAT ACCAAGAAAT ATTACAACTG GCTGATGCTC ACCATATCGA AGTGATCCCT TCATTGGACA TGCCAGGGCA TTCCCGCGCC GCGATAAAAT CCATGGAGGC ACGTTATCAC CATTACCTCG CCAAGGGAGA CAAAGCTAAG GCGGAGGAGT TTTTACTGAC TGAATTTGCA GATAAAACCC AATATTCGTC GCTTCAGTAT TATCACGACA ACACACTCAA TGTGTGCTTG GCGAGCACCT ATCACTTTAT CGATACAGTG ATTGATGAAG TCAAAAAAAT GCACCAAGCC GTAGGCATTC CCTTAGTGCA TTACCACATA GGCGCCGATG AAACCGCGGG CGCTTGGGTC GACTCTCCCG CGTGTATCGC AATGAAAAAA CAAAAAGCCG ATGAGTTAGC TGGACTCCAC TCACTCAATG GCTACTTTAT CGAACGCGTT GCTAACATGC TGGCAGATAA AGGGATTATT GCCGCTGGCT GGAATGATGG CATGGGTGAA GTGCGCCCCG AAAACATGCC CGCCCAAGTG CAATCTAACG CTTGGTCACT GATTTCAGAC AACGGCCATC AAATCGCCCA TAAGCAAGTC AATCTTGGCT GGAAGGTCGT GCTCTCAACG CCGGAAGTCA CCTATTTTGA TTTTCCCTAT GTTAGCCATC CAGATGAGCG CGGTAATCAT TGGGCTGCAA GGGCAATAGA CAGCTTTAAG GTGTTTAGCT TTATGCCCGA CAACCTGCCC GCCCATGCCG AGCGCTGGCG CAATAGTTTA AATCAGCCAT TTATCGCGGA CGACAGCCAC AGCCAGCTCA AGCCGGGACA CGGTTTTTAC GGCTTGCAGG GGCATCTTTG GAGCGAAATG GTGCAATCGG ATGAGCAAGC AGAATACATG CTATTCCCGC GCATGATAGC CCTTGCAGAA CGGGCATGGC ACAAGGCATC GTGGGAATTA GCCTATGATT ATCAAGGTAA AATCTATAGC CAACAGACTC AACATTTTGC CAGCCAACTC AATGGCCAAG CCGAGCAAGT CCTCAAACAG GATTGGCAAA CCTTCGCCGC GGTTTACGCC AATAAAGTGC AGCCTAAGTT AGCAAAAGCG GGAGTTTTCT ACCGCATAGC ACCACCTGGG ATTCTTGTTG AAAATCAGCT GTTAGTGCTC AATAGCTTAT ATCCCAATGC CGAGCTGGAA TACCAACTCG ACTCAGGACC TTGGCTAAGC TACCTCCAAG CTTTCAAGCT TAATGATGTC AAACACATCC GCGCCAGAGT CAAAGATGGT ACGCGTTACT CTCGCCCATC GACTTGGCAA AACACCCTCT AG
|
Protein sequence | MKGLIKQRGM IKWQGIKTAL ISGVMAVLPA MPVLSILAMP SVQAANAVQE ANHVALNQLA DKLQLNYQLT ASYLETCPAK ETQCYRSAIE LTLPTQLSSD KLSSEELNHG DWQIYFSQLS PVYFVDAGDF NIEHINGDLH KITPKASFKG FVADQVYRIE FYSQGSQITR SEFIPNFFVA MGDKAQLIKS TQTQRNKETG LPEQDYLAPF NTEKQFLLAK NDATPFADGQ YLAEEYQALA ADKAPDISGK LIPTPLHSQS LDSAPLSLAR GYRLTAATVD FKGRQAALDY LDSLGFSAKN KGTPLLLQLD AQAPKVASTE AHDSEIKSNE AYRLTITAEQ ISIRAGNEAG LFYGLQSLAG LISLSDDQLV AIEIQDQPRY AFRGLHIDLA RNFHSLDFIK RIIPQLAAYK INKLHLHLAD DEGWRLAIPG LPELTDVGAK RCFDLTEQTC LLPQLGSGIA DANPVDGYLT VAQYQEILQL ADAHHIEVIP SLDMPGHSRA AIKSMEARYH HYLAKGDKAK AEEFLLTEFA DKTQYSSLQY YHDNTLNVCL ASTYHFIDTV IDEVKKMHQA VGIPLVHYHI GADETAGAWV DSPACIAMKK QKADELAGLH SLNGYFIERV ANMLADKGII AAGWNDGMGE VRPENMPAQV QSNAWSLISD NGHQIAHKQV NLGWKVVLST PEVTYFDFPY VSHPDERGNH WAARAIDSFK VFSFMPDNLP AHAERWRNSL NQPFIADDSH SQLKPGHGFY GLQGHLWSEM VQSDEQAEYM LFPRMIALAE RAWHKASWEL AYDYQGKIYS QQTQHFASQL NGQAEQVLKQ DWQTFAAVYA NKVQPKLAKA GVFYRIAPPG ILVENQLLVL NSLYPNAELE YQLDSGPWLS YLQAFKLNDV KHIRARVKDG TRYSRPSTWQ NTL
|
| |