Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_3212 |
Symbol | |
ID | 7085825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 3808138 |
End bp | 3810945 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643462096 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_002359120 |
Protein GI | 217974369 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000385036 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA CACTCTCAGC AAGCGCCATT TTACTGGCTT TAGGATTAGC GGCCTGTTCC GAGGCGCCTG TGACGCCTTC CTCTGCGTCA GCCAATAAGG CTGTGGCTCA AACACAGCAA GCTCAGGACG CATTAACTCA AGCGCAGTTA CAACAACTAG GGGATTCGTT AGTCGTAACT TACCGTGTGG TGACCAATGT GCCAGACGAT AAGTGCCTAA AAGATAAAGC CGATGGTCGC TGTTTTGTCG CTCAAATTGA TTTCGTGCCC GGTATGGATT TAGCCAGCCA GGATTGGGAA ATTTATTTAA GCCAAATGCG CCCAGTGCAA TCGGTTGAAA GCGGTGAGTT GACGATAACG CACGTTAAAG GCGATTTACA TCGTATTACG CCAACGGCTG CTTTTAAGGG ATTTGTTAAG GGCGAGAAAA AGCAACTGAC ATTCCGTGGT GAGCCGTGGC AATTGGCTGA AACCGACGCT ATGCCAAACT ATTACCTCGC AGCAAACGGA TTAGAACCCG TCATTATCGC CAGCACTCAG GTAGGGAAAG ATGCCGAAAC GGGTTTAGAG ACGCGTCCGT ATGTTGAAGG GTTTACGAAT TTTAAAACCC AATATCAACG CACTGAAACG GATAAACTCG CGCCAGCCGA TGCCGCGCAA TTATTTAAAG ACAATCAAAG CGTGAGTGAA GATGCCGCCT TGGCTGTGGA TACCATTATC CCAACGCCAC AAAAGGTGAA GATACATTCA ACTGATACAC CAGTATCCCT TGCTGGTGGT ATTAAGCTCG AATATCAGCA AGCTAACCCA GAGCAAAAGC GTCAAGTTGC GGCGGCGGTG GCACGTTTAG CTCGCCTGGG GGTGGCTGAA TCTGCTAGTG GTATTACGGT GAAGTTTGCC AGCCAAAAAG GGGATGAGGG CAGTTATCGC TTAGATATTC AGCCAACTGA AATTGTGATC GCTGCCAATG ATGCGGCGGG CTTCTCTTAC GGTTTGTCAT CATTAGCAAG CCTCGTGGAT GTGAATTCGC TGAAAGCGAA TGCCATGACC ATTGAAGACA GCCCACGTTA TCCGTTCCGT GGTATGCACA TTGATGTGGC ACGTAATTTC CACAGCAAAC AATTGTTGCT GGATTTGCTT GACCAAATGG CGGCTTATAA GCTCAATAAA CTGCATTTAC ACATGGCTGA CGATGAGGGT TGGCGACTCG AAATCGACGG GCTACCTGAG CTGACCGACA TCGGTAGTAA ACGTTGCCAT GATTTAGATG AAAATACCTG CCTATTACCG CAGTTAGGCA GTGGGCCGTT TGGCGATACT AGTGTCAATG GCTATTACAC TAAACAAGAC TATATCGACA TAGTGAAATA TGCCGAGGCG CGTCAAATAC AAGTGATCCC ATCGATGGAC ATGCCGGGTC ATAGCCGCGC CGCGATTAAA GCGATGGATG CGCGTTTTCG TCGATTACAG GCTGAAGACG TAGAGCTTAA AAACACGGGC AAAGTGCAGG TTGTCCCTGT GCAGGAAACG GCAAATCAAG CTAGCACAGC GGTAAAAGCG TTAGATGCGG GCGCTCGTCA ATTGCAAGCT GAAGGGCAAA CCACTGCAGC TGAGCAATAT TTGTTATCCG ATGCAAACGA CAAAACGGTT TATTCTTCAG TGCAATATTA CGACGACAAC ACGCTTAACG TGTGTATGGA GTCGACCTAT CAGTTTGTCG ATAAGGTAAT TGATGAAATT GCCAAGTTAC ACCAAGCCGC GGGTCAACCA TTGACGCGTT ACCACATTGG TGCCGATGAA ACGGCGGGAG CTTGGAAACA ATCCCCCCAG TGTTTAGCAT TTGTTGCCAA TAACGATAAA GGCGTTAAGT CGATAGATGA CCTAGGCGCT TATTTTATCG AGCGAATTTC GACTATGTTA GCGGCCAAAG GTATTGAAGC TGCGGGGTGG AGCGATGGCA TGAGCCATGT TCGCCCTGAG AATATGCCCG CTAAAGTGCA GTCTAATATT TGGGATGTGA TTGCTCATAA AGGCTATGAG CGCGCCAATC AGCAGGCGAA TCTTGGTTGG GAAACCGTGC TCTCTAATCC TGAGGTGTTG TACTTTGATT TCCCCTATGA GGCCGATCCT AAGGAGCATG GCTATTACTG GGCGAGCCGC GCGACCAACA GCCATAAAGT CTTTAGCTTT ATGCCAGATA ACTTGGCCGC CAATGCCGAG CAATGGACCG ATATTCAAAA TCAGCCGTTC GAAGCGGACG ACAGAATAAA ACTTGATGAG GCGGGTAACA AAGTCTCGGG CCCAAGGGAG AAAGGCAAAC CTTTTACTGG CTTGCAAGGG CAGATTTGGA GTGAGTCGAT TCGCAGTGAC GATACGATCG AATACATGGT GTTCCCGCGT TTATTGATGT TGGCTGAGCG TGCTTGGCAT CAAGCTGCAT GGGAAGTGCC TTATCAATAT CAAGGCTCTG TGTATAACCA AAGCTCGGGC GCTTTTACCT TAGCCATGCG TAATGCACAG GCAAAGGATT GGCAACAACT TGCGAATACC TTAGGTCATA AAGAGTTTAT TAAGCTAGAT AAAGCGGGGA TTGATTATCG CGTACCGACG GTCGGCGCCG AGATCCGTGA GGGTAAACTG TTTGCTAATA TCGCCTATCC CGGACTCAAA CTGGAATACC GAAGCCTAAA TGGCCAATGG CAAGCTTATC AAGCGGGGCA GGCGGGGCAG GCGGTCACTG CACCGATTGA GGTCCGTGCC ATAGCTGCCG ATGGTATTCG TAAAGGCCGA AGTTTAATCG TTAACTAA
|
Protein sequence | MNKTLSASAI LLALGLAACS EAPVTPSSAS ANKAVAQTQQ AQDALTQAQL QQLGDSLVVT YRVVTNVPDD KCLKDKADGR CFVAQIDFVP GMDLASQDWE IYLSQMRPVQ SVESGELTIT HVKGDLHRIT PTAAFKGFVK GEKKQLTFRG EPWQLAETDA MPNYYLAANG LEPVIIASTQ VGKDAETGLE TRPYVEGFTN FKTQYQRTET DKLAPADAAQ LFKDNQSVSE DAALAVDTII PTPQKVKIHS TDTPVSLAGG IKLEYQQANP EQKRQVAAAV ARLARLGVAE SASGITVKFA SQKGDEGSYR LDIQPTEIVI AANDAAGFSY GLSSLASLVD VNSLKANAMT IEDSPRYPFR GMHIDVARNF HSKQLLLDLL DQMAAYKLNK LHLHMADDEG WRLEIDGLPE LTDIGSKRCH DLDENTCLLP QLGSGPFGDT SVNGYYTKQD YIDIVKYAEA RQIQVIPSMD MPGHSRAAIK AMDARFRRLQ AEDVELKNTG KVQVVPVQET ANQASTAVKA LDAGARQLQA EGQTTAAEQY LLSDANDKTV YSSVQYYDDN TLNVCMESTY QFVDKVIDEI AKLHQAAGQP LTRYHIGADE TAGAWKQSPQ CLAFVANNDK GVKSIDDLGA YFIERISTML AAKGIEAAGW SDGMSHVRPE NMPAKVQSNI WDVIAHKGYE RANQQANLGW ETVLSNPEVL YFDFPYEADP KEHGYYWASR ATNSHKVFSF MPDNLAANAE QWTDIQNQPF EADDRIKLDE AGNKVSGPRE KGKPFTGLQG QIWSESIRSD DTIEYMVFPR LLMLAERAWH QAAWEVPYQY QGSVYNQSSG AFTLAMRNAQ AKDWQQLANT LGHKEFIKLD KAGIDYRVPT VGAEIREGKL FANIAYPGLK LEYRSLNGQW QAYQAGQAGQ AVTAPIEVRA IAADGIRKGR SLIVN
|
| |