Gene Information |
Locus tag | Shew185_1143 |
Symbol | |
ID | 5369442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS185 |
Kingdom | Bacteria |
Replicon accession | NC_009665 |
Strand | + |
Start bp | 1360286 |
End bp | 1363084 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640829342 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001365358 |
Protein GI | 152999677 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
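The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive positions and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Gene Information fields above.
length = gene_length_bp(1360286, 1363084)
print(length, protein_length_aa(length))
```

Under these assumptions the results agree with the Gene Length (2799 bp) and Protein Length (932 aa) fields.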
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000565131 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CACTCTCAGC AAGCGCCATT TTACTGGCTT TAGGATTAGC GGCCTGTTCC GAGGCGCCTG TGACGTCTTC CGCAGCGTCA GCCGATAAGG CTGTGGCTCA AACACAGCAA GCTCAGGACG CATTAACTCA AGCGCAGTTA CAACAACTCG GGGATTCGTT AGTCGTAACT TACCGTGTTG TGACCAATGT GCCAGACGAT AAGTGCCTAA AAGATAAAGC CGATGGTCGC TGTTTTGTCG CTCAAATTGA TTTCGTGCCC GGTATGGATT TAGCCAGCCA GGATTGGGAA ATTTATTTAA GCCAAATGCG CCCAGTGCAA TCGGTTGAAA GCGGTGAGTT GACGATAACG CACGTTAAAG GCGATTTACA TCGTATTACG CCAACGGCTG CTTTTAAGGG ATTTGTTAAG GGCGAGAAAA AGCAACTGAC ATTCCGTGGT GAGCCGTGGC AATTGGCTGA AACCGACGCT ATGCCAAACT ATTACCTCGC AGCAAACGGA TTAGAACCCG TCATTATCGC CAGCACTCAA GTGGGTAAAG ATGCCGAAAC GGGTTTAGAG ACGCGTCCGT ATGTTGAAGG GTTTACGAAT TTTAAAACCC AATATCAACG CTCTGAAACG GATAAACTCG CGCCAGCCGA TGCCGCGCAA TTATTTAAAG CTAATCAAAG CGTGAGTGAA GATGCCGCCT TAGCTGTGGA TGCCATTATC CCAACGCCAC AAAAGGTGAA GATCCATTCA ACTGATACAC CAGTATCTCT TGCTGGTGGT ATTAAGCTCG AATATCAGCA AGATAACCCA GAGCAAAAGC GTCAAGTTGC GGCAGCGGTG GCACGTTTAG CTCGCCTGGG TGTGGCTGAA TCTGCGAGTG GCATTACGGT GAAATTTGCC AGCCAAAAAG GGGATGAGGG CAGTTATCGC TTAGATATTC AGCCAACTGA AATTGTGATC GCTGCCAATG ATGCGGCGGG CTTCTCTTAC GGTTTGTCAT CATTAGCAAG CCTTGTGGAT GTGAATTCGC TGAAAGTGAA TGCCATGACC ATTGAAGACA GCCCACGTTA TTCGTTCCGT GGTATGCACA TTGATGTGGC ACGTAATTTC CACAGCAAAC AATTGTTGCT GGATTTGCTC GACCAAATGG CGGCTTATAA GCTCAATAAA CTGCATTTAC ACATGGCTGA CGATGAAGGT TGGCGTTTAG AAATCGACGG GCTACCTGAG CTGACTGACA TCGGTAGTAA ACGTTGTCAT GATTTAGATG AAAATACCTG CTTATTACCG CAGTTAGGCA GTGGACCGTT TGGCGATACA AGCGTCAATG GCTATTACAC CAAACAAGAC TATATCGACA TAGTGAAATA TGCCGAGGCG CGTCAAATTC AAGTGGTCCC ATCGATGGAT ATGCCGGGTC ATAGCCGCGC CGCGATTAAA GCGATGGATG CGCGTTTTCG TCGATTACAG GCTGAAGACG TAGAGCTTAA AAACACGGGC AAAGTGCAGG TTGTTCCTGT GCAGGAAACG GCAAATCAAG CTAGCACAGC GGTAAAAGCG TTAGATGCGG GCGCTCGTCA ATTACAAGCT GAAGGGAAAA CCACTGCAGC TGAGCAATAT TTGTTATCCG ATGCAAACGA CAAAACGGTT TATTCTTCAG TGCAGTATTA CGACGACAAC ACGCTTAACG TGTGTATGGA GTCGACTTAT CACTTTGTCG ATAAAGTGAT TGATGAAATT GCCAAGTTAC ACCAAGCCGC GGGTCAACCA TTGACGCGTT ACCACATTGG TGCCGATGAA 
ACGGCGGGAG CTTGGAAACA ATCACCCCAG TGTTTAGCAT TTGTCGCCAA TAACGATAAA GGTGTTAAGT CGATAGATGA CTTAGGCGCT TATTTTATTG AACGAATTTC GACTATGTTA GCGGCAAAAG GCATTGAAGC TGCGGGGTGG AGCGATGGTA TGAGTCATGT TCGTCCTGAG AATATGCCCG CTAAAGTGCA GTCTAATATT TGGGATGTGA TTGCCCATAA AGGTCATGAG CGCGCCAATC AACAGGCGAA TCTTGGTTGG GAAACCGTGC TCTCTAATCC TGAGGTGCTG TACTTTGACT TCCCCTATGA GGCCGATCCC AAGGAGCATG GCTATTACTG GGCGAGCCGC GCGACCAACA GCCATAAAGT CTTTAGCTTT ATGCCAGATA ACTTGGCCGC CAATGCCGAG CAATGGACCG ATATTCAAAA TCAGCCGTTC GAAGCGGACG ACAGAATAAA ACTTGATGAG GCGGGTAACA AAGTCTCGGG CCCAAGGGAG AAAGGCAAAC CTTTTACTGG CTTGCAAGGG CAGGTTTGGA GTGAGTCGAT TCGCAGTGAC GATACGATCG AATACATGGT GTTCCCGCGT TTATTGATGT TGGCTGAGCG TGCTTGGCAT CAAGCTGCAT GGGAAGTGCC TTATCAATAT CAAGGCGCTG TGTATAACCA AAGCTCGGGT GCTTTTACCT TAACCATGCG TGATGCACAG GCAAAGGATT GGCAACAACT CGCGAATACT TTAGGCCACA AAGAGTTTAT TAAGTTAGAT AAAGCGGGGA TTGATTATCG CGTACCGACG GTCGGCGCCG AGATCCGTGA AGGTAAACTG TTTGCTAATA TCGCCTATCC CGGACTCAAA CTGGAATACC GAAGCCTAAA TGGCCAATGG CAAGCTTATC AAGCGGGGCA GGCGGTCACT GCACCGATTG AGGTCCGTGC AATAGCTGCC GATGGTATTC GTAAAGGCCG AAGTTTAATC GTTAACTAA
|
Protein sequence | MKKTLSASAI LLALGLAACS EAPVTSSAAS ADKAVAQTQQ AQDALTQAQL QQLGDSLVVT YRVVTNVPDD KCLKDKADGR CFVAQIDFVP GMDLASQDWE IYLSQMRPVQ SVESGELTIT HVKGDLHRIT PTAAFKGFVK GEKKQLTFRG EPWQLAETDA MPNYYLAANG LEPVIIASTQ VGKDAETGLE TRPYVEGFTN FKTQYQRSET DKLAPADAAQ LFKANQSVSE DAALAVDAII PTPQKVKIHS TDTPVSLAGG IKLEYQQDNP EQKRQVAAAV ARLARLGVAE SASGITVKFA SQKGDEGSYR LDIQPTEIVI AANDAAGFSY GLSSLASLVD VNSLKVNAMT IEDSPRYSFR GMHIDVARNF HSKQLLLDLL DQMAAYKLNK LHLHMADDEG WRLEIDGLPE LTDIGSKRCH DLDENTCLLP QLGSGPFGDT SVNGYYTKQD YIDIVKYAEA RQIQVVPSMD MPGHSRAAIK AMDARFRRLQ AEDVELKNTG KVQVVPVQET ANQASTAVKA LDAGARQLQA EGKTTAAEQY LLSDANDKTV YSSVQYYDDN TLNVCMESTY HFVDKVIDEI AKLHQAAGQP LTRYHIGADE TAGAWKQSPQ CLAFVANNDK GVKSIDDLGA YFIERISTML AAKGIEAAGW SDGMSHVRPE NMPAKVQSNI WDVIAHKGHE RANQQANLGW ETVLSNPEVL YFDFPYEADP KEHGYYWASR ATNSHKVFSF MPDNLAANAE QWTDIQNQPF EADDRIKLDE AGNKVSGPRE KGKPFTGLQG QVWSESIRSD DTIEYMVFPR LLMLAERAWH QAAWEVPYQY QGAVYNQSSG AFTLTMRDAQ AKDWQQLANT LGHKEFIKLD KAGIDYRVPT VGAEIREGKL FANIAYPGLK LEYRSLNGQW QAYQAGQAVT APIEVRAIAA DGIRKGRSLI VN
|
| |