Gene Sbal223_2949 details


Gene Information

Locus tag: Sbal223_2949
Symbol:
ID: 7089017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 3475747
End bp: 3478488
Gene Length: 2742 bp
Protein Length: 913 aa
Translation table: 11
GC content: 49%
IMG OID: 643461834
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_002358858
Protein GI: 217974107
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
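
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 3475747
end_bp = 3478488

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2742 0 913
```

A remainder of 0 indicates the gene length is a whole number of codons, and the derived values match the Gene Length and Protein Length fields.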


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0525073
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAGGAC TAATAAAACA AAGAGGGATG ATTAAGTGGC AGGGGATTAA GACGGCCTTA 
ATCAGCGGCG TCATGGCCGT GCTTCCCGCG ATGCCAGTGC TTTCAATACT TGCGATGCCA
TCAGTACAGG CAGCCAACGC CGTGCAGGAG GCAAATCATG TCGCCCTGAA TCAATTGGCC
GATAAACTCC AATTAAATTA TCAGTTAACC GCCAGTTACC TTGAAACCTG CCCCGCAAAA
GAAACCCAAT GCTATCGCTC TGCCATTGAG CTAACCCTGC CCACTCAGCT ATCTAGCGAC
AAATTATCCA GTGAAGAGTT AAATCACGGC GACTGGCAAA TCTACTTCAG CCAATTATCG
CCCGTTTATT TTGTCGATGC GGGTGACTTT AACATTGAGC ATATCAATGG CGATTTACAC
AAAATCACTC CTAAAGCCAG CTTTAAGGGC TTCGTCGCCG ATCAGGTTTA TCGTATTGAG
TTTTATAGCC AAGGTAGCCA AATCACCCGT TCCGAGTTTA TACCAAACTT CTTTGTCGCT
ATGGGTGATA AAGCGCAGCT TATAAAGAGC ACTCAAACCC AGCGCAATAA AGAAACGGGG
TTGCCCGAGC AAGACTACCT CGCACCTTTT AATACCGAAA AACAATTCTT ACTGGCAAAA
AACGATGCAA CCCCCTTTGC CGATGGCCAG TATTTAGCCG AGGAATATCA AGCCTTAGCC
GCAGACAAAG CACCTGACAT TAGCGGCAAA CTTATCCCCA CGCCACTTCA CAGCCAAAGC
TTAGACTCGG CGCCTTTATC CCTTGCCAGA GGTTATCGCC TCACGGCTGC GACTGTAGAT
TTTAAGGGAC GACAGGCGGC GTTGGATTAC CTCGATTCTT TAGGCTTTTC GGCCAAGAAC
AAAGGCACGC CGCTGTTATT ACAGCTCGAT GCCCAAGCTC CTAAAGTCGC CAGCACTGAA
GCCCATGATA GTGAAATTAA AAGTAACGAA GCCTATCGGT TAACCATAAC GGCTGAGCAA
ATCAGTATTC GAGCGGGCAA TGAAGCGGGA CTCTTCTACG GTCTACAAAG CCTCGCAGGG
TTGATCAGTT TAAGTGACGA TCAACTGGTC GCCATCGAGA TCCAAGACCA GCCTCGCTAC
GCCTTTCGCG GCCTACACAT CGATTTAGCC CGTAACTTCC ACTCGCTAGA TTTTATTAAA
CGCATCATTC CACAGCTCGC CGCCTATAAA ATCAATAAGC TGCATTTGCA CTTAGCCGAC
GATGAAGGCT GGCGTTTAGC CATACCCGGC TTACCTGAAC TAACTGATGT GGGCGCAAAA
CGCTGTTTCG ATTTAACCGA GCAAACCTGT TTATTGCCAC AACTAGGCAG CGGAATTGCT
GATGCTAATC CCGTCGATGG CTACTTAACC GTGGCGCAAT ACCAAGAAAT ATTACAACTG
GCTGATGCTC ACCATATCGA AGTGATCCCT TCATTGGACA TGCCAGGGCA TTCCCGCGCC
GCGATAAAAT CCATGGAGGC ACGTTATCAC CATTACCTCG CCAAGGGAGA CAAAGCTAAG
GCGGAGGAGT TTTTACTGAC TGAATTTGCA GATAAAACCC AATATTCGTC GCTTCAGTAT
TATCACGACA ACACACTCAA TGTGTGCTTG GCGAGCACCT ATCACTTTAT CGATACAGTG
ATTGATGAAG TCAAAAAAAT GCACCAAGCC GTAGGCATTC CCTTAGTGCA TTACCACATA
GGCGCCGATG AAACCGCGGG CGCTTGGGTC GACTCTCCCG CGTGTATCGC AATGAAAAAA
CAAAAAGCCG ATGAGTTAGC TGGACTCCAC TCACTCAATG GCTACTTTAT CGAACGCGTT
GCTAACATGC TGGCAGATAA AGGGATTATT GCCGCTGGCT GGAATGATGG CATGGGTGAA
GTGCGCCCCG AAAACATGCC CGCCCAAGTG CAATCTAACG CTTGGTCACT GATTTCAGAC
AACGGCCATC AAATCGCCCA TAAGCAAGTC AATCTTGGCT GGAAGGTCGT GCTCTCAACG
CCGGAAGTCA CCTATTTTGA TTTTCCCTAT GTTAGCCATC CAGATGAGCG CGGTAATCAT
TGGGCTGCAA GGGCAATAGA CAGCTTTAAG GTGTTTAGCT TTATGCCCGA CAACCTGCCC
GCCCATGCCG AGCGCTGGCG CAATAGTTTA AATCAGCCAT TTATCGCGGA CGACAGCCAC
AGCCAGCTCA AGCCGGGACA CGGTTTTTAC GGCTTGCAGG GGCATCTTTG GAGCGAAATG
GTGCAATCGG ATGAGCAAGC AGAATACATG CTATTCCCGC GCATGATAGC CCTTGCAGAA
CGGGCATGGC ACAAGGCATC GTGGGAATTA GCCTATGATT ATCAAGGTAA AATCTATAGC
CAACAGACTC AACATTTTGC CAGCCAACTC AATGGCCAAG CCGAGCAAGT CCTCAAACAG
GATTGGCAAA CCTTCGCCGC GGTTTACGCC AATAAAGTGC AGCCTAAGTT AGCAAAAGCG
GGAGTTTTCT ACCGCATAGC ACCACCTGGG ATTCTTGTTG AAAATCAGCT GTTAGTGCTC
AATAGCTTAT ATCCCAATGC CGAGCTGGAA TACCAACTCG ACTCAGGACC TTGGCTAAGC
TACCTCCAAG CTTTCAAGCT TAATGATGTC AAACACATCC GCGCCAGAGT CAAAGATGGT
ACGCGTTACT CTCGCCCATC GACTTGGCAA AACACCCTCT AG
 
Protein sequence
MKGLIKQRGM IKWQGIKTAL ISGVMAVLPA MPVLSILAMP SVQAANAVQE ANHVALNQLA 
DKLQLNYQLT ASYLETCPAK ETQCYRSAIE LTLPTQLSSD KLSSEELNHG DWQIYFSQLS
PVYFVDAGDF NIEHINGDLH KITPKASFKG FVADQVYRIE FYSQGSQITR SEFIPNFFVA
MGDKAQLIKS TQTQRNKETG LPEQDYLAPF NTEKQFLLAK NDATPFADGQ YLAEEYQALA
ADKAPDISGK LIPTPLHSQS LDSAPLSLAR GYRLTAATVD FKGRQAALDY LDSLGFSAKN
KGTPLLLQLD AQAPKVASTE AHDSEIKSNE AYRLTITAEQ ISIRAGNEAG LFYGLQSLAG
LISLSDDQLV AIEIQDQPRY AFRGLHIDLA RNFHSLDFIK RIIPQLAAYK INKLHLHLAD
DEGWRLAIPG LPELTDVGAK RCFDLTEQTC LLPQLGSGIA DANPVDGYLT VAQYQEILQL
ADAHHIEVIP SLDMPGHSRA AIKSMEARYH HYLAKGDKAK AEEFLLTEFA DKTQYSSLQY
YHDNTLNVCL ASTYHFIDTV IDEVKKMHQA VGIPLVHYHI GADETAGAWV DSPACIAMKK
QKADELAGLH SLNGYFIERV ANMLADKGII AAGWNDGMGE VRPENMPAQV QSNAWSLISD
NGHQIAHKQV NLGWKVVLST PEVTYFDFPY VSHPDERGNH WAARAIDSFK VFSFMPDNLP
AHAERWRNSL NQPFIADDSH SQLKPGHGFY GLQGHLWSEM VQSDEQAEYM LFPRMIALAE
RAWHKASWEL AYDYQGKIYS QQTQHFASQL NGQAEQVLKQ DWQTFAAVYA NKVQPKLAKA
GVFYRIAPPG ILVENQLLVL NSLYPNAELE YQLDSGPWLS YLQAFKLNDV KHIRARVKDG
TRYSRPSTWQ NTL
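
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG can serve as an alternative start codon and is read as methionine in the initiator position. The sketch below illustrates this on the first row of the gene sequence. The function name `translate_cds` and its `first_codon_is_start` parameter are hypothetical, and the code simply assumes the first codon is a valid start rather than checking it against the table's list of permitted start codons.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code and differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna, first_codon_is_start=True):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Hypothetical helper: when first_codon_is_start is True, the first codon
    is reported as M without checking that it is a permitted start codon.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        if pos == 0 and first_codon_is_start:
            amino_acid = "M"  # initiator codon is read as methionine
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence, copied from the record.
first_row = "GTGAAAGGAC TAATAAAACA AAGAGGGATG ATTAAGTGGC AGGGGATTAA GACGGCCTTA"
print(translate_cds(first_row))  # prints: MKGLIKQRGMIKWQGIKTAL
```

The output matches the first 20 residues of the protein sequence. Without the initiator rule, the GTG codon would be read as V (valine).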