Gene Sbal223_1943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_1943 
Symbol 
ID7090110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp2299058 
End bp2300086 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content52% 
IMG OID643460847 
Productbeta-hexosaminidase 
Protein accessionYP_002357871 
Protein GI217973120 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.531769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.491355 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTATT TAATGTTGGA CCTACTGTCG TTGGACGTCA GTGAAGCAGA ATCTGAGATG 
CTACGCCATC CACAGGTGGG TGGTTTAATT CTATTTTCCC GCAATTTTTC AAGCCGTGAT
CAGCTGATCC GTCTGGTACA ACAGATACGT CAAATCCGCC CTGAACTATT AATTGCCGTC
GATCATGAGG GCGGTCGAGT ACAACGCTTC CGCGATGGCT TTACCGTTAT TCCCGCCATG
GGCGACATTT TACCCGCGGC AAAAGGTGAT ATAGCGTTAG CCAAACGCTG GGCCTGTGAG
CTGGGTTTCT TGATGGCGAT TGAACTACTG GCCTGCGATA TCGACTTAAG TTTCGCGCCC
GTATTGGACT TAAACGGCGT GAGCCAAGTG ATTGGTAAAC GCAGTTTTAG CCCTGAGCCT
GCCGAGGTGA TTACACTGGC CGAAAGCTTT ATTGCCGGCA TGGCCGCCGC TGGCATGGGC
GCCGTGGGTA AACATTTCCC CGGACACGGC AGTGTAGTGG CGGATTCGCA CTACGAGAAA
CCGATTGATG AGCGCGATGC CGAGGCGATT TTTGTGAAGG ATATCCTGCC GTTTAAAGAA
TTGATCGCAA AAGAAAAGTT ATTAGGCGTG ATGCCCGCGC ACGTGGTTTA TCCTAAAGTC
GACCCGAATT CTGCGGGCTT CTCTGAATAC TGGCTAAAAC AAGTGCTGCG CAAAGAACTT
GGCTTTAACG GGGTGATTTT CTCCGACGAT CTCGGCATGC AAGGCGCAGG ATTTGCGGGC
GATTACCGAG CAAGGGCCAG CGCGGCGTTA GCTGCCGGTT GCGACATGAT TTTAGTGTGT
AATGACAATG CGGGCGTAAT GTCGCTGCTG GATGGTTTTA CATGGCCAGC GAGTGCGCCG
CAGTATCCTG CAAGTTTACT CAAGCCCAAT GCCGCACAAA CGGCCGCAGC GCTCGATAAT
ACCGCCCGTT GGGAAAACGC GAAACAGCTT GCAGAGCAAA TTTGTTTAGC CCAACAGGCG
AAAGTTTGA
 
Protein sequence
MSYLMLDLLS LDVSEAESEM LRHPQVGGLI LFSRNFSSRD QLIRLVQQIR QIRPELLIAV 
DHEGGRVQRF RDGFTVIPAM GDILPAAKGD IALAKRWACE LGFLMAIELL ACDIDLSFAP
VLDLNGVSQV IGKRSFSPEP AEVITLAESF IAGMAAAGMG AVGKHFPGHG SVVADSHYEK
PIDERDAEAI FVKDILPFKE LIAKEKLLGV MPAHVVYPKV DPNSAGFSEY WLKQVLRKEL
GFNGVIFSDD LGMQGAGFAG DYRARASAAL AAGCDMILVC NDNAGVMSLL DGFTWPASAP
QYPASLLKPN AAQTAAALDN TARWENAKQL AEQICLAQQA KV