Gene Shewmr4_1726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1726 
Symbol 
ID4252300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2051873 
End bp2052901 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content50% 
IMG OID638118337 
Productbeta-hexosaminidase 
Protein accessionYP_733857 
Protein GI113970064 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.153905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTATC TAATGTTGGA TCTGCTGTCT TTAGATGTTA GCGAAGCTGA AGCCGAGATG 
CTGCGCCATC CGCAGGTGGG TGGTTTGATC CTGTTTTCCC GCAATTTTTC CAGCCGTGAG
CAGTTAATTG CCTTAGTGCA ACAGATCCGC CAAATTCGTC CCGAGATTTT GATTGCCGTT
GACCATGAGG GCGGGCGAGT ACAGCGTTTC CGTGAAGGAT TTACCCTGAT CCCGGCCATG
GGGGATATTT TGCCCGCGGC TAAGGGCGAT ATGGTTCTTG CTAAACGTTG GGCCTGTGAA
TTAGGCTTTT TAATGGCTAT CGAATTACTC GCCTGTGACA TCGACTTAAG TTTTGCGCCT
GTGCTCGATC TTAACGGCAT CAGCCAAGTC ATTGGTAAAC GTAGTTTTAG CGCAAAGCCC
GATGAAGTGA TTGCGTTAGC GCAGAGTTTT ATTGAGGGTA TGGCCGAGGC AGGCATGGGC
GCTGTGGGTA AGCATTTCCC CGGTCATGGT AGCGTGGCGG CGGACTCCCA TATCGCGCAA
CCTATCGATG AGCGTGAAGC TGAGGCGATT TTTAATCAAG ATATTTTGCC ATTTAAAGAA
CTGATTGCTA AGGGTAAATT ATCGGGCATT ATGCCAGCCC ATGTCATTTA CCCTAAAGTT
GACCCTAATC CTGCGGGCTT TTCAAGCTAC TGGTTAAAGC AGATCCTACG CAAAGAGCTG
GGCTTTAACG GGGTGATTTT CTCCGACGAT CTGGGGATGA AGGGCGCTGC CTTTGCAGGA
GATTATTTAG GCCGTGCCCA AGCTGCGTTG GATGCGGGCT GCGATATGAT TTTGGTCTGT
AACGATAATC CGGGCGTTAT GTCACTGTTA AATGGCTTTG TGTGGCCCGC CGCGGCGCCA
CAACATCCTG CGAGTTTACT CAAACCCAAT GCAGCGCAGA CAGCTATCGC ATTAGAAAAT
GCTAGCCGTT GGGAAAATGC CAAGCAGCTG GCTGAGCAAA TCCAACTCGC ACAACAGGCA
AAAGTTTGA
 
Protein sequence
MSYLMLDLLS LDVSEAEAEM LRHPQVGGLI LFSRNFSSRE QLIALVQQIR QIRPEILIAV 
DHEGGRVQRF REGFTLIPAM GDILPAAKGD MVLAKRWACE LGFLMAIELL ACDIDLSFAP
VLDLNGISQV IGKRSFSAKP DEVIALAQSF IEGMAEAGMG AVGKHFPGHG SVAADSHIAQ
PIDEREAEAI FNQDILPFKE LIAKGKLSGI MPAHVIYPKV DPNPAGFSSY WLKQILRKEL
GFNGVIFSDD LGMKGAAFAG DYLGRAQAAL DAGCDMILVC NDNPGVMSLL NGFVWPAAAP
QHPASLLKPN AAQTAIALEN ASRWENAKQL AEQIQLAQQA KV