Gene Sama_0943 details


Gene Information

Locus tag: Sama_0943
Symbol:
ID: 4603195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 1140745
End bp: 1143399
Gene Length: 2655 bp
Protein Length: 884 aa
Translation table: 11
GC content: 56%
IMG OID: 639780278
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_926820
Protein GI: 119774080
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
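
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: the start and end positions are 1-based and inclusive, and the gene length includes the terminating stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1140745
END_BP = 1143399
REPORTED_GENE_LENGTH = 2655     # bp
REPORTED_PROTEIN_LENGTH = 884   # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count, assuming the CDS ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 2655
print(protein_length(length))  # 884
```

Under those assumptions both computed values match the reported 2655 bp and 884 aa.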


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.125845
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTT CCTTAGCCGC ATCTGCGATT TTTCTCGCCT TGGGTGTCAG TGGGTGCTCA 
CAAGCCCCGG TCGAGCCCGC TGCCGTAGCC GCTGAGCCTT CGGTGTTAAC CCAGGCCAGT
TTACAGGCCT TCGCCGATGG GCTGGACGTG AAATACCGGG TGGTAACCAA CAGACCCGAT
GAACAGTGCA AAAAAGACGC CGGTGAAGGC CGCTGCTTTC AGGCCGAAAT TGTGCTGACA
TCGCCCATCG ACTTCGACGG TCGCGATTTT GAAATCTATT ACAGCCAGAT GCGCCCGGTG
CAGTCGGTGC AAAGCACAGA CTTTGTGATT GAGCATGTTA AAGGCGATTT GCATCGCATC
AAGCCAACCG GGAATTTCGG CGGATTCAAG GCCAACCAGA GCCAGACCCT GGCGTTTCGC
GGCGAGCTGT GGCAGCTGTC CGAAACCGAT GCCATGCCCA ACTACTACAT TACAGCGCCG
GGGCTTAAGC CTGTGGTAAT TGCCAGTACC AGGCTGGCGG TCGAGGCTGA AACCGGTCTT
GAGCTGCGCC CTTACGTCGA AGCCTTTACC GATGCCGACA AGCAGTATCG TCGTACCGAT
AACGACAAAC TGCCATGGGC AACGGCGCCA GTGCTGTTTG AGGCGAATCA GTCGCTTGTA
GTTGACCCTG CCGCGGCGGC CCAGCGCATA GTGCCTGCGC CTGTGTCGCA AACATTTGAA
GCGGGTCTTG GCACGCTGGA CTTGTCCGGC GGTATTGCCG TTGAACTGCC ACAGGGGCTG
GATAAAGCCG CCATCGACGC TGCGCTGGCA CGTCTTGCCC GCCTGGGTAT TGAGGAGGAC
TCATCTGGTG CCAGGATTAA GCTGGTAAGT GATGACTCAC TTGGTGCAGA AGCCTATCAG
CTGGTTATTC GCCCAGAGGG CGCCGAGATT AAAGCCGCCA CCGATGCCGG CTTTGCCTAT
GGTCTGTCGT CACTGGCAGC GCTTGTGCAG CCAGGCAAGC CAGCTATCAG TGCGCAAACC
ATCACTGATG CACCAAGATA CGGCTTTCGC GGCATGCACG TGGATGTGTC CCGCAACTTC
CACTCCAAGG CGTTTATGCT AAGCCTGCTT GACCAGATGG CGGCCTATAA ACTTAACAAG
CTGCACCTGC ACATGGGCGA TGATGAAGGC TGGCGCCTCG AAATTGATGG TTTGCCGGAG
CTGACCGAGA TAGGCAGCAA ACGTTGTCAC GACCTTGCCG AAGATACCTG CCTGCTGCCG
CAGCTTGGCA GTGGGCCTGA TGCCGATGTG AAGGTAAACG GGTTCTACAG CAAGGCCGAC
TATATTGAGA TCCTGAAATA TGCCTCAGCC AGACAAATTC AGGTGATCCC GTCCATGGAC
ATGCCCGGCC ACAGCCGCGC TGCCGTGAAA GCCATGGAAG TCAGATACCG CCGCCTGGCG
GAAGTGGGTG ATATCAAGGG CGCTGAAGAA TATCGCCTTA TCGATCCGGA AGATAAGACG
GTTTACAGTT CTATTCAGTA CTACGACGAC AACACCCTCA ACGTGTGCAT CGAGTCGACT
TACCACTTTG TTGACAAGGT AATCGACGAA ATTGCCAAAC TGCATAAAGA AGCCGGACAG
CCGCTGACCC GTTATCACAT TGGCGCCGAT GAAACCGCCG GTGCCTGGCT TGAGTCGCCA
AAGTGCGAAG CCTTTGTTGC AAATAACGAC AAGGGAGTGA CCAACAAGAG TGAGCTGGGT
GCTTACTTTA TCGAGCGGGT GGCCAACGTG CTGCATGACA AGGGCATTGA GCCAGCCGGT
TGGAGTGATG GCATGAGCCA CACCCGCCCG GAAAAAATGC CTGCCATGAA CCAGAGCAAC
ATCTGGGATG TGGTTGCTCA CAAAGGTCAT CAGCGTGCCC ACAAGCAGGC TAACCTCGGC
TGGGAAATCG TGCTGTCTAA CCCTGAGGTG CTGTATTTCG ATTTCCCGTA CGAGGCCGAT
CCGAAGGAGC ATGGCTACTA CTGGGCGAGC CGCGCCACCA ACAGCCAAAA GGTTTTCGGC
TTTATGCCGG GTAACCTGCC GGCCAATGCC GAGCAGTGGC TGGATATTGA GAACAATCCC
TTCGAAGCCG ACGACACAAA GCAGCAGGAT GAGGCGGGCA AGGTTCTCAG CGAGCCTATG
CAGCCTGGTA AGGCCTTCTA TGGCGTGCAG GGCCAGCTGT GGAGCGAGAC CATTCGCAGT
GACGAGCAGG CACAGTACAT GATTTTCCCC CGCCTGCTGA TGTTGGCGGA GCGTGCCTGG
CACAAGCCGG GTTGGGAAGT ACCCTACAAC CATGAAGGCG CGCTCTATAA CCAGAGCAGT
GGCAGTTTCA GCGCCGAAGC CCGTGCGCTG CAGGCCGCAG ATTGGCAGCA GATGGCCAAC
ACCCTTGGCC ATAAAGAGCT GGCCAAGCTG GATTTGGCCG GTGTGCACTA CCGTGTCCCC
ACAGTCGGTG CCCGTATTGA GGATGGTAAG TTGTCGGCCA ATATCGCTTT TCCCGGCCTT
GGCATCGAAT ACCGTGAAGC CGATGGCAAC TGGCAGCCTT ATCTGGCGCC GGTAGTGGTA
ACCAAGTTAC CGGTTGAAGT GCGAGGTATT GCCGCAGACG GTAAGCGTAA GGGCAGAACC
CTGAAGGTGA AATAA
 
Protein sequence
MKFSLAASAI FLALGVSGCS QAPVEPAAVA AEPSVLTQAS LQAFADGLDV KYRVVTNRPD 
EQCKKDAGEG RCFQAEIVLT SPIDFDGRDF EIYYSQMRPV QSVQSTDFVI EHVKGDLHRI
KPTGNFGGFK ANQSQTLAFR GELWQLSETD AMPNYYITAP GLKPVVIAST RLAVEAETGL
ELRPYVEAFT DADKQYRRTD NDKLPWATAP VLFEANQSLV VDPAAAAQRI VPAPVSQTFE
AGLGTLDLSG GIAVELPQGL DKAAIDAALA RLARLGIEED SSGARIKLVS DDSLGAEAYQ
LVIRPEGAEI KAATDAGFAY GLSSLAALVQ PGKPAISAQT ITDAPRYGFR GMHVDVSRNF
HSKAFMLSLL DQMAAYKLNK LHLHMGDDEG WRLEIDGLPE LTEIGSKRCH DLAEDTCLLP
QLGSGPDADV KVNGFYSKAD YIEILKYASA RQIQVIPSMD MPGHSRAAVK AMEVRYRRLA
EVGDIKGAEE YRLIDPEDKT VYSSIQYYDD NTLNVCIEST YHFVDKVIDE IAKLHKEAGQ
PLTRYHIGAD ETAGAWLESP KCEAFVANND KGVTNKSELG AYFIERVANV LHDKGIEPAG
WSDGMSHTRP EKMPAMNQSN IWDVVAHKGH QRAHKQANLG WEIVLSNPEV LYFDFPYEAD
PKEHGYYWAS RATNSQKVFG FMPGNLPANA EQWLDIENNP FEADDTKQQD EAGKVLSEPM
QPGKAFYGVQ GQLWSETIRS DEQAQYMIFP RLLMLAERAW HKPGWEVPYN HEGALYNQSS
GSFSAEARAL QAADWQQMAN TLGHKELAKL DLAGVHYRVP TVGARIEDGK LSANIAFPGL
GIEYREADGN WQPYLAPVVV TKLPVEVRGI AADGKRKGRT LKVK
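
The protein sequence follows from the gene sequence by codon-wise translation under translation table 11. The sketch below is a minimal, dependency-free illustration, not the pipeline that produced this record. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code; the alternative start codons that table 11 permits are not handled, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and only the first three and last two sequence blocks are used as input.

```python
# Codon table in the conventional TCAG ordering; the amino-acid
# assignments of translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence above.
print(translate("ATGAAATTTT CCTTAGCCGC ATCTGCGATT"))  # MKFSLAASAI
print(translate("CTGAAGGTGA AATAA"))                  # LKVK
```

The two outputs correspond to the first ten and last four residues of the protein sequence listed above.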