Gene Shewmr4_2936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2936 
Symbol 
ID4253507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3505848 
End bp3508538 
Gene Length2691 bp 
Protein Length896 aa 
Translation table11 
GC content51% 
IMG OID638119572 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_735064 
Protein GI113971271 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000338099 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.491372 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAA CGATTGCCGC CACCGCGATT TTATTGGCTC TGGGCTTAAC GGCCTGTAGC 
GATGTGCCTA AAACCGAGGC AGTGCCTAGC TCAAGTACTG CTGAGCAGGC TAAGCCCAAC
CAATTAACCC AAGCGCAATT GCAGCAGTTT GGGGATACCT TGGGCGTGAG TTATCGCGTG
CTCACTAACA GGCCCGACGA TAGCTGTGAC AAAGCCGCCG CCGAAGGTCG TTGTTTTGTT
GCTGAAATCG ATTTTGTCCC AGAGGTTGAG CTTAAGAGCC GTGATTGGGC GATTTATTTT
AGTCAAATGC GCCCAGTTCA AGCCGTTGAA AGCAAAGAGT TTAGTATTAC GCATATCAAG
GGCGATCTCT ATCGCATCGC GCCCACTGAG GCATTTAACG GTTTTAGCAA GGGCGAGAAA
AAGACCCTAA GGTTCCGCGG TGAGTTGTGG CAGCTCTCAG AAACCGATGC CATGCCCAAC
TATTACATAG TTGCAGGCGA TTTATCGCCA GTGGTGATCG CCAGCACGCA AGTGCAGCAA
GATCCTGAGA CACAGATGGA AGTGCGCCCC TATGTTGAGG CGTACACCGA TATGGTGAAG
CAGTATCGCC GTACCGATGC GGATAAATTA GCGCCTGCGA CGCCTGCGCA GTTGTTTAGC
AATAATCAAC AGGTCAGTGA AGATGCCAGC TTAGCGGTCA ATACCATTAT TCCGACGCCG
CAAAAAGTCG CTATCCACAG CCAAGATAAA GCCGTGTCAC TGACCTCTGG TATTAAACTG
GATTTCGGTT CAGTCACTAA CGCATCAGCC GCGCAACAAT CTTTGGATTC TAAACACTTG
GCGGCGGCAC TTTCCCGTTT AGCGCGCTTA GGCGTTAACG AGTCAGAACA AGGTGTAGCG
GTAAAGCTGA ACTGGCGTCA AGGCGCTGAG GGCAGCTATC TGCTGGATAT CAAAGCGGAT
GCTATAGATA TTGCCGCAGC CGATGCTGCC GGGTTTTCCT ATGCGTTATC GTCACTTGCT
AGTTTGATTG ATGTGCAGGA TCTGCGCGTA AATGCCATGA CGATTGAGGA TAGCCCAAGG
TATCCCTTCC GTGGCATGCA TATCGACGTG GCCCGTAACT TCCATAGCAA GGCGTTGGTC
TTTGATCTGT TGGATCAAAT GGCTGCATAC AAGCTCAACA AATTGCACCT ACATATGGCC
GACGATGAAG GTTGGCGCCT AGAGATCGAT GGCCTGCCGG AACTGACCGA CATCGGCAGT
AAGCGTTGCC ACGATCTTGA GGAAAATACC TGTCTGTTAC CGCAGCTTGG CAGCGGGCCT
TTTGCGGATG TGCCCGTCAA CGGTTTCTAT AGCAAACAAG ACTATATCGA CATCGTTAAA
TATGCCGATG CGCGCCAAAT TCAAGTGATC CCTTCGATGG ACATGCCTGG CCATAGCCGC
GCGGCGATAA AATCGATGGA GGCGCGTTAC CGTAAGCTAG TGGCCGAAGG TAAAGCTGAG
GAAGCTAAAA CCTATCTGTT GTCGGATGCG GCGGACACCA CAGTGTATTC TTCGGTGCAG
TATTACAACG ATAACACCTT GAACGTCTGC ATGGAGTCGA CCTATCTGTT TGTCGATAAA
GTCATTGATG AAATTGCAAA ATTACACCAA GCAGCGGGCC AGCCATTAAC CCGATACCAT
ATCGGTGCCG ACGAAACGGC AGGGGCGTGG AAGCAATCGC CAGCTTGTTT GGACTTTGTG
GCCAATAACG ATAAAGGCGT GAAATCCATC GACGATTTAG GGGCGTACTT TATTGAGCGC
ATTTCGAATC AGCTCGCGAG TAAAGGCATC GAAGCGGCGG GTTGGAGCGA TGGCATGAGC
CATGTTCGCC CAAGCAACAT GCCTGCTAAA GTCCAATCCA ACATTTGGGA TGTGATTGCC
TATAAGGGCT ATGAGCATGC CAATCAGCAA GTCAACAACG GCTGGGATGT GGTGCTCTCA
AACCCTGAGG TGCTTTATTT TGATTTCCCC TATGAAGCCG ATCCTAAGGA GCATGGCTAC
TACTGGGCAA GCCGCGCAAC CAACGCGCAC AAAGTGTTTA GCTTTATGCC GGACAACTTA
GTGGCAAACG CGGAGCAGTG GACCGATATT CAAAATCTAC CTTTTGAGGC CGACGATAGG
GCCAGAACCG ATGAGAAGGG CAAACAATCT GGCCCAAGGG AACAAGGTAA AGCCTTTGCC
GGATTACAAG GCCAGCTCTG GAGTGAAACC ATCCGCAGTG ATAACACCGT GGAATATATG
ATTTTCCCGC GTTTATTGAT GCTGGCGGAG CGTGCTTGGC ATCAAGCCGC GTGGGAAGTG
CCATATCAAT ATCAAGGCGC TTTGTATAAT CAAACTACGG GACATTTCAC CGCTGCTATG
CGTGAGGCTC AGGCGCAATC TTGGCAGCAA ATGGCTAACA CCTTAGGACA TAAGGAGTTT
ATCAAACTCG ATAAAGCGGG TATCGATTAC CGAGTGCCGA CCGTCGGCGC CGAGATCCGT
GACGGTAAAC TGTTTGCCAA CGTCGCTTAT CCAGGACTCA AGATTGAATG GCGCCAAGCG
AGTGGGCAAT GGCAATCCTA TCAAGCTGGG CAGGCGGTAA CGGGCCCTGT TGAGATCCGC
GCCATCGCGG CCGATGGAAA ACGTAAAGGC CGCAGCTTAG TCGTTAATTA A
 
Protein sequence
MNKTIAATAI LLALGLTACS DVPKTEAVPS SSTAEQAKPN QLTQAQLQQF GDTLGVSYRV 
LTNRPDDSCD KAAAEGRCFV AEIDFVPEVE LKSRDWAIYF SQMRPVQAVE SKEFSITHIK
GDLYRIAPTE AFNGFSKGEK KTLRFRGELW QLSETDAMPN YYIVAGDLSP VVIASTQVQQ
DPETQMEVRP YVEAYTDMVK QYRRTDADKL APATPAQLFS NNQQVSEDAS LAVNTIIPTP
QKVAIHSQDK AVSLTSGIKL DFGSVTNASA AQQSLDSKHL AAALSRLARL GVNESEQGVA
VKLNWRQGAE GSYLLDIKAD AIDIAAADAA GFSYALSSLA SLIDVQDLRV NAMTIEDSPR
YPFRGMHIDV ARNFHSKALV FDLLDQMAAY KLNKLHLHMA DDEGWRLEID GLPELTDIGS
KRCHDLEENT CLLPQLGSGP FADVPVNGFY SKQDYIDIVK YADARQIQVI PSMDMPGHSR
AAIKSMEARY RKLVAEGKAE EAKTYLLSDA ADTTVYSSVQ YYNDNTLNVC MESTYLFVDK
VIDEIAKLHQ AAGQPLTRYH IGADETAGAW KQSPACLDFV ANNDKGVKSI DDLGAYFIER
ISNQLASKGI EAAGWSDGMS HVRPSNMPAK VQSNIWDVIA YKGYEHANQQ VNNGWDVVLS
NPEVLYFDFP YEADPKEHGY YWASRATNAH KVFSFMPDNL VANAEQWTDI QNLPFEADDR
ARTDEKGKQS GPREQGKAFA GLQGQLWSET IRSDNTVEYM IFPRLLMLAE RAWHQAAWEV
PYQYQGALYN QTTGHFTAAM REAQAQSWQQ MANTLGHKEF IKLDKAGIDY RVPTVGAEIR
DGKLFANVAY PGLKIEWRQA SGQWQSYQAG QAVTGPVEIR AIAADGKRKG RSLVVN