Gene Snas_5877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5877 
Symbol 
ID8887093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6237333 
End bp6238973 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content71% 
IMG OID 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003514598 
Protein GI291303320 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGAAT CGTTGATCCC CCAGCCGACC TCGGTCGAAG CCCACCCCGG CGGTCCGTTC 
ACTCTCGACG GTGACACCCG CATCGTCGCC ACCGAAGCCG CCGCCGAGGC GGGCTGGCTG
CTGCACGACT ACCTGCGGGC CGGAACCGGC CTGACCGTCC CGGTCACCGA CCGGGCCGAC
GGCGGCGCCA TCACCCTCGA ACTGTCCGGC GACCGTCCCG CTGACGGCAC CACCGCCGCC
GAGGCCTACC GGCTCGACGT CGACGCGGAC GGGGTCCGGC TGTCCGCCGC CCACCCCGCC
GGACTGTCAC GGGCGGTGCA GACGCTGCGG CAGCTGCTGC CCGCCGAGAC GCTGCGCAGC
GCGCCGGTGG GCACCACCCC GGTGACCCTG GCCGCGGTGT CGATTCAGGA CGAACCGCGG
TTCGCCTGGC GCGGCGTGAT GCTGGACGTG GCGCGGCACT TCCAGCCCAA GGAGTTCGTG
CTGCGCATGA TCGACCTGGC GGCGCTGCAC CGGCTCAACG TCGTCCAGCT GCACCTGACC
GACGACCAGG GCTGGCGGCT GGAGGTCCCC GGCCGTCCCA AGCTCACCGA GATCGGTTCC
TGGCGCCCCG AGACCGTGAT CGGGCACGCC CTCGACGACA CCAAGGGCTA CGACGGCACC
CCGCACGGCG GCTACTACAC CGCCGCCGAC CTGCGCGAGA TCGTCGCCTA CGCCGCCCGC
CGCCACATCA CCGTGGTGCC CGAGATCGAC CTGCCCGGCC ACGTGCGCTC GGTGCTGGCC
GGTTATCCCG AACTGGGCAA CACCGGCGAG CCCACCACCG TGGCCACCAC CTTCGGCATC
TTCTCCGAGG TCCTGGCGCC CACCGAGGCG GCGCTCGACT TCGCCCGCGA GGTCTTCGAC
ACCGTCGTCG ACATCTTCCC GTCTCCGTAC ATCCACATCG GCGGCGACGA ATGCCCCCGC
ACCGAATGGC GCGACAGCCC GGCGGCGCGG GACAAGGCGA AGGAACTGGG ACTGAGCAGC
GTCGACCTGT TGCAGTCCTG GTTCACGAAG AACTTCGCCG AGCACCTGGC CGGACACGGC
CGCCAGATCA TCGGCTGGGA CGAGATCCTG GACGGTGGCG CGCCCGACGA CGCGGTCATC
GCGGTGTGGC GGGACTTCTC GATCGCCGCG AAGGCGGCGG CCAAGGGCCA CAAGGTGATC
GTGGCCCCGG TACAGGCGAC CTATCTGGAC TACTACGAGT CCACCGACGC CGAGGAGCCG
CTGCGGATCT TCAAGAACAT CTCCGTCGAC ACGATCGCGG AGTTCGAGCC GGTTCCCGAG
GGCACCTCCG ACGAGCTCAT CTTCGGCGTC CAGGCTCAAC TGTGGAGCGA GTACCTGCCG
GTGCCCTCGG CGGTGGAGTA CGCCGCCTTC CCGCGGCTGT CGGCCATCGC CGACGTCGCG
TGGTCGACAC CCGAGGCGCG CACCGCCTCG CCGGTGACCG GACGGCTGGA GGAGCACCTC
AAGCGCCTGG ACGCGCTGGG CGTCAACTAC CGGCCGCCGT CGGGCCCGAG GCCGTGGCAG
AAGGGTGGCA CGGGAGCGCG GGCACGCTGG GACCTGGACG ACACCTCGAG GGACGAGACC
CCGGACCTGC CGGAGTTCTA G
 
Protein sequence
MYESLIPQPT SVEAHPGGPF TLDGDTRIVA TEAAAEAGWL LHDYLRAGTG LTVPVTDRAD 
GGAITLELSG DRPADGTTAA EAYRLDVDAD GVRLSAAHPA GLSRAVQTLR QLLPAETLRS
APVGTTPVTL AAVSIQDEPR FAWRGVMLDV ARHFQPKEFV LRMIDLAALH RLNVVQLHLT
DDQGWRLEVP GRPKLTEIGS WRPETVIGHA LDDTKGYDGT PHGGYYTAAD LREIVAYAAR
RHITVVPEID LPGHVRSVLA GYPELGNTGE PTTVATTFGI FSEVLAPTEA ALDFAREVFD
TVVDIFPSPY IHIGGDECPR TEWRDSPAAR DKAKELGLSS VDLLQSWFTK NFAEHLAGHG
RQIIGWDEIL DGGAPDDAVI AVWRDFSIAA KAAAKGHKVI VAPVQATYLD YYESTDAEEP
LRIFKNISVD TIAEFEPVPE GTSDELIFGV QAQLWSEYLP VPSAVEYAAF PRLSAIADVA
WSTPEARTAS PVTGRLEEHL KRLDALGVNY RPPSGPRPWQ KGGTGARARW DLDDTSRDET
PDLPEF