Gene Snas_5043 details

Gene Information

Locus tag: Snas_5043
Symbol:
ID: 8886250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 5351540
End bp: 5353630
Gene Length: 2091 bp
Protein Length: 696 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Alpha-N-acetylglucosaminidase
Protein accession: YP_003513773
Protein GI: 291302495
COG category:
COG ID:
TIGRFAM ID:
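
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5351540
end_bp = 5353630

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2091 696
```

Under these assumptions the result matches the listed Gene Length (2091 bp) and Protein Length (696 aa).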


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.424507
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCACGGC AGGATCTGGA ATCGGCATAC GCGGCGATGC GGCGGCTGGC GCCGCGGCTG 
TGGGAGACCC ACCGCGAGAA GTTCACCGTG GATATCACCG GCGAGGCCGC CGACGGCTAC
CGTTTCGCCG CCGACGGCGA GCGGCTGTCG CTGTCGGGCA ACGACATCGG CTCGGTCCTG
ACCGGTTTCC GGCACTATCT GGAGTCCAGC CACCTCGGTC ACATCAGCCG CGGCGGCGAC
CGCTTCACCG TTCCCGAGAC GCTGCCGCTT CCGGAGCGGC CGGTGTCGCG CACCAGTCCG
CACCGGTTCC GCTACGCCAC CAACTTCACC GTCACCGGCT ACACGTCGCC GTACTGGCAG
TGGCCGCGGT GGGAGCGGGA GCTGGACCTG CTGGCCGCCT CGGGCATCAA CCTGTCGCTG
GTCACCGTCG GCACCGACGC GGTCTGGCTG GACACGTTCG GCGAGTTCGG TTTCGACGAG
AAGACGCTGC TGTCGTGGAT CGCTCCCCCG GCCCACAACC CGTTCCACCA GATGGGCTGC
ATGTGCGGCT TCGGCGGGGT GTCGCGGCGG CTCGTCGAGG AACGCGCCGA GCTGGGACGC
CGCATCACCG ACCGGATGCG GGAGTTGGGC ATCGAGCCGG TGCTGCCGGG CTTCGCCGGG
CTGGTGCCCG GCGACATCGG TGACACCGCC GCCATCCCGC AGGGGCAGTG GTTCGGCTTC
GACCGTCCCG CGTGGCTGCC CACGACCACC CGCGCCTACG CCGAGGTCGC GGAAGTGTTC
TACGCCAAGC AGACCGAGCG TCTCGGCGCG ACCCGGGCGC AGGCGGTGGA CCTGCTGCAC
GAGGGCGGCA CCTCCGGCGG GGTCGACCTC GCCGACGCCA CCCGGGGCAT CGCGGCGGCG
ATGGAACGCG CCCACGACGA CTACCTGTGG GTGCTACAAG CCTGGTGGGA CAACCCGCTG
CCGGAAGTGC TGGCGGCCAC CGACTCCGAC CACCTCCTGC TGCTCGACCT CACCGGCGAG
GGCTGGCGCA AGACGAAGGG CTGGCACGGC AAGCCGTGGG CCCGGGGTTC GCTGACGAAC
TTCGGCGGCC GTACCGTCCT GTTCGGAGGG CTGCCCGAGA TCGCGGAGCT GCCGTCGCTG
AAGGACGACC CGAAGGCGTC CTCCCTGGTG GGTACCGCGC TGGTGGAGGA GGCGTGGCAG
GTCAACCCGG TGGTGTGGTC GCTGTTCACC CAGACCTCCT GGGCCGACGG CGATATCGAC
CTGAACGCCT GGGTCCCCGA GTACGTCGCG GCCCGGTACG GGAAGGCCCA CCCTCGGGCC
GTGCGCGCCT GGCACGGCCT GCTGGCCACC GCCTACCGCA GCATGGACGG CCGTCCCGGT
GGCGCCGAGT CGCTGTTGTG CGCGATGCCC AGTCTGGACG CCGACCGTGC CTCGATGAAC
GGCCCGCATT CGCTGCCGTA TCCAGCCGAG GCCCTTGAGG TGGCCTGGCG GGACCTGCTG
GCCGCGCGCG AGGCCCTGGG CGGCGCCGAC ACCTTCCGCT TCGACCTGGT CGACGTGACC
CGCCAGGTCA TCAGCAACCG GGCCCGGCCG CTGCTTCCGT TGCTGCGCAC GGCTTACGCC
ATGAAGGAAC TCGACCGGTT CATCGCGCTG AGCCACAGCT TCATCGACCT GTTCGAGCTG
CTGGACCCGG TGCTGGCCAC CCGCGAGGAG TTCCTCGTCG GGCGTTGGCT CGCCGACGCC
CGGGCGCTGG CCGCCGACGA GGACGAGGCC GACGCGCTGG AGTTCGACGC CCGCACCATC
ATCACCACCT GGGGCGACTC GCCCGAGTCC AGCGCCACCC TCATCGACTA CGCCAACCAC
GAGTGGGCCG GGCTGATCGC CGACTACTAC CGGCCCCGGT GGGAGAAGTA CCTCAAGTCG
CTGGAGACCG AGCTGCGGGA AGGGAAGCCG GCCGAGCCGA TCGACTTCTA CGCCGACGCG
GCGGCCTGGG CGCGGTCCCA CGACACCTAC CCGACCGAGC CGAGCGGCGA CGCGGTGTCC
AGTTGCCGGG CCGTCCACCA CGCGCTGCCG TACTTCGAGG GACTGCCGTA G
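
The gene sequence is printed in blocks of ten bases, so it needs to be de-blocked before any computation. The sketch below shows one way to strip the whitespace and compute a GC fraction; `gc_fraction` is a hypothetical helper name, and only the first line of the sequence is used as input to keep the example short.

```python
def gc_fraction(grouped_sequence: str) -> float:
    """Return the fraction of G and C bases in a whitespace-grouped sequence."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above (60 bases).
first_line = "GTGGCACGGC AGGATCTGGA ATCGGCATAC GCGGCGATGC GGCGGCTGGC GCCGCGGCTG"
print(f"{gc_fraction(first_line):.1%}")  # 73.3%
```

Passing the full sequence to the same function gives a whole-gene value that can be compared with the GC content field in the record.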
 
Protein sequence
MARQDLESAY AAMRRLAPRL WETHREKFTV DITGEAADGY RFAADGERLS LSGNDIGSVL 
TGFRHYLESS HLGHISRGGD RFTVPETLPL PERPVSRTSP HRFRYATNFT VTGYTSPYWQ
WPRWERELDL LAASGINLSL VTVGTDAVWL DTFGEFGFDE KTLLSWIAPP AHNPFHQMGC
MCGFGGVSRR LVEERAELGR RITDRMRELG IEPVLPGFAG LVPGDIGDTA AIPQGQWFGF
DRPAWLPTTT RAYAEVAEVF YAKQTERLGA TRAQAVDLLH EGGTSGGVDL ADATRGIAAA
MERAHDDYLW VLQAWWDNPL PEVLAATDSD HLLLLDLTGE GWRKTKGWHG KPWARGSLTN
FGGRTVLFGG LPEIAELPSL KDDPKASSLV GTALVEEAWQ VNPVVWSLFT QTSWADGDID
LNAWVPEYVA ARYGKAHPRA VRAWHGLLAT AYRSMDGRPG GAESLLCAMP SLDADRASMN
GPHSLPYPAE ALEVAWRDLL AAREALGGAD TFRFDLVDVT RQVISNRARP LLPLLRTAYA
MKELDRFIAL SHSFIDLFEL LDPVLATREE FLVGRWLADA RALAADEDEA DALEFDARTI
ITTWGDSPES SATLIDYANH EWAGLIADYY RPRWEKYLKS LETELREGKP AEPIDFYADA
AAWARSHDTY PTEPSGDAVS SCRAVHHALP YFEGLP
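
The gene begins with GTG while the protein begins with M. Under translation table 11, GTG can serve as an initiation codon and is then read as methionine, although it encodes valine elsewhere in the reading frame. The following is a minimal sketch of that rule, not a full implementation of the table: `translate_cds` is a hypothetical helper, and `START_CODONS` lists only some of the initiators table 11 permits.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid
# assignments are the same for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these common initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(grouped_sequence: str) -> str:
    """Translate a whitespace-grouped CDS, stopping at the first stop codon."""
    dna = "".join(grouped_sequence.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCACGGC AGGATCTGGA ATCGGCATAC"))  # MARQDLESAY
```

The output matches the first ten residues of the protein sequence listed above.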