Gene Snas_2451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_2451 
Symbol 
ID8883646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp2597595 
End bp2599340 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content70% 
IMG OID 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003511227 
Protein GI291299949 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0235141 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAGAC ATCTCATCGT CACGGCAGCG GTCGTGCTCA TCCTCGGCGC TTCGGTGACG 
ACCGCTCTGG CCCTGCCCGG AGACGACGCC GACCGGTTGA TCACCGGACC GGCCTCCGAC
CCGGCGGCCG AGAAGGTGGC CCGCGACGCG CTGGCGGACC TTCCCTCGGA TGTGGTGGTG
CGCATCGGTA CCCGGCACAC CGATGCGAAA CTCGCCGCCG AGGGCTTCGA GATCCGCGTC
GAGTCGGGCG AGATCCTGCT CGACGGCGCC GACGCGGCCG GTGTCTTCTA CGGCGCGCAG
GAGCTGCGGG AGCGCGCCCG CTCGGGCCGC CTGGACGACG GGGTGATCCG GCAGGAGCCG
TCCATGCGCT ACCGGGGCCT CATCGAGGGC TTCTACGGCA CGCCGTGGAC CCACGCTGAG
CGGCTCGACC TGATGGACTA CCTCGGCAGC CACCGGATGA ACACCTACGC CTACGCGCCC
AAGGACGACC CGTACCACCG GGAGAAGTGG CGCGAGCCCT ACCCGGCCGA CAAGCTCGCC
GAACTGGGGG AGCTCGTCGA ACGCGCCCAG GCCAACCACG TGGACTTCGC GTTCGCGCTG
TCGCCGGGAC TGTCGATCTG TTACACCTCC CAGGACGACT ACGACGCCCT CATCGCGAAG
TTCGACTCGC TGTACGACCT CGGGGTGCGG CAGTTCAACA TCCCGCTGGA CGACATCGAC
TACGACACCT GGCACTGCGA CGGCGACGCC GAGGAGTACG GCAGCGGCCC GGCCGCCGCC
GGACGGGCCC AGGCCGAACT GCTCACCCGG GTGCAGACCG AATGGGCCGC GAACAAGGAC
GGCGTCGCCC CGATGCAGAT GGTGCCCACC GAGTACTTCG ACAACGAGGA CTCCCCGTAC
AAGGAAGCCC TGCGCGACAT GCACTCCGAC GTCGTCGTCA TGTGGACCGG CGTCGGCGTG
ATCCCCACGA CCATCACCCG CGAGCAGGCG GCGCGGGCCC GCGAGGTCTT CGGGCACGAG
ATCCTGATCT GGGACAACTA CCCGGTCAAC GACTACATCG CCGGACGGCT CCCGCTTGGC
GCCTACACCG GACGCGAGAA CGGCCTGTCG GCCGAGGTCA GCGGCGTCAT CTCCAACCCG
ATGAACCAGC CCGAGGTCAG CAAGGTGGCG CTGTACTCCT TCGGCGAGTA CGGCTGGGAC
GACGAGTCCT ACCAGGCGGA GGAGTCGTGG GAGCGGGCGT TGTCCGAGGC TGCCGGGGGA
TCGGAGGAGG TCGTCGACGC CCTGCGGCGC TTCGCGGATC TCAACCAGTA CGACGAGACC
CTGCACCAGG AACCGGCTCC GGAACTGGCC GCCGCCCTGG ACGCGTTCTG GGAGGCCTGG
GACGCCGGGG ACCACGACGC CGCCGCTGAG CGGTTCGACG CGGTGCTGGC CGACCTGGCC
GCGGCGCCCG ACGTGATCCG CGAGGGCGCG GCCGACCCGG CCTTCGCCGA ACAGGCCAGG
GCCTGGCTCG ACGCGGCCGA ACTGTGGGTC TCGGCGATGC GGCACTCGCT GACCGGCATG
GTCCGGGAGG TCAACGGCGA CCCCGAGGGC GCGTGCGAGG CGATCGCCGA GGCCACCTCG
GATGTGGAGG CGGCCAAGGA GATCCGCGAC GACCGCGAAC CGCACTCCAC GACCCACCCC
CGTATCGCCG ACGGCGTCGC CGACGCCTAC ATCGACGCCG CCGCCGGACA CGCGGGCTGC
GGCTGA
 
Protein sequence
MQRHLIVTAA VVLILGASVT TALALPGDDA DRLITGPASD PAAEKVARDA LADLPSDVVV 
RIGTRHTDAK LAAEGFEIRV ESGEILLDGA DAAGVFYGAQ ELRERARSGR LDDGVIRQEP
SMRYRGLIEG FYGTPWTHAE RLDLMDYLGS HRMNTYAYAP KDDPYHREKW REPYPADKLA
ELGELVERAQ ANHVDFAFAL SPGLSICYTS QDDYDALIAK FDSLYDLGVR QFNIPLDDID
YDTWHCDGDA EEYGSGPAAA GRAQAELLTR VQTEWAANKD GVAPMQMVPT EYFDNEDSPY
KEALRDMHSD VVVMWTGVGV IPTTITREQA ARAREVFGHE ILIWDNYPVN DYIAGRLPLG
AYTGRENGLS AEVSGVISNP MNQPEVSKVA LYSFGEYGWD DESYQAEESW ERALSEAAGG
SEEVVDALRR FADLNQYDET LHQEPAPELA AALDAFWEAW DAGDHDAAAE RFDAVLADLA
AAPDVIREGA ADPAFAEQAR AWLDAAELWV SAMRHSLTGM VREVNGDPEG ACEAIAEATS
DVEAAKEIRD DREPHSTTHP RIADGVADAY IDAAAGHAGC G