Gene Saro_1091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1091 
Symbol 
ID3916387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1134430 
End bp1135497 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content68% 
IMG OID640443826 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_496370 
Protein GI87199113 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGGCGCTC CGGGCCTTGC AAATGGCCAT GGCATCCGCG AATCCACCAC CATGCTTCCG 
GCGATCTTCG GGCTTTCGGG CCTGACCCTG ACTGAAGACG AACGCGCCTT CTTCCGCGAT
GCGGACCCGG CGGGATATAT CCTGTTCGGT CGGAATGTGG AAAGCCCGGC GCAGCTTCGC
GCCCTGACGG ACGAGCTGCG CGCGCTGCAT GGCCGTGACC GCACTTTCAT CTGCATCGAT
CAGGAAGGCG GGCGCGTCGC GCGGATGAAG CCGCCGGTCT GGCAACCCTA TCCTCCCGGC
GAGCGGTTCG ACCGGCTCTA CGACATCGCC CCTGCCAGCG CGATCGAGGC TGCGCGCGCC
AATGCCGAAG CGCTGGGCCT CGATCTGGCG GAAGCGGGGA TCAGCGTCGA TTGCCTGCCG
CTGCTCGACG TCCGCCAGCC GGGCGCGCAC GACGTCATCG GGGACCGTGC GCTCGGTTCG
GAACCGATGC GCGTTGCGGC GCTCGGAAGG GCAACGCTCG ATGGGCTGGC GCGCGCGGGA
ATTGCGGGCG TGGTCAAGCA CATGCCGGGC CATGGCCGCG CGCTGGTCGA TAGCCACAAG
GAACTGCCCA CGGTCTCTGC CAGCGCCGAG GAGCTGGAAA TGGACCTCGC TCCGTTCCGC
GCCCTGCGCG ATGCCACCAT CGGCATGACC GCGCACCTGC GGTTCCTCGC ATGGGATGAC
TGGAACCCGG CGACGCACTC GCCCTTCGTC ATCGAGGAGA TCATCCGCAA GGCGATCGGC
TTCGACGGGC TGCTCCTGAC CGACGATCTC GATATGCAGG CGCTTGGCGG CACCGTGCCC
GAACGCGCGG CGCGCGCGCA GGCTGCGGGC TGCGACATCG CCTTGAATTG CTGGGCGAAG
ATGGATGACA TGGTCGGCAT CGCGAACAGC CTCGCGCCCA TGTCTGACAA AGTGATGCAG
CGGCTGGAAC GCGCGCTCGC GCCCACCGCG GCCTTCGACG CTCCAGCCGA CATGACCGCT
CAGGCCGCGC TTTTCGACAA GCGCGACCGG TTGCTGGAAC TGGCCTGA
 
Protein sequence
MGAPGLANGH GIRESTTMLP AIFGLSGLTL TEDERAFFRD ADPAGYILFG RNVESPAQLR 
ALTDELRALH GRDRTFICID QEGGRVARMK PPVWQPYPPG ERFDRLYDIA PASAIEAARA
NAEALGLDLA EAGISVDCLP LLDVRQPGAH DVIGDRALGS EPMRVAALGR ATLDGLARAG
IAGVVKHMPG HGRALVDSHK ELPTVSASAE ELEMDLAPFR ALRDATIGMT AHLRFLAWDD
WNPATHSPFV IEEIIRKAIG FDGLLLTDDL DMQALGGTVP ERAARAQAAG CDIALNCWAK
MDDMVGIANS LAPMSDKVMQ RLERALAPTA AFDAPADMTA QAALFDKRDR LLELA