Gene Sare_3807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3807 
Symbol 
ID5705302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4339491 
End bp4341272 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content73% 
IMG OID641273229 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001538591 
Protein GI159039338 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0222239 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACCCCA CCCCCGCCAC CTCGCCGCAG CAGCCGGAGC ACCGGCCTGC CGCAGCACCC 
GCGACCTCGG CCACCTCGGG CACCGAGCCG ACCGCCAACG TCGTCGAGGT CGAGCCGACG
GCAACCGCCC AGGCGGTCGT GCCAGCGGCG GTCCAGCCGG CGGCGGGCGA GCTGGCCCGA
CTGGCGGCTC AGGAGGCGGG CACCCAGCTC ACGCCCGCCC CGACCCGACT GGGCGACGTG
GTACCCGCAC CCGAACAGGT GCGGCCGGAG CCCGCCGCCG ACTTCACGCT GCCGGCCGAC
ACCACGATCC GGGTCAGCCC CGACCCCACC GCGCGGGCCG TCGCCGAACG CCTCGCCGAC
CTGCTCCGAC CGGCCACCGG ATACCCACTC CCGGTCGTCG AGGCCGATCA CCCCGAGCCG
GCCGGCGGCC TCGCACTCGT CCTCGCCGAA CAGCCAGCCC TCGGCCTCGA GGGCTACCAA
CTCGACGTGA CGCCGACCGG CGTCCGGATC AGCGCCGCGA CCGCGGCCGG GCTCCACCAC
GGCACCCAGA CCCTGCGGCA GCTTCTCCCC GCCGCGATCG AGAGCACCAC ACCGGTGCGC
ACCACCTGGA CGCTACCCGG CGGGTCGATC ACCGATCGCC CCCGTTTCCC ATACCGGGGC
GCCATGCTCG ACGTGGCCCG CCACTTCTTC GGAGTCGACG AGGTGCTACG GGTGATCGAC
CACCTCGCCC GGTACAAGCT CAATCACCTG CACCTGCACC TCACCGACGA CCAGGGTTGG
CGGATCGCGG TCGAATCCCG GCCGAGACTG ACCACGATCG GCGGCAGCAC CGCGGTCGGC
GACGCCCCCG GGGGGTGGTA CACCCCAGCC GACTACCAGC GGATCGTCGC GTACGCGGCC
GACCGGCACC TCACCGTCGT TCCGGAGATC GACCTGCCGG GCCACACCAA CGCCGCGCTG
ACCGCGTACC CGGAACTGGC CCCGGACGGG ACCACGCCCG CGCCCTACAC CGGCACCGAC
GTCGGGTTCA GCTACGTCGA CCCGGCCAAC GCCCGAACGT ACGAATTCGT CACCGACGTG
TTGGAGGAGG TCGCTGCCCG CACTCCCGGG CCGTTCCTGC ACATCGGTGG GGACGAGGCG
TTCAAGGTGA AGGGAACGGC GTACACCGGA TTCGTCGAGC GGGTGCAACA CATCGTGGCC
GGACTCGGCA AAACCGCCGT GGGCTGGCAC CAACTGGCTC CGGCTGCACA CAACGAGGGG
CGGGTGCTCC AGTGGTGGGG CACCGACGGT GCCGATCCGG CGACCGCCGA CGCGGTCCGT
CGGGGCGCAC GGCTGATCCT CTCCCCCGGC AACCACGCAT ACCTGGATAT GAAGTACGCC
CCGGACACCC CGATCGGGCA CGACTGGGCC GGCCTGATCG ACGTACGGCG GGCGTACGAC
TGGGATCCGG CGACCCAGGT GGCAGACGTT CCGGCAGCGG CGGTGCTGGG AGTGGAGGCC
CCGCTCTGGA CCGAGTCGGT CACCTCGCTG GCAGAGGTCG AGTTCATGCT GCTGCCCCGG
CTACCCGCCA TCGCGGAACT CGGTTGGTCG CCGCGAGCCA CCCACGACTG GGCAGCGTTC
CGCGCACGGC TGGCCGGGCA GGGGCCCCGC TGGGCGTCGG CCGGCATCGC CTTCTACCGC
TCACCCGAGA TTCCCTGGCC AGGGTCGCCT ACCGACCCGC CGGCAACGAG CGTCCCGACA
CCCGCGCCGC GTCCCCGAGA CCCGCACACC GGGCGCGGAT AG
 
Protein sequence
MHPTPATSPQ QPEHRPAAAP ATSATSGTEP TANVVEVEPT ATAQAVVPAA VQPAAGELAR 
LAAQEAGTQL TPAPTRLGDV VPAPEQVRPE PAADFTLPAD TTIRVSPDPT ARAVAERLAD
LLRPATGYPL PVVEADHPEP AGGLALVLAE QPALGLEGYQ LDVTPTGVRI SAATAAGLHH
GTQTLRQLLP AAIESTTPVR TTWTLPGGSI TDRPRFPYRG AMLDVARHFF GVDEVLRVID
HLARYKLNHL HLHLTDDQGW RIAVESRPRL TTIGGSTAVG DAPGGWYTPA DYQRIVAYAA
DRHLTVVPEI DLPGHTNAAL TAYPELAPDG TTPAPYTGTD VGFSYVDPAN ARTYEFVTDV
LEEVAARTPG PFLHIGGDEA FKVKGTAYTG FVERVQHIVA GLGKTAVGWH QLAPAAHNEG
RVLQWWGTDG ADPATADAVR RGARLILSPG NHAYLDMKYA PDTPIGHDWA GLIDVRRAYD
WDPATQVADV PAAAVLGVEA PLWTESVTSL AEVEFMLLPR LPAIAELGWS PRATHDWAAF
RARLAGQGPR WASAGIAFYR SPEIPWPGSP TDPPATSVPT PAPRPRDPHT GRG