Gene Information |
Locus tag | Sare_3807 |
Symbol | |
ID | 5705302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4339491 |
End bp | 4341272 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641273229 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001538591 |
Protein GI | 159039338 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0222239 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCACCCCA CCCCCGCCAC CTCGCCGCAG CAGCCGGAGC ACCGGCCTGC CGCAGCACCC GCGACCTCGG CCACCTCGGG CACCGAGCCG ACCGCCAACG TCGTCGAGGT CGAGCCGACG GCAACCGCCC AGGCGGTCGT GCCAGCGGCG GTCCAGCCGG CGGCGGGCGA GCTGGCCCGA CTGGCGGCTC AGGAGGCGGG CACCCAGCTC ACGCCCGCCC CGACCCGACT GGGCGACGTG GTACCCGCAC CCGAACAGGT GCGGCCGGAG CCCGCCGCCG ACTTCACGCT GCCGGCCGAC ACCACGATCC GGGTCAGCCC CGACCCCACC GCGCGGGCCG TCGCCGAACG CCTCGCCGAC CTGCTCCGAC CGGCCACCGG ATACCCACTC CCGGTCGTCG AGGCCGATCA CCCCGAGCCG GCCGGCGGCC TCGCACTCGT CCTCGCCGAA CAGCCAGCCC TCGGCCTCGA GGGCTACCAA CTCGACGTGA CGCCGACCGG CGTCCGGATC AGCGCCGCGA CCGCGGCCGG GCTCCACCAC GGCACCCAGA CCCTGCGGCA GCTTCTCCCC GCCGCGATCG AGAGCACCAC ACCGGTGCGC ACCACCTGGA CGCTACCCGG CGGGTCGATC ACCGATCGCC CCCGTTTCCC ATACCGGGGC GCCATGCTCG ACGTGGCCCG CCACTTCTTC GGAGTCGACG AGGTGCTACG GGTGATCGAC CACCTCGCCC GGTACAAGCT CAATCACCTG CACCTGCACC TCACCGACGA CCAGGGTTGG CGGATCGCGG TCGAATCCCG GCCGAGACTG ACCACGATCG GCGGCAGCAC CGCGGTCGGC GACGCCCCCG GGGGGTGGTA CACCCCAGCC GACTACCAGC GGATCGTCGC GTACGCGGCC GACCGGCACC TCACCGTCGT TCCGGAGATC GACCTGCCGG GCCACACCAA CGCCGCGCTG ACCGCGTACC CGGAACTGGC CCCGGACGGG ACCACGCCCG CGCCCTACAC CGGCACCGAC GTCGGGTTCA GCTACGTCGA CCCGGCCAAC GCCCGAACGT ACGAATTCGT CACCGACGTG TTGGAGGAGG TCGCTGCCCG CACTCCCGGG CCGTTCCTGC ACATCGGTGG GGACGAGGCG TTCAAGGTGA AGGGAACGGC GTACACCGGA TTCGTCGAGC GGGTGCAACA CATCGTGGCC GGACTCGGCA AAACCGCCGT GGGCTGGCAC CAACTGGCTC CGGCTGCACA CAACGAGGGG CGGGTGCTCC AGTGGTGGGG CACCGACGGT GCCGATCCGG CGACCGCCGA CGCGGTCCGT CGGGGCGCAC GGCTGATCCT CTCCCCCGGC AACCACGCAT ACCTGGATAT GAAGTACGCC CCGGACACCC CGATCGGGCA CGACTGGGCC GGCCTGATCG ACGTACGGCG GGCGTACGAC TGGGATCCGG CGACCCAGGT GGCAGACGTT CCGGCAGCGG CGGTGCTGGG AGTGGAGGCC CCGCTCTGGA CCGAGTCGGT CACCTCGCTG GCAGAGGTCG AGTTCATGCT GCTGCCCCGG CTACCCGCCA TCGCGGAACT CGGTTGGTCG CCGCGAGCCA CCCACGACTG GGCAGCGTTC CGCGCACGGC TGGCCGGGCA GGGGCCCCGC TGGGCGTCGG CCGGCATCGC CTTCTACCGC TCACCCGAGA TTCCCTGGCC AGGGTCGCCT ACCGACCCGC CGGCAACGAG CGTCCCGACA CCCGCGCCGC GTCCCCGAGA CCCGCACACC GGGCGCGGAT AG
Protein sequence | MHPTPATSPQ QPEHRPAAAP ATSATSGTEP TANVVEVEPT ATAQAVVPAA VQPAAGELAR LAAQEAGTQL TPAPTRLGDV VPAPEQVRPE PAADFTLPAD TTIRVSPDPT ARAVAERLAD LLRPATGYPL PVVEADHPEP AGGLALVLAE QPALGLEGYQ LDVTPTGVRI SAATAAGLHH GTQTLRQLLP AAIESTTPVR TTWTLPGGSI TDRPRFPYRG AMLDVARHFF GVDEVLRVID HLARYKLNHL HLHLTDDQGW RIAVESRPRL TTIGGSTAVG DAPGGWYTPA DYQRIVAYAA DRHLTVVPEI DLPGHTNAAL TAYPELAPDG TTPAPYTGTD VGFSYVDPAN ARTYEFVTDV LEEVAARTPG PFLHIGGDEA FKVKGTAYTG FVERVQHIVA GLGKTAVGWH QLAPAAHNEG RVLQWWGTDG ADPATADAVR RGARLILSPG NHAYLDMKYA PDTPIGHDWA GLIDVRRAYD WDPATQVADV PAAAVLGVEA PLWTESVTSL AEVEFMLLPR LPAIAELGWS PRATHDWAAF RARLAGQGPR WASAGIAFYR SPEIPWPGSP TDPPATSVPT PAPRPRDPHT GRG