Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2890 |
Symbol | |
ID | 4444447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3255356 |
End bp | 3257017 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639690713 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_832369 |
Protein GI | 116671436 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.301856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGCGA CCGCAGCCTG CACCGGTGTG CCTTCTCCGC CCACCTCGGG AACGCCGTCG GGAACGTCCC CCGGCACGGC ATCACCGTCC GGCGGAGCCG TGCGGTCCGA ACCCGGTGCT TCGGGAGCCC CGCGATCCTC CGGGCCTCCC ACCCCGGCAC CTGGGGAGCG CCAGCTGGGC TGGGGGCCGC AGCAGCAGGA TGAAGACTCG GCCCGCGCCG CTGTCGCAGC AATGAGCCTT GAACAGAAGG CCGGGCAGGT GATGATGCCG TTCTTTACCG GAACGGATTT TGCTTCGCAC GCGGCAACCA TGGAACGGCT GCACCTGGGC GGCGCGATCA TCATGGGTGA CAACGTGCCC CTCTCCGCCG ACGGAACCGT GGACACCGCC GCCATGGCGG CAGGCATCAG CCGCCTCCGG AACGCGGCCA AGGCGGACGG ACGCACCTGG CCTGCACTGG TCGGGGTGGA CCAGGAGGGC GGGGTCGTGG CGAGGCTGCG CGCTCCGTTG ACGGAATGGC CGGCACCCAT GAGCTTCGGC GCCGCAGGGA ACGTCGGGCT GGCAACCGAC GCCGGCAAGG CGCTCGCGGC GGAGCTGGCC GGGCTGGGCT TCACTGCGGA TTTCGCGCCG GACACGGATG TCACAGCGGG GCCGCAGGAT CCGACGATCG GCGCCAGGGC CATGTCCGGC GACCCGGACG CGGCAGCAAG CCTGGGCGTC GGCTTTGCCC AGGGAATGCT GGCGGCCGGG ATCCTGCCTT CCGCCAAGCA CTTCCCGGGG CACGGCTCCG TTGCCGTCGA CTCGCACGAG AACCTGCCGG TGCAGAAAGC AACGGTGGCG CAGCTCCGCG CGAAGGACTG GAAACCTTTC CAGGCCGCCA TCGATGCCGG GCTGCCCATG ATCATGACCG GCCACATCTC CGTGCCGGCC CTGGAACCGG GGGTTCCGGC GTCGTTGTCC AAACAGAGCT ACGCCACCCT CCGCGGCATG GGTTTCAAGG GCGTTGCCGT GACCGATGCA CTCAACATGG GTGCGATCAC GAAGCAATAC CCCGGGGAGT CCGCCGCACC GCTGGCCCTG GCAGCGGGGG CGGATCTGCT GCTCATGCCC GGGGACGTGG CCGCCGCCCA CGCGGCAGTG GTCAGCGCCG TCAAGACCGG CGCGCTCCCG GCGTCGCGCC TCAACGACGC GGCACAGCGG GTGGTGACCA TGATGATTTG GCGCGCACGG ACCCCTGCTC CACAGGGTGC AGCCCCGGGA AGCGGCTCTG CCCTCTCCGA ACGTATTTCC GCGGCCGCGG TGACCGTCCT GGCCGGACCG TGCCACGGGC CGGTGGTGCC CGGCAGCGTC CGCGTTGCCG GCGGCAGTGA ACAGGACCGT GCCCGGTTTG CCCGGGCTGC CCGGGCTGCC GGCATTACCC TCGGCGCCGG GCCGCTCGTG ACGCTGATCG GCTATGAAGG GCCGCCCGCC ACGGGCGACG TCGTGGTGGC CCTTGATGCG CCGTGGCCGC TGGCCGGCTC GACGGCACCC GCCAAGGTGG CCCTCTACGG GCGGAGCCAG GAGGCCTTCA ACGCCCTGGT TGCCGTTCTG GCGGGCAAGG CGCCGGCACC CGGAAAGCTG CCTGCCGCCG TCGGCCCCCA CGCCCCCGGA AGCGGGTGCT GA
|
Protein sequence | MLATAACTGV PSPPTSGTPS GTSPGTASPS GGAVRSEPGA SGAPRSSGPP TPAPGERQLG WGPQQQDEDS ARAAVAAMSL EQKAGQVMMP FFTGTDFASH AATMERLHLG GAIIMGDNVP LSADGTVDTA AMAAGISRLR NAAKADGRTW PALVGVDQEG GVVARLRAPL TEWPAPMSFG AAGNVGLATD AGKALAAELA GLGFTADFAP DTDVTAGPQD PTIGARAMSG DPDAAASLGV GFAQGMLAAG ILPSAKHFPG HGSVAVDSHE NLPVQKATVA QLRAKDWKPF QAAIDAGLPM IMTGHISVPA LEPGVPASLS KQSYATLRGM GFKGVAVTDA LNMGAITKQY PGESAAPLAL AAGADLLLMP GDVAAAHAAV VSAVKTGALP ASRLNDAAQR VVTMMIWRAR TPAPQGAAPG SGSALSERIS AAAVTVLAGP CHGPVVPGSV RVAGGSEQDR ARFARAARAA GITLGAGPLV TLIGYEGPPA TGDVVVALDA PWPLAGSTAP AKVALYGRSQ EAFNALVAVL AGKAPAPGKL PAAVGPHAPG SGC
|
| |