Gene Arth_2890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2890 
Symbol 
ID4444447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3255356 
End bp3257017 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content72% 
IMG OID639690713 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_832369 
Protein GI116671436 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.301856 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCGA CCGCAGCCTG CACCGGTGTG CCTTCTCCGC CCACCTCGGG AACGCCGTCG 
GGAACGTCCC CCGGCACGGC ATCACCGTCC GGCGGAGCCG TGCGGTCCGA ACCCGGTGCT
TCGGGAGCCC CGCGATCCTC CGGGCCTCCC ACCCCGGCAC CTGGGGAGCG CCAGCTGGGC
TGGGGGCCGC AGCAGCAGGA TGAAGACTCG GCCCGCGCCG CTGTCGCAGC AATGAGCCTT
GAACAGAAGG CCGGGCAGGT GATGATGCCG TTCTTTACCG GAACGGATTT TGCTTCGCAC
GCGGCAACCA TGGAACGGCT GCACCTGGGC GGCGCGATCA TCATGGGTGA CAACGTGCCC
CTCTCCGCCG ACGGAACCGT GGACACCGCC GCCATGGCGG CAGGCATCAG CCGCCTCCGG
AACGCGGCCA AGGCGGACGG ACGCACCTGG CCTGCACTGG TCGGGGTGGA CCAGGAGGGC
GGGGTCGTGG CGAGGCTGCG CGCTCCGTTG ACGGAATGGC CGGCACCCAT GAGCTTCGGC
GCCGCAGGGA ACGTCGGGCT GGCAACCGAC GCCGGCAAGG CGCTCGCGGC GGAGCTGGCC
GGGCTGGGCT TCACTGCGGA TTTCGCGCCG GACACGGATG TCACAGCGGG GCCGCAGGAT
CCGACGATCG GCGCCAGGGC CATGTCCGGC GACCCGGACG CGGCAGCAAG CCTGGGCGTC
GGCTTTGCCC AGGGAATGCT GGCGGCCGGG ATCCTGCCTT CCGCCAAGCA CTTCCCGGGG
CACGGCTCCG TTGCCGTCGA CTCGCACGAG AACCTGCCGG TGCAGAAAGC AACGGTGGCG
CAGCTCCGCG CGAAGGACTG GAAACCTTTC CAGGCCGCCA TCGATGCCGG GCTGCCCATG
ATCATGACCG GCCACATCTC CGTGCCGGCC CTGGAACCGG GGGTTCCGGC GTCGTTGTCC
AAACAGAGCT ACGCCACCCT CCGCGGCATG GGTTTCAAGG GCGTTGCCGT GACCGATGCA
CTCAACATGG GTGCGATCAC GAAGCAATAC CCCGGGGAGT CCGCCGCACC GCTGGCCCTG
GCAGCGGGGG CGGATCTGCT GCTCATGCCC GGGGACGTGG CCGCCGCCCA CGCGGCAGTG
GTCAGCGCCG TCAAGACCGG CGCGCTCCCG GCGTCGCGCC TCAACGACGC GGCACAGCGG
GTGGTGACCA TGATGATTTG GCGCGCACGG ACCCCTGCTC CACAGGGTGC AGCCCCGGGA
AGCGGCTCTG CCCTCTCCGA ACGTATTTCC GCGGCCGCGG TGACCGTCCT GGCCGGACCG
TGCCACGGGC CGGTGGTGCC CGGCAGCGTC CGCGTTGCCG GCGGCAGTGA ACAGGACCGT
GCCCGGTTTG CCCGGGCTGC CCGGGCTGCC GGCATTACCC TCGGCGCCGG GCCGCTCGTG
ACGCTGATCG GCTATGAAGG GCCGCCCGCC ACGGGCGACG TCGTGGTGGC CCTTGATGCG
CCGTGGCCGC TGGCCGGCTC GACGGCACCC GCCAAGGTGG CCCTCTACGG GCGGAGCCAG
GAGGCCTTCA ACGCCCTGGT TGCCGTTCTG GCGGGCAAGG CGCCGGCACC CGGAAAGCTG
CCTGCCGCCG TCGGCCCCCA CGCCCCCGGA AGCGGGTGCT GA
 
Protein sequence
MLATAACTGV PSPPTSGTPS GTSPGTASPS GGAVRSEPGA SGAPRSSGPP TPAPGERQLG 
WGPQQQDEDS ARAAVAAMSL EQKAGQVMMP FFTGTDFASH AATMERLHLG GAIIMGDNVP
LSADGTVDTA AMAAGISRLR NAAKADGRTW PALVGVDQEG GVVARLRAPL TEWPAPMSFG
AAGNVGLATD AGKALAAELA GLGFTADFAP DTDVTAGPQD PTIGARAMSG DPDAAASLGV
GFAQGMLAAG ILPSAKHFPG HGSVAVDSHE NLPVQKATVA QLRAKDWKPF QAAIDAGLPM
IMTGHISVPA LEPGVPASLS KQSYATLRGM GFKGVAVTDA LNMGAITKQY PGESAAPLAL
AAGADLLLMP GDVAAAHAAV VSAVKTGALP ASRLNDAAQR VVTMMIWRAR TPAPQGAAPG
SGSALSERIS AAAVTVLAGP CHGPVVPGSV RVAGGSEQDR ARFARAARAA GITLGAGPLV
TLIGYEGPPA TGDVVVALDA PWPLAGSTAP AKVALYGRSQ EAFNALVAVL AGKAPAPGKL
PAAVGPHAPG SGC