Gene Bcav_3043 details


Gene Information

Locus tag: Bcav_3043
Symbol:
ID: 7858503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beutenbergia cavernae DSM 12333
Kingdom: Bacteria
Replicon accession: NC_012669
Strand:
Start bp: 3394777
End bp: 3396546
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 75%
IMG OID: 643867140
Product: Glycoside hydrolase, family 20, catalytic core
Protein accession: YP_002883049
Protein GI: 229821523
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
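
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(3394777, 3396546)
print(length, protein_length_aa(length))  # prints: 1770 589
```

Both values agree with the Gene Length and Protein Length fields in the record.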


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00805232
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCAGCAG GACTGAACTC CCTCTTCCCC CGTCCGCGCA GCATCACGTC GCTCGACGGC 
GACGACGTCC CCGCCGGCGT CGTCGCCGTC GAGACGCCCG ACGCGTCGCT GCCCCCGCAG
GGCTACGTCC TGCGCGTGGC CGACGGCGAG GTGCGGCTCG CGTACGCCGA CGACGCCGGA
CGGCGGTACG GCCGGGCGAC GCTCGCCCAG CTCGCCGCGG GCGGCGAAGC CCTTCCCGCC
GTCGAGATCA CCGACTGGCC GGACCTCCCG GACCGCGGGT TCATGCTCGA CGTCAGCCGC
GACCGCGTCC CCACCCGGGA GACGCTGACC CGGATCCTGG ACCTGCTCGA GGCGGCGCGG
TACACGCAGC TCCAGCTGTA CACGGAGCAC ACGTTCGCGT ACGCGGGGCA CGAGGAGGTG
TGGGCCGACG CGTCGCCTCT GACGCCCGAC GACGTCCGGT GGCTCGACGC CTCCTGCGCC
GCGCGCGGGA TCGAGCTCGT GCCGAACCAG AACGTGTTCG GGCACATGGA GCGGTGGCTC
GCCCACGACA CCTACGCCGA CCGCGCCGAG TCCCCGGGCG GGTACACGCT CGCGGGGTCG
GTCCGGAAGC CGGCGGTGCT GGAGCCGACG GCGGACAACG CGGCGTTCGC GCTCGGCCTG
CTGGAGGAGC TGCTGCCGAA CTTCTCCTCC CGGCGCGTCA ACATCGGCGC CGACGAGACG
TTCGAGCTCG GCCTCGGGCG TTCGCGGGAC CGCGTGGCCG CCGAGGGGCG CGGTGCGGTG
TACGTGGAGT ACGTGCAGCG CATCCTGGGC CCGCTCGTCG AGCGCGGGTA CGAGGTGCAG
TACTGGGCCG ATGTGCTCGC CCACCACCCG GAGTACGCGG CCGCCCTGGG CGGCGTGCCG
ATCGTGTGGC TGTACGACTC GCCGTCCGCG ATCGAGCGGG CGCTCGACCT GCCCGCCGCG
ACCAAGGAGC GGATGGCCGC GTTCGACGCC TCCCCCGAGC AACAGCTCGG AGGGTTCGCG
ACTCGCGGCG CCGCCGTCAT CGCGGCCGGC ATCCCGTTCT GGGTGGCGCC CGGCACGGGC
ACCTGGCAGT CCATCGTGGG ACGGCTGGAC AACGCGCGCG AGAACATCCT CGACGCCGCG
ACCACGGCGC TCGCGCACGG CGGCACGGGC TTCCTGCTCA CCCAGTGGGG CGACCACGGG
ATGGTCGAGC CGCCGCCGGT CGCGTTCGCA CCGCTGCTGT ACGGGGGCGC CGTCGCGTGG
TGCTCCGCGG CGAACGCCGA CCTCGACCTG GCGACGACGA CGGCGGAGCT CGCCTTCGGC
GACGCGACGC ACCGCACGGG CGACGCGCTC GTCCGGCTCG GGACGCTCGG CACCGACCTG
GGGATGCCGG CGCTCAACGC GACGCTGCTG TTCGCGTCGC TGTTCCCGGG TCGCTCCGGG
ATGGTGACGA CCGACGGGTT GACGGCCGAC GCCGTCGAGC GCGTGCTCGG CACGATCGAC
GCGCAGCTCG CGAGCCTCGA CCTCGCGACG CCGCAGGGCC CCGACGCCGA CCTGGTGCTC
GCGGAGCTCC AGCACGCCGC GCGGCTCGCC CGGTTCGGGG CGCGGGTGCT GCTCGCGGAC
GTCGGTGGCG GTGCGCTGCC GTCGGTGGCG GAGCTCGACG ACCTGCTCGT GACCCAGCGG
GAGCTGTGGC TGGCGCGCTC GCGGCCCGGT GGGCTCTCCG ACTCCCTCGC GAAGTTCGGG
CCGCTGCGGG AGAAGCTCGC CGCCGGCTGA
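
The gene sequence is laid out in blocks of 10 bases, so the spaces and line breaks have to be removed before any analysis. A minimal sketch of that clean-up and of a GC-fraction calculation follows; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "GTGGCAGCAG GACTGAACTC CCTCTTCCCC CGTCCGCGCA GCATCACGTC GCTCGACGGC"
seq = clean_sequence(first_row)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Running the same two functions on the full 1770 bp sequence gives a value to compare with the GC content field.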
 
Protein sequence
MAAGLNSLFP RPRSITSLDG DDVPAGVVAV ETPDASLPPQ GYVLRVADGE VRLAYADDAG 
RRYGRATLAQ LAAGGEALPA VEITDWPDLP DRGFMLDVSR DRVPTRETLT RILDLLEAAR
YTQLQLYTEH TFAYAGHEEV WADASPLTPD DVRWLDASCA ARGIELVPNQ NVFGHMERWL
AHDTYADRAE SPGGYTLAGS VRKPAVLEPT ADNAAFALGL LEELLPNFSS RRVNIGADET
FELGLGRSRD RVAAEGRGAV YVEYVQRILG PLVERGYEVQ YWADVLAHHP EYAAALGGVP
IVWLYDSPSA IERALDLPAA TKERMAAFDA SPEQQLGGFA TRGAAVIAAG IPFWVAPGTG
TWQSIVGRLD NARENILDAA TTALAHGGTG FLLTQWGDHG MVEPPPVAFA PLLYGGAVAW
CSAANADLDL ATTTAELAFG DATHRTGDAL VRLGTLGTDL GMPALNATLL FASLFPGRSG
MVTTDGLTAD AVERVLGTID AQLASLDLAT PQGPDADLVL AELQHAARLA RFGARVLLAD
VGGGALPSVA ELDDLLVTQR ELWLARSRPG GLSDSLAKFG PLREKLAAG
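
The protein sequence can be reproduced from the gene sequence with translation table 11, which uses the standard codon assignments but permits alternative start codons such as GTG. The sketch below is an illustration under two assumptions: the input is an in-frame CDS, and its first codon is a valid initiator that is read as methionine. The function name is hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11),
# listed in TCAG order for the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS; the first codon is assumed to be a start."""
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    if not codons:
        return ""
    protein = ["M"]  # an initiator codon (here GTG) is read as Met
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_row = "GTGGCAGCAG GACTGAACTC CCTCTTCCCC CGTCCGCGCA GCATCACGTC GCTCGACGGC"
print(translate_cds(first_row))  # prints: MAAGLNSLFPRPRSITSLDG
```

The output matches the first 20 residues of the protein sequence listed in the record.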