Gene Caci_7198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7198 
Symbol 
ID8338566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8366587 
End bp8368164 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content72% 
IMG OID644960279 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003117868 
Protein GI256396304 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATAATCC CCCGCCCCGC CGAGTACACC GCGCGCTCCG GCGAGTTCGT CCTGGGACCG 
GCGTTGAACC TGGCCGCCGG TCCCGGAGCC GAACGCCCCG CCGACCTGCT CGCGGCCTAC
CTCGGCGCCG GCCGCCCGCG CACCGGCGCC GGCCCGGCCG TCGCCCTCCG CCTCGACGAC
GGCGCCCACG ACCATCCCCA CGGCTACGAC CTGCTGGTCA CCCCGGAACA GGTGACGCTC
ACGGCGCCGA GCGAAGCAGG GCTCTTCAAC GGGGTGCAGA CGCTGCGGCA ACTGCTGCCC
GCGCAGACGC TGTCAGCCGA TCCCGCCGTC CCCGCCGACG CCTGGCGCTG GCCCGCCTGC
CACGTCCGCG ACGCACCCCG CCTGGCCTGG CGCGGCGTGA TGCTCGACGT GGCACGGCAC
TTCATGCCGA TCGAGTTTCT CTACCGGCTC GCCGACGAGA TCGCGCTGCA CAAAATCAAC
ATCCTGCATC TGCACCTCAC CGACGACCAG GGCTGGCGCG TCGAGATCGA CGGCCTGCCC
CGGCTCACCG AAATCGGTTC CACCCGCACC GCATCGATGG TCGGCCGCGC CGGATCCACG
GTGTTCGACG GCGTCCCGCA CTCCGGCTAC TACACCCGCC GGGAGCTCGC CGACCTCGTC
GAGTACGCGG CGGCGCGCGG CGTCACGATC GTCCCCGAGA TCGGCATGCC CAGCCACACC
CGCGCGGCGC TCGCGGCCTA CCCGGAACTC GGCAACCACC CCGAGACCGC GCTGCCGGTG
TGGACGTCCT GGGGCATCAG CGAGGACATC CTCGCAGTGC ACGACGAAGC GCTCGACTTC
TGCCAGCACG TGCTGTCGGA CGTGATGACG CTCTTCCCGT CCCGGAACAT CCACATCGGC
GGCGACGAAT GCCGGACCGT CCAGTGGGAG GACAACGCCG ACGCCCGCCG CCGGATCGAG
CAGGAGGGTT TGCCGAACGT CTCCGAACTG CTGGGCTGGT TCCTGAGCCA GATGCACGGA
TACCTTGCCG AACACAGACG CCGCGCCGTC TGCTGGAACG ACGCGGTCGG CGTCGGCAAC
CTCGACCCGG GGGTGGTCGC GACGGCCTGG CTGAAGCCGG AACACGCCGC GGAGGCGATA
GCGCGCGGCC ACCAAGTGAT CGTCGCCCCG CACGAACACA CCTACCTGGA CTACCGCCAG
ACCAGCCACC CCGCCGAGCC GCCCTCGGCG GACGACCGCG TCCTGACCCT CGCCGAGGCC
TACTCCTTCG ACCCGCTGCC GACCGATCTG TCCGCCCTCG GCGTCGCCGC CTTGACCGAC
GCCTCCGGCG GACCCGGCGT CCTGGGAACC CAGGCGCAGC TGTGGACCGA GTCCGCACCC
ACGACCGAGG TTGTCAGGCA TCTGCTGTAC CCGCGCCTGT GCGCCCTGGC CGAGGGCGCG
TGGAGCGACG AGCGCCGCGA TCCCGCCGAC TTCGCCCGCC GCCTCCAGCA CCACCTGCTC
CGGCTGGACG CGCTCGGCGC GCTGCCCGCC GCCCGGCCGG ACGGCTGGGA CCCGGGCGCC
GGGCTGTCAA TGCCCTGA
 
Protein sequence
MIIPRPAEYT ARSGEFVLGP ALNLAAGPGA ERPADLLAAY LGAGRPRTGA GPAVALRLDD 
GAHDHPHGYD LLVTPEQVTL TAPSEAGLFN GVQTLRQLLP AQTLSADPAV PADAWRWPAC
HVRDAPRLAW RGVMLDVARH FMPIEFLYRL ADEIALHKIN ILHLHLTDDQ GWRVEIDGLP
RLTEIGSTRT ASMVGRAGST VFDGVPHSGY YTRRELADLV EYAAARGVTI VPEIGMPSHT
RAALAAYPEL GNHPETALPV WTSWGISEDI LAVHDEALDF CQHVLSDVMT LFPSRNIHIG
GDECRTVQWE DNADARRRIE QEGLPNVSEL LGWFLSQMHG YLAEHRRRAV CWNDAVGVGN
LDPGVVATAW LKPEHAAEAI ARGHQVIVAP HEHTYLDYRQ TSHPAEPPSA DDRVLTLAEA
YSFDPLPTDL SALGVAALTD ASGGPGVLGT QAQLWTESAP TTEVVRHLLY PRLCALAEGA
WSDERRDPAD FARRLQHHLL RLDALGALPA ARPDGWDPGA GLSMP