Gene Caci_4810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4810 
Symbol 
ID8336164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5479676 
End bp5481277 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content69% 
IMG OID644957910 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003115512 
Protein GI256393948 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.48578 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGGA CCGCAGCGGC TCTCAGCACC TGCGCCGCCC TCCTTGCTCT GGCCGCGTGC 
GGCGGCGGGA GCAGCGGGAC GAAGAGTTCC GTCCCGCCTG CCAGCACGAA CGGCAGTAGT
TCCGCGCCGT CAGTCCCCTT GTCGCCGGCA GATCCGGCCG CGCTGGAGCG GGTCGTTCCC
GAGCCGGCCG GCATCACGGC GGCTGCCGGG ACGTTCACCC TGACGAGTGC CACCGCGATC
CACGCCTCGT CCGGCGCCGA GCCGGTCGCC GCCGACCTCG CCGCGTATCT CAAGCAGCAG
ACCGGGCTGG CGCCGGCTGT ATCGCAAAGC CCTGATGCGG CGATCCAGCT TGTGCTGCAG
CCCAGCGGCG GCGATCCGTC GCTGGGCACC GAGGGCTACA CGCTGGTGAT CGGTCCGAGC
TCGGTCAAGC TCACGGCGGC TACTGACGCA GGGCTCTTCC ATGGTGTGCA GACCGTGCGG
CAGCTGCTTG TCGGCGCCAA GCTCCAGGAC GGGACGATCA CCGACCACCC GCGATTCGCT
TATCGCGGCG TGATGTTGGA TGTGGCGCGG CACTTCTACA GCGTCGCGGA CGTGAAGGCT
TATATCGACG CCGCCGCGTT GTACAAGGTC AACGAGTTCC ACCTGCACCT GACCGACGAC
CAGGGCTGGC GGTTCGCCGT GCCGGGGTGG CCGAAGCTGA CGTCGGTGGG CGCGGCGACG
CAGGTCGGCG GCGGCGTCGG CGGGTCGTAT TCGGCGGCTG ATCTGAAGGA GATCGTCGAT
TACGCGGCGT CGCGCTACAT GACCGTGATT CCGGAGATCG ACATGCCGGG GCACGTCGGC
GCTGCGGTGT ACGCCTACCC TTCGCTGGCG TGCGACGGTC GGCACCACGG TCCGGTGACG
AGCGTATCGC CGGCGTACGA CTCGCTGTGC ACGTCGAGCG AATCAACATA CAGATTTGTC
GATACAGCGA CCAAAGCCGC CGCCGACGCC ACCCCCGGCG CGACCTACCT GCACATGGGC
GGCGACGAGG CGCAGGCACT GAGCCTGACG CAGTACAACG CCTTCGTCGC GAAGACACAG
AATCTCGTGG CAGGGCACGA TCGCACGCCG ATCGCCTGGG CCGAAGCCGG TACCGCAACC
CTGCTGCCGC AGACGGTGCT GGAGTACTGG AACACCGCGC AGCCGCAGCC CTACGTCCTC
CAGGCCGCCG CCAAGGGCAC CAAGCTCATC ATGGCGCCGG GCAACCACGC CTACCTGGAC
CAGCAGCCGG TCGCCGGATT CCGCGTCGGC CTGCACTGGG CCGGCTACGT GCCGGTGTCG
AAGGCCTACG ACTGGGATCC GGTGACCGTC CTGCCCGGCA TCGCGCCCTC GGCGGTACTC
GGCGTCGAGG CACCGCTGTG GAGCGAGACC GTGAAGAACC TCGCCGACGC CGAAACCCTC
GCCTACCCCC GCCTCCCCGC CATCGCCGAA ATCGGTTGGT CGGCACCGAA CACCCACGAC
TGGCAGCGAT TCTCGAAGAG GCTGGCAGCG CAGGCTCCCC TGTGGGACAA GCTGGGGATC
GCTTACTACA AGTCGCCGGA AGTGCCTTGG GGGTCGGGGT AG
 
Protein sequence
MPRTAAALST CAALLALAAC GGGSSGTKSS VPPASTNGSS SAPSVPLSPA DPAALERVVP 
EPAGITAAAG TFTLTSATAI HASSGAEPVA ADLAAYLKQQ TGLAPAVSQS PDAAIQLVLQ
PSGGDPSLGT EGYTLVIGPS SVKLTAATDA GLFHGVQTVR QLLVGAKLQD GTITDHPRFA
YRGVMLDVAR HFYSVADVKA YIDAAALYKV NEFHLHLTDD QGWRFAVPGW PKLTSVGAAT
QVGGGVGGSY SAADLKEIVD YAASRYMTVI PEIDMPGHVG AAVYAYPSLA CDGRHHGPVT
SVSPAYDSLC TSSESTYRFV DTATKAAADA TPGATYLHMG GDEAQALSLT QYNAFVAKTQ
NLVAGHDRTP IAWAEAGTAT LLPQTVLEYW NTAQPQPYVL QAAAKGTKLI MAPGNHAYLD
QQPVAGFRVG LHWAGYVPVS KAYDWDPVTV LPGIAPSAVL GVEAPLWSET VKNLADAETL
AYPRLPAIAE IGWSAPNTHD WQRFSKRLAA QAPLWDKLGI AYYKSPEVPW GSG