Gene Caci_4251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4251 
Symbol 
ID8335605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4825931 
End bp4827517 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content69% 
IMG OID644957354 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003114956 
Protein GI256393392 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.115163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.185001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCGGC GCGCGCGGCT GGCGCTGCTG GCGGCCGGGG TCGCGCTCGG CATGACCGTG 
ACCGGTGCGG CCGGCGCGGC CGGGGCTTCG TCCTCGGCCG CTGCCGCCGC CTCGTTCCCG
AACGTCGTGG TCCCGGTGCC GGTGTCGGAG ACCGCGAACG GCTCGACCTT CACGCTCGCC
TCGGCGGCGA CGCTGACGGC CGACGACGCG AACGTCGGCG GCTACCTGGC CGGGATCCTG
CGGGCCTCGA CCGGGTACGC ACTGCCGCTT ACCGTCGGCG CCGCGGCGCC CGGCACGATC
GCGCTGTCCC TGTCCGGTGC GCCGGCCACG GTCGGCGCCG AGGGGTATCA GCTCACGATC
AAGGCGAGCT CGGTGCTGCT CCAGGCAAAC TCGGCGGCGG GGTTGTTCCA CGGCGTGCAG
ACGTTGCTGC AACTGCTGCC GGCTCAGGTG ATGAGCCCGG CGAAGGTGAC TTCGGTGGCG
TGGAAGGCGA CCGGCGGCAC GATCCTGGAC TATCCGCGCT TCGGGTATCG CGGGGCGATG
CTGGACGTGG CGCGGCACTT CTTCACCGTC GCGCAGGTCG AGCACTACAT CGACGAACTG
TCGCTGTACA AGGTGAACTA CCTGCATCTG CACCTGTCGG ACGACCAGGG ATGGCGCATC
GCGATCAACT CCTGGCCGAA CCTGGCGACC ACCGGCGGCT CCACCGAGGT CGGAGGCGGC
GCCGGCGGCT ACTACACGCA GGCGGACTAC ACCACGATCG TGAACTACGC CGCGTCGCAC
TACATGACGC TGGTCCCCGA GATCGACACG CCGGGTCACA CGAACGCCGC GCTCGCCTCG
TACGCGGCCT TGAACTGCAA CGGGGTCGCG CCGCCTCTGT ACACCGGGAC CGACGTCGGC
TTCAGCTCGC TGTGCGTCTC GCTGCCGCTG ACGTACACGT TCCTGGACCA GGTCGTCGGC
GAGCTCGCGG CACTGACTCC GGGCCCTTAC ATCCACATCG GCGGCGACGA GGCCAGCTCC
ACGTCGCAGA GCGACTACAC GTCCTTCATC ACCAAGGCGC AGCAGATCGT GGGCAACCAC
GGCAAGGCGG TCATGGGCTG GCACAACATC GCCGCGGCCA CCCTGGCGCC GTCCACGCTC
GCGCAGTTCT GGGACACGAC GAAGTCGAAC TCCGCGCTGG CTGCCGCGGC GGCTAAGGGC
ACGAAGATCG TCATGTCCCC GGCGAACCAC GCCTACCTGG ACATGAAGTA CACCAAGAAG
ACGACGCTGG GCCAGAACTG GGCCGGCTAC GTCGACGTCA ACGCGGCCTA CGGCTGGGAC
CCGGGGAACT ACCTGTCAGG CGTCAGCGCC TCGGCGATCG CCGGCGTCGA GGCGCCGCTG
TGGTCCGAGA CGCTCGTCAC GTCGGCGAAC ATCGACTACA TGGCCTTCCC GCGCCTTCCC
GCGCTGATGG AGCTCGGATG GTCGCCCGAA TCGACCCACA ACCAGACGTC GTTCGACGCC
CGGCTCGGCG CGCAGGGACC CCGGTGGCAC GCGATGGGGG TGGATTACTA CAAGTCGACG
CAGGTCAAGT GGCCGAGCGG GTCGTGA
 
Protein sequence
MFRRARLALL AAGVALGMTV TGAAGAAGAS SSAAAAASFP NVVVPVPVSE TANGSTFTLA 
SAATLTADDA NVGGYLAGIL RASTGYALPL TVGAAAPGTI ALSLSGAPAT VGAEGYQLTI
KASSVLLQAN SAAGLFHGVQ TLLQLLPAQV MSPAKVTSVA WKATGGTILD YPRFGYRGAM
LDVARHFFTV AQVEHYIDEL SLYKVNYLHL HLSDDQGWRI AINSWPNLAT TGGSTEVGGG
AGGYYTQADY TTIVNYAASH YMTLVPEIDT PGHTNAALAS YAALNCNGVA PPLYTGTDVG
FSSLCVSLPL TYTFLDQVVG ELAALTPGPY IHIGGDEASS TSQSDYTSFI TKAQQIVGNH
GKAVMGWHNI AAATLAPSTL AQFWDTTKSN SALAAAAAKG TKIVMSPANH AYLDMKYTKK
TTLGQNWAGY VDVNAAYGWD PGNYLSGVSA SAIAGVEAPL WSETLVTSAN IDYMAFPRLP
ALMELGWSPE STHNQTSFDA RLGAQGPRWH AMGVDYYKST QVKWPSGS