Gene Caci_5654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5654 
Symbol 
ID8337014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6516273 
End bp6518543 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content68% 
IMG OID644958758 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003116354 
Protein GI256394790 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.171925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.428495 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCGT CGCCACCCAC GGCGCACGCC GCCACCGGTA CGCCGCAGAC GATCCCCGCC 
GTCCGCACGT GGTCCGCCGG CTCCGGCTCC TTCTCTTGGA GCACGGCCAG CCGCGTCGTC
ATCAACCCGA CCTATGCCTC GCAGCTGCAA GGCGACGCGA ACACCTTCGC CGCCGATCTG
TCCGCGCTGG AAGGGCGCAC GGTCGGCGTC GCGCAGGGCA CCGCAGGTCC CGGCGACATC
GCCCTGAACC TGGGTGGCAG CCAGCCGACC GAGGGCTACA CCATGACCGT CGGCAGCAGC
ATCTCGATCC AGGGCAGCAC CACCACCGGC GAGTTCTGGG GCACCCGCAG CGTGCTGCAG
CTGCTGCATC AGGGTTCGAC GATCGCGGCC GGCACGGCAA CGGACTCCCC CGACAAGTCC
GAGCGCGGGC TGATGCTGGA CACCGGGCGC CGGTTCTTCG ATGTGGCGTT CGTGGAGAAC
CAGATCCGCG AGATGTCCTA CCTCAAGATG AACTATCTGC ACCTGCATCT GTCGGACACC
TACGGCTTCC GGCTGGAGAG CACCACGCAC CCGGAGATCA CCTCCGCGCA GCACTACTCC
AAGCAGGACA TCGCGGCCAT CATCGCGCTG GCGAAGCAGT ACCACGTGAC CGTCGTCCCG
GAGATCGACC TGCCCGGGCA CATGGACGCG ATCCTGTCCG CCGAGCTGGG CATCGGCCAC
GACTACCGGC TCAAGGACAG CAGCGGCAAC GCCAGCAGCA GCTACATCGA CCTGACCATC
CCCGGCGCCC GGCAGCTGAT CAGCGACCTG ATCACCGAAT ACGAGCCGCT GTTCACCACC
AGCTCGTACT GGCATCTCGG CGCTGACGAG TACGTCACCA ACTACGGCTC CTACCCGCAG
CTGCTCACCT ACGCCCACCA GAACTACGGC GCCAACGCCA CGGCGAAGGA CACCTTCTAC
GGCTTCGTCA ACTGGGCCGA CGGCATCGTG CGCGCCGGCG GCAAGACGAT GCGGATGTGG
AACGACGGGC TGAAGTCCGG CGACGGCACG ATCACCGTCA ACCCGGACAT CATCGTGGAG
TACTGGAGCA ACACCGGTCT GTCCCCGCAG CAGGTCGTCA ACGCCGGGCA CACCATCGCC
AACGAGGCGT ACACGCCGAC CTACTACGTC TACGGCGGCG CCAAGCCGAA CACGACGTCC
ATGTACGAGT CCTGGAACCC GGACCTGTTC GACGGCTCGA CCACCATCAC CAACGGCGCC
GCGAACCTGG GCTCCCTGAT CCACGTCTGG TGTGACAACC CCGGCGCCGA GACCGAGGAC
CAGACCGCCG ACGGCATCAA GTACCCGCTG CGCGACCTGG CGCAGATGAC GTGGAACAGC
CCCAAGCTGG TCCCGACCTA CGCCGCGTTC GTCCCGATCA TGGACGCCGT CGGCCGCAAT
CCGCTCTACC CGAAGCCGTC CATCGCCGGC GACCTCGCGC AGGGCAAGCC GACGACCGCC
TCCAGCGTCG AGACGCCGAA CTTCCCCGCC GCCGAGGCCA CCGACGCCGA CCTGAGCACC
CGCTGGTCCA GCCAGTACGC CGACCCGACC TGGCTGCAGG TGGACCTGGG CTCGGTGCAG
ACGGTCAACC GCGTGGTGCT GGCCTGGGAG GCGGCGTACG GGAAGAACTA CCAGATCCAG
CTGTCGAACG ACGGCACGAC CTGGACCACC GTCGCCACGC GCGCCAACGG CACCGGCGGC
ACCGAAACGC TGACGTTCGC CGACGCCACC GGCCGCTACC TCCGCATGTA CGGCACAGCG
CGCGGCACGC AGTACGGCTA CTCGCTGTGG GAGTTCGAAG CCTTCGACGA CGCGAGCGGC
CCGGTCCGCG GCACCCACAC CGTCAGCACC GGCGGCCAGG CGCTGGACGA CCCCGCCAGC
TCCACCGCGA CCGGCACCCA GCTCATCACC TGGGCCCTGC ACGGCGGCAC CAACCAGCAG
TGGACGTTCA CCGAGCAGCC CGACGGCTCC TACCAGATCA CCAACGGCGC CTCAGGCCTG
TGCCTGGACG TGACCGGCAG CTCCACGGCA GCCGGCGCGG CGGTGATCCA GTGGACCTGC
GCGGGCACCG CGAATCAGCA CTGGAACATC ACACCGCTGT CCGGCGGCGG GTACACGATC
GCCTCGGTGA ACAGCAACCT GCTGCTGACC ACGGCCTCCA CGGCGAACAA CGCGCTGGTG
ACGCAGCAGA CCAACAGCGG GAGCGCGTTG CAGCACTGGT CGATCAACTA G
 
Protein sequence
MAASPPTAHA ATGTPQTIPA VRTWSAGSGS FSWSTASRVV INPTYASQLQ GDANTFAADL 
SALEGRTVGV AQGTAGPGDI ALNLGGSQPT EGYTMTVGSS ISIQGSTTTG EFWGTRSVLQ
LLHQGSTIAA GTATDSPDKS ERGLMLDTGR RFFDVAFVEN QIREMSYLKM NYLHLHLSDT
YGFRLESTTH PEITSAQHYS KQDIAAIIAL AKQYHVTVVP EIDLPGHMDA ILSAELGIGH
DYRLKDSSGN ASSSYIDLTI PGARQLISDL ITEYEPLFTT SSYWHLGADE YVTNYGSYPQ
LLTYAHQNYG ANATAKDTFY GFVNWADGIV RAGGKTMRMW NDGLKSGDGT ITVNPDIIVE
YWSNTGLSPQ QVVNAGHTIA NEAYTPTYYV YGGAKPNTTS MYESWNPDLF DGSTTITNGA
ANLGSLIHVW CDNPGAETED QTADGIKYPL RDLAQMTWNS PKLVPTYAAF VPIMDAVGRN
PLYPKPSIAG DLAQGKPTTA SSVETPNFPA AEATDADLST RWSSQYADPT WLQVDLGSVQ
TVNRVVLAWE AAYGKNYQIQ LSNDGTTWTT VATRANGTGG TETLTFADAT GRYLRMYGTA
RGTQYGYSLW EFEAFDDASG PVRGTHTVST GGQALDDPAS STATGTQLIT WALHGGTNQQ
WTFTEQPDGS YQITNGASGL CLDVTGSSTA AGAAVIQWTC AGTANQHWNI TPLSGGGYTI
ASVNSNLLLT TASTANNALV TQQTNSGSAL QHWSIN