Gene Caci_6362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6362 
Symbol 
ID8337725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7317608 
End bp7318675 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content72% 
IMG OID644959463 
Product1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D- glucopyranoside deacetylase 
Protein accessionYP_003117057 
Protein GI256395493 
COG category[S] Function unknown 
COG ID[COG2120] Uncharacterized proteins, LmbE homologs 
TIGRFAM ID[TIGR03445] 1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.797199 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.473152 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTTC CCCTGGACCT GCTGGACATG CCGTTCGCGC GCCTCGGCGA CCGCCTCGGC 
AACCCGCTCA GGCTGCTGAT GGTGCACGCG CATCCGGACG ACGAGACCAC CACCACCGGC
GCCACCGCCG CGCTGTACGC CGCCGAGACC ATCGACGTGT ACCTGGTGAC CTGCACCCGG
GGCGAGCGCG GCGAGATCCT GGACCCGGAG GCCCAGCGCG TGGTGGACGA CGCCGCCGAC
GGGGAGCAGG CGCTGGGCGA ACTACGGGTG CGAGAACTGG CCGGCGCCGT CACCATGCTC
GGGATCAAGG GGTCGCGCTT CCTCGGCGGA GCGGGCCGCT GGTGGGATTC GGGGATGGCC
GGCGAGGAGT CCAACACCGA CCCGCGCTCG CTCGTGGCCG GGGACTTCCA GGAGCAGGTC
GACGCGTTGG CCGCGGCGAT CCGCGAGATA CGGCCTCAGG TTCTGGTCAC CTATGACTCG
CGCGGCGGCT ACGGGCACCC CGACCACATC CGCGCGCACC AGCTGAGCCT GGCCGCCGTC
GACCGCGCGG CCGAGACCGG CGGCGAGAGC GAGAGCGGCG GCGAGGGCGG CGGCGAGGGC
GCGGAGGCCT GGAGCGTCGC GAAGGTCTAC GCGGCCGTCG TCCCGTTCAG CATTCTGCGT
TCGGTCGCGC GCCGCCTGGG CTCCAACGGC GACAGCCCCT TCGCCCCGCT CGCCGAGGCC
TTGGCCAACG GCGTGCCGGA GGACCTCATC GAGATCCCGT ACGGCGTCCC CGACCACCTG
GTGACCGCCC AGATCGACGC CCGGGACTGG CTGGACGCCA AGACCGCCGC CATGCGCTCG
CACCGTTCCC AGATGGCCGC CGACAGCTGG TTCTTCAAGC TCGCGGCGAG CTCCGACGGC
GGATTCGGCA TCGAGCACTT CCAGCTGCTG CGCGGCACGG CCGGGCCGTT GGACGACGGT
TTCGAGGCCG ACCTGTTCGC CGGCGTGCGG GCGGTCGACG ACTCCGATTG CGAACCCGAC
TTCGGATGGC TGCCCGAAGA GGAGCCGGCC GGCGGCGAGC TGTTCTGA
 
Protein sequence
MTLPLDLLDM PFARLGDRLG NPLRLLMVHA HPDDETTTTG ATAALYAAET IDVYLVTCTR 
GERGEILDPE AQRVVDDAAD GEQALGELRV RELAGAVTML GIKGSRFLGG AGRWWDSGMA
GEESNTDPRS LVAGDFQEQV DALAAAIREI RPQVLVTYDS RGGYGHPDHI RAHQLSLAAV
DRAAETGGES ESGGEGGGEG AEAWSVAKVY AAVVPFSILR SVARRLGSNG DSPFAPLAEA
LANGVPEDLI EIPYGVPDHL VTAQIDARDW LDAKTAAMRS HRSQMAADSW FFKLAASSDG
GFGIEHFQLL RGTAGPLDDG FEADLFAGVR AVDDSDCEPD FGWLPEEEPA GGELF