Gene Caci_1433 details


Gene Information

Locus tag: Caci_1433
Symbol:
ID: 8332772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 1630647
End bp: 1633658
Gene Length: 3012 bp
Protein Length: 1003 aa
Translation table: 11
GC content: 71%
IMG OID: 644954581
Product: glycoside hydrolase family 2 TIM barrel
Protein accession: YP_003112197
Protein GI: 256390633
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
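
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon is a
    stop codon, which is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1   # inclusive coordinate span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_length, protein_length = cds_lengths(1630647, 1633658)
print(gene_length, protein_length)
```

With the coordinates listed in this record, the result agrees with the reported 3012 bp and 1003 aa.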


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAGTG ACACTGAGAC CTCCTACCAG CGTTTCGCCT ACGTCGAGGA CCGCTCCCCC 
GGGACCGGCC GGCTCGCACC GCGCGCGGCG TTCGCCTCCG ACGCCGCGGT CCTCGGGCTC
GACGGCCGGT GGCGCTTCCG CCTGGCCGCC GGGCTGCACG ACACGACAGA GGCATTCCAG
GCGCCGGACT TCGACGACGC CGCCTGGGAC GAGATCGCCG TCCCGTCGTG CTGGCAGATG
GACGGCCTGC CCGGCGAGCC GCGCTACGGC GCGCCGGCGT ACACGAACGT CACCTATCCG
ATCCCGCTGA ACCCGCCGCA CGTCCCGCGC GAGAACCCGA CCGGCGAGTA CCGCTACGCC
TTCGACGTGC CCGGCGACTT CCACGCCTCG GGTGCGCGCT TACGCTTCGA AGGCGTCGAT
TCCTGCTTCG CGGTCTGGCT GAACGGCGCG CTGCTCGGCG ACGGCAAGGG CTCGCGGCTG
CCCACCGAGT TCGACGTCTC CTCCGTGCTG GAACCGGGGC GGCAGAACGT GATCGCGGTC
CGTGTCCACC AATGGTCGGC GGGAACCTAC CTGGAAGACC AGGACATGTG GTGGCTGTCC
GGCATCTTCC GCTCGGCGGC GGTCCTGGAG CGGCCGCGCG AGGGTATCTC AGACTTCTTC
GTCCACGCCG ACTACGACCC GGCGACCGGC GCCGGCACGC TGCGGATCGA CGTCACCGGC
ACGGCCCGCC TCACCGTCCC CGACCTCGGC ATCGCCGACG CCGACCCGGC CGGGCCGTTC
GTCATCAGGC GGGTCGAGCC CTGGAGCGAC GAGCGGCCGC GCCTGTACGC CGGCGAGCTG
GTCAGCGCCG GCGAGCGCGT ACCAATCCGT ATCGGCTTCC GCCGCGTCGA GGCCGCCGAC
GGCGTCCTGC GCGCCAACGG CAAGCCGCTG AAGTTCCGCG GCGTGAACCG CCACGAGTGG
CATCCGCTCA CCGGCCGCAC GCTGAGCCCG GAGACGATGC TGGAGGACGT GCTGCTGATG
AAGCGGCACA ACATCAACGC CGTCCGCACC TCGCACTACC CGCCGGACTC CCGCTTCCTG
GACCTGTGCG ACGAATACGG GCTCTGGGTC ATCGACGAGT GCGACCTGGA GACGCACGGC
TTCGCCGTGG TCGGCTGGCG CGAGAACCCC GTCGCCGACC CCGCGTGGCG CGAGGCGCTG
TTGGACCGCG CCGAGCGCAT GGTCGAGCGG GACAAGAACC ACCCGAGCGT GGTGATCTGG
TCGCTGGGCA ACGAATGCGG CAGCGGCGAG AACCTGGCCG CGATGGCCGC GTGGATCCGG
GAGCGCAACC CCGAGCGCCT GATCCATTAC GAGGGCGACC ACGACTCCTC CTACGTCGAC
CTCTACTCGC GGATGTACTC CGACTACGAC CACGTCGCCG CCATCGGGGT GTATCAGGAG
CCGACGACGG TCGATCCGAC GGCCGACGCG CACCGGCGCT CCATCCCCTT CATGCTCTGC
GAATTCGCCC ACGCGATGGG CAATGGACCC GGCGGCCTGC TGGAATACCG CGACCTGTTC
GAGGCCCATC CCCGGCTGGC CGGCGGCTTC GTCTGGGAGT GGATCGACCA CGGCGTCGCG
CAGGGCTCGC ACTACGCCTA CGGCGGCGAC TTCGGCGAGC GCGTGCACGA CGGCAACTTC
GTCGCCGACG GCCTGCTCTT CCCGGACCGC ACGCCCTCGC CGGGTCTGCT GGAATACGCC
AAGCTCTGCG AGCCGGTCCG CATCGAGGGC GACACCGTTC GCAACCTGCA CCACAGCCGG
GACACCGGAT ATCTGCGCTG GCGCTGGCGA TTGGAGATCG ACGGCGACCT TATCGCGCAG
GACGAACTCC CCGTCCCACC GATCGCCCCC GGCTCGACCT TCCGCCTCCG TTACCCGGAC
GAGCTGACGA AGGCCGCCCA CGCCGCCGGC CCGGGCGAGC GCTGGCTGAC CGTCGAGGCG
GTGCTGTCCG ACGGTGAACC ATGGGCGCCG GCCGGGCACG TGGTCGCCTG GAGCCAGCTC
GAGCTGGGAG ACGCGCCGTT CACCGACGCC GATCCGCTGG TGGACCAAGC GGTCGCGCTG
GCTGCGGACG CGCTGCTGGC GGCGACCGCC GTCGGCGGCA GCGCGATCGA CCGCGGCGAC
GCGGCGGCCG ATTACCTGAC GCCGCAACGC CTCGGCGACA CCGTCACCCT GGGACCGGCC
TCCTTCGACG CCTCCTCCGG CGAGCTGCTG GGCTTGGCAG GGCTGGCGAT CGACGGCTTC
GCCCTGGATC TGTGGCGCGC TCCGATCGAC AACGAACGCT GGTCCTCCTT CACCGCGCCG
CCGTTGGTCG AGGCGTGGCG GACGGCCGGC CTGGACCGGC TGGAGCACGA CGTCCTGGCG
GTCGAGTCCG AGCCCGACGC GTTCACCGTC ACGACGCGCG TCGGTCCGGT CGGCCGCGAC
CACCATCTCG ACGTCGTCTA CGTCTGGTCG GCGACGGACT CCCGGCTGTG CCTGACCGTC
CACGTCGCGC CGAACCGGCC TTGGCCGTGC CCGATCCCCC GGCTCGGCGT GTCCTTCCAG
CTGCCCGGCG AGCTGAATAC CGTGAGCTGG TACGGACTCG GGCCCGGCGA GGCCTACCGG
GACAGCCGGT CGGCGGTGCG GATCGGGCAT TATCAGAGCT CTGCCGCCGA TCTGCAGACG
CCCTACCTTT TCCCGCAGGA GAACGGGAAC CGCCACCAGG TGCGCAGAGC CTCGCTCACG
CGTCCCGACG GCACCGGGCT GCTGCTCTCC GGCGCGCCGC ACTTCGATCT CGCCGTGCGG
CCGTGGAGCA GCGCCGCAAT GGAAGCCGCG CGCCATCCCG ACGAGCTGAT TCCCTCCGGG
CGGCTGCACG TCCAGGTGGA CCACGCGCAC CACGGCATCG GCAGTGCGTC GTGCGGCCAC
CCCCTGCAGC CCCGCCATCG CCTCGAGGCC GGCCGCGCGA GCTTTGCCTT CACCCTGGAG
GCGCTACAGT AG
 
Protein sequence
MPSDTETSYQ RFAYVEDRSP GTGRLAPRAA FASDAAVLGL DGRWRFRLAA GLHDTTEAFQ 
APDFDDAAWD EIAVPSCWQM DGLPGEPRYG APAYTNVTYP IPLNPPHVPR ENPTGEYRYA
FDVPGDFHAS GARLRFEGVD SCFAVWLNGA LLGDGKGSRL PTEFDVSSVL EPGRQNVIAV
RVHQWSAGTY LEDQDMWWLS GIFRSAAVLE RPREGISDFF VHADYDPATG AGTLRIDVTG
TARLTVPDLG IADADPAGPF VIRRVEPWSD ERPRLYAGEL VSAGERVPIR IGFRRVEAAD
GVLRANGKPL KFRGVNRHEW HPLTGRTLSP ETMLEDVLLM KRHNINAVRT SHYPPDSRFL
DLCDEYGLWV IDECDLETHG FAVVGWRENP VADPAWREAL LDRAERMVER DKNHPSVVIW
SLGNECGSGE NLAAMAAWIR ERNPERLIHY EGDHDSSYVD LYSRMYSDYD HVAAIGVYQE
PTTVDPTADA HRRSIPFMLC EFAHAMGNGP GGLLEYRDLF EAHPRLAGGF VWEWIDHGVA
QGSHYAYGGD FGERVHDGNF VADGLLFPDR TPSPGLLEYA KLCEPVRIEG DTVRNLHHSR
DTGYLRWRWR LEIDGDLIAQ DELPVPPIAP GSTFRLRYPD ELTKAAHAAG PGERWLTVEA
VLSDGEPWAP AGHVVAWSQL ELGDAPFTDA DPLVDQAVAL AADALLAATA VGGSAIDRGD
AAADYLTPQR LGDTVTLGPA SFDASSGELL GLAGLAIDGF ALDLWRAPID NERWSSFTAP
PLVEAWRTAG LDRLEHDVLA VESEPDAFTV TTRVGPVGRD HHLDVVYVWS ATDSRLCLTV
HVAPNRPWPC PIPRLGVSFQ LPGELNTVSW YGLGPGEAYR DSRSAVRIGH YQSSAADLQT
PYLFPQENGN RHQVRRASLT RPDGTGLLLS GAPHFDLAVR PWSSAAMEAA RHPDELIPSG
RLHVQVDHAH HGIGSASCGH PLQPRHRLEA GRASFAFTLE ALQ
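
The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a few lines of code. The sketch below assumes the gene sequence is given 5' to 3' on the coding strand, as displayed above. It uses the standard codon assignments, which translation table 11 shares for elongation; table 11's alternative start codons are not handled, which does not matter here because the gene begins with ATG. The names `translate` and `gc_content` are illustrative only.

```python
from itertools import product

# Standard codon table in NCBI "TCAG" order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above.
first_line = ("ATGCCCAGTG ACACTGAGAC CTCCTACCAG "
              "CGTTTCGCCT ACGTCGAGGA CCGCTCCCCC")
print(translate(first_line))  # MPSDTETSYQRFAYVEDRSP
```

Passing the full gene sequence to `translate` and `gc_content` would allow the displayed protein sequence and the GC content field to be checked against the nucleotide data.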