Gene Caci_3464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3464 
Symbol 
ID8334817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3844552 
End bp3846501 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content67% 
IMG OID644956608 
ProductCarbohydrate binding family 6 
Protein accessionYP_003114211 
Protein GI256392647 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.235288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0024319 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCCTCC ATCGAAGCAG GCGCGCACGA CGCCGCATCC CCCACCTCGC GGCCGCGATA 
GCCGCGACCG GCGTCGCGGC CCTCGCCCTC GCGGGCGTCC AAGGCACAGC CCTGGCCGCG
TCGGCCGCCC CCCACCCCGC AGCCGCCAAG GCCGCCGCCG CGGTCTTCAA CGTCAAGGAC
TACGGAGCCA CCGGCAACGG CTCCACCAAC GACTCTCCAG CCATCAATAA GGCGGTCGCC
GCCGCGAACA CCGCCGGCGG CGGCATCGTG GAGTTCCCGT CCGGCAGCTA CAAGTCCGCG
AACACCGTCC ACCTCAAGAG CAACGTCACC ATCCAGCTCG ACGCCGGCTC CAAAGTCCTC
GGCTCCAGCG CCAAGACCTA CGACGCCGCC GAGTCGAATC CCAACGACAA GTACCAGGAC
TACGGCCACA GCCACTTCCA CGACGCGATG TTCTCCGGCG ACAAACTCTC CAACATCGGC
TTCACCGGCT CCGGCACCAT CGACGGCGGC GGCAACCTGA TCACCGGCAA CCCCGGCGCC
GGGCAGGCCG ACAAGATCCT GTCACTGACC CGCTGCACCA ACCTCACGCT CAGCGGCATC
ACCCTCACCC GCGGCGGGCA CTTCGGCGCG CTGATCAACG GCTGCGACGG CGTGGTGTCC
GACCACCTCA CGATCGCGAC CTCCAGCGAC CGCGACGGCT GGAACATCAT CTCCACGACG
CACGTCACCA TCACCAACGC GAACATCTCC TCCAACGACG ACGCGCTGGT GTTCAAGAGC
GACTGGGCCC TGGGCCAGAC GCTGCCCAGC GGCCACGTCA CCGTCACGAA CAGCACGCTG
CAGGCCAAGT GTTGCAATGC CCTGATGTTC GGCTCGGAGA CCTGCGGATC CTTCACCGAC
TACCGGTTCC AGCAGATCAC CATCCTCGGC GCCGGCAAAT CCGGGCTGGG CATCGTGAGC
ATGGACGGCG CAGACATCTC CGACGTGCAC TACCAAGACG TCACCATGAC CGGTGTGCAC
TCGCCGATCA TGGAGAAGAT CGGGACCCGA TTGCGATGCG GCGGCAGTCC GAAGGTCGGG
CATATCAGTA ACGTGACATT CACCAACGTC ACGGGCACCG GCGTGGCGAG TACGGACTAC
AGCCCGACGA TCTGGGGCGC CGACAGCAGC CACCAGGTCA GCGACGAGAC CTTCACCAAC
GTCAACCTGA CCGTCCCCGG CGGCCACGGC ACGATGTCCA CCGGCGTCCC CAGCGACAAC
GGCGACTACA ACCCGAACAG CATCGGTACG CGGCCGGCGT ACGGCTGGTA CCTGCACAAC
GTCTCCGGCA TCCACTTCAC CGGCGGCTCG GTCAAGGTCG CCAAGACCGA CGGCCGCCCC
GCGGTCATCG CCAACGCCGG CAGCGCCATC ACCTTCGACG GACTGACCGC GCAGACCGGC
AGCTCCAGCC CGTTCGACGT CGGCTTCCAG AACATCACCG GATACTGCCT CAGCAACAGC
CACAACACCT CCGGCGGCGC GCTGCGCGTC TCGGCCAGCG GCTCCACCCA AAGCTGCGGC
TCCTCGGCGA CACGCTATGA GGCGGAGAAC GCGACGCTGT CCACCGGCGA CACCGTGGCC
ACCAACCACA CCGGTTTTTC CGGCAGCGGC TTCGTCGACA CGACCAACGC CGTCGGTGCG
TACGTCGAGT GGACCGTGAC CGCGCCGGCC GCCGGCACGT ACACAGCCAC CGTCGGCTAC
GCGAACGGAA CCACCACCGA CCGGCCGATG GACGTCGCCG TGAACGGCAC GACCGCGGAC
GCCGCAGCGT CGTTCCCCAC GACCGCGAGC TGGAACACCT GGGCCGGCAA GGCATTCAGC
GTTCCGTTGA ACGCCGGCGC CAACACCATC CGGGTAGCCG CGAGCACCGC GAACGGCTGC
CCGAACCTCG ACTACCTCGA CCTCGGCTGA
 
Protein sequence
MFLHRSRRAR RRIPHLAAAI AATGVAALAL AGVQGTALAA SAAPHPAAAK AAAAVFNVKD 
YGATGNGSTN DSPAINKAVA AANTAGGGIV EFPSGSYKSA NTVHLKSNVT IQLDAGSKVL
GSSAKTYDAA ESNPNDKYQD YGHSHFHDAM FSGDKLSNIG FTGSGTIDGG GNLITGNPGA
GQADKILSLT RCTNLTLSGI TLTRGGHFGA LINGCDGVVS DHLTIATSSD RDGWNIISTT
HVTITNANIS SNDDALVFKS DWALGQTLPS GHVTVTNSTL QAKCCNALMF GSETCGSFTD
YRFQQITILG AGKSGLGIVS MDGADISDVH YQDVTMTGVH SPIMEKIGTR LRCGGSPKVG
HISNVTFTNV TGTGVASTDY SPTIWGADSS HQVSDETFTN VNLTVPGGHG TMSTGVPSDN
GDYNPNSIGT RPAYGWYLHN VSGIHFTGGS VKVAKTDGRP AVIANAGSAI TFDGLTAQTG
SSSPFDVGFQ NITGYCLSNS HNTSGGALRV SASGSTQSCG SSATRYEAEN ATLSTGDTVA
TNHTGFSGSG FVDTTNAVGA YVEWTVTAPA AGTYTATVGY ANGTTTDRPM DVAVNGTTAD
AAASFPTTAS WNTWAGKAFS VPLNAGANTI RVAASTANGC PNLDYLDLG