Gene Caci_4157 details


Gene Information

Locus tag: Caci_4157
Symbol:
ID: 8335511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4699982
End bp: 4702141
Gene Length: 2160 bp
Protein Length: 719 aa
Translation table: 11
GC content: 68%
IMG OID: 644957260
Product: cellulose-binding family II
Protein accession: YP_003114862
Protein GI: 256393298
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3534] Alpha-L-arabinofuranosidase
TIGRFAM ID:
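
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the record does not state either convention explicitly). The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4699982, 4702141)
print(length)                     # 2160
print(protein_length_aa(length))  # 719
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.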


Plasmid Coverage information

Number of covering plasmid clones: 14
Plasmid unclonability p-value: 0.894149
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 20
Fosmid unclonability p-value: 0.131999
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTCG ATCGCAGGCT CAGGGCGAGA TCGGTCGCCC TGACCGCCGC CACGGCTTTG 
TCCGCGCTGT CCATCGTCGG AGCGGTCGCG GCTCCGCAGG CGTCCGCCGC CACCGCGGTG
TCGGTGACCG TCAACGGCAC AGCCGGGCTC GGTACCATTC CCGGCGGCGC GATCGGCCTG
AACACCGCCG TCTACGACAG CTATATGAAC GACACCCCGA TCCCGGGTCT GCTCAAGGCC
GCGGGAATCA ATGCTCTGCG CTACCCGGGC GGTTCGTACT CTGACATCTA CAACTGGCAG
ACCAACGTCG CGCAGGGCGG CTACGACGCG CCGAACACGA GCTTCGCGGA CTTCATGGGG
ACCGCGAAGG CTGCCTCGGC CAGCCCGATC ATCACCGTGA ACTACGGCAC CGGGACACCG
GCGTTGGCCG CGTCCTGGGT GCAGAACGCC GCCGTCACCA ACAAGGACGG CGTCGCGTAC
TGGGAGGTCG GCAACGAGGT CTACGGCAAC GGGACCTACG GCGCGAACTG GGAGACCGAC
GCGCACTGCC AGACGTCCTC CGGAACGCCG GTCACCGTCG GCAGCGAGCC TTCGCAGACC
TACGGTTGCG GTCCCTCGGT CTACGCCAAC AATGTCCTGA GTTACATGTC CTCGATGAAG
GCGGTCAGCT CGAACGCCCA CGTCTGCGCG ATCCTGACCA CGCCGGGGTT CTGGCCCGAC
AACGTCACCA ACGCCACGAC CAGCCCGCTT CCCTGGAACC AGACCGTGCT CACGGCGCTC
GGCGCCAAGA CCGACTGCGT CATCGTGCAC TACTATCCCG GCGGCTCGAA CGCGGCCGGG
ATGCTGACCG ACACCAGCGA CATCTCCGGG ATCATCTCGA CGCTGCACTC CCAGATCAGC
CAGTACGCCA AGGTGAACCC GGCGAACGTG CCGATCCTGG TGACCGAGAC CAACTCCAAC
GTGGACATGG ACACCCAGCC CAACGCGCTG TTCGCCGCCG ACATGTACAT GACCTGGCTG
GAGAACGGCG TCGCGAACGT CGACTGGTGG GACGAGCACA ACGGCCCGGG GACCAACCCG
CCGAGCGTCG TCAACGGCGC GCAGGACTAC GGCGACTACG GCATCTTCTC CACCGGCGGC
AACAACAGCG GCGTGACCGA GCCGGCCGCC GAGACCCCGT TCGGGCCGTA CTACGGCATC
GCGATGCTGT CCAAGCTCGG CGGACCCGGC GACACGATGG TGAACAGCAC GTCCTCCAAC
GCGCTGGTCC GCGTCCACGC GGTGCGGCGG GCCGGCGGGA ACCTCGATCT GCTGATCGAC
AACGAGGATC CCACCACCTC CTACTCGGTG AACCTGGCTT ACAACGGGTT CACGCCAGCC
GGTAGCCCGA CGGTCTTCAC CTTCGCGAAC AACGGGAATT CGATCACCAG TGCGACGCAG
AGCTCTGCGT CGTCGGTCAC GGTCGCTCCG TACACGCTCA CGGTCGTACA GGTCCCGGGC
AGCGGCGGGG GAGGTGTGAC AGCACCGGGA GCGCCGGGGC AGCCGGTCGT CTCCGGGCTG
GCGTCGAGCA CGTCCGGCAA CACCACCGGC GTGGCGACGC TGACCTGGCC AGCAGCCACG
GCCGGCACGT ACCCGGTCGC GTCCTACCAG GTCTACCGGC AGAACAGCGG CGGCGGGACA
ACCCTCGCCG GCACGACCAC CACGACGACG CTGAATCTCA GTGGCCTGAC GATCGGCGCG
GGCTACACCT ATGACGTGGT CGCGGTGGAC TCCCACGGCA ACCCGTCGCT GCCCTCGCCA
CCGGTGACGT TCACCGTGCC ACCCCCGGCG ACCGCGAGCT GCGCGGTGCA CTACGCGGTC
AGCTCCTCCT GGTCCGGAGG CTTCGGTGCC GCGATCACGA TCACGAACCG CAGTGCGACC
GCCATCAGTG CCTGGACCCT GAAATTCACC TGGCCCGACC CCGGCGAGGC GGTGCAGAGC
GGCTGGAACG GCACCTGGAG CCAGAGCGGC TCGGCGGTGA CCGTGGTGAA CGCCGCATGG
AACGGCACGA TCGCAGCCAA CGGCGGCACG GTGAGCCTCG GCTTCAACGG CGCGGACACC
GGCCAGGACC CGGCGCCGAC CGTGTTCTCG CTCAACGGGA CGGTGTGCGC GAACAACTGA
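
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases among all bases; the record does not say how its 68% figure was derived or rounded. The function name `gc_percent` is hypothetical. Whitespace is stripped so the sequence can be pasted in its 10-base block layout.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence above: 7 of 10 bases are G or C.
print(gc_percent("ATGCCCCTCG"))  # 70.0
```

Passing the full 2160 bp sequence to the same function gives the value to compare against the GC content field.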
 
Protein sequence
MPLDRRLRAR SVALTAATAL SALSIVGAVA APQASAATAV SVTVNGTAGL GTIPGGAIGL 
NTAVYDSYMN DTPIPGLLKA AGINALRYPG GSYSDIYNWQ TNVAQGGYDA PNTSFADFMG
TAKAASASPI ITVNYGTGTP ALAASWVQNA AVTNKDGVAY WEVGNEVYGN GTYGANWETD
AHCQTSSGTP VTVGSEPSQT YGCGPSVYAN NVLSYMSSMK AVSSNAHVCA ILTTPGFWPD
NVTNATTSPL PWNQTVLTAL GAKTDCVIVH YYPGGSNAAG MLTDTSDISG IISTLHSQIS
QYAKVNPANV PILVTETNSN VDMDTQPNAL FAADMYMTWL ENGVANVDWW DEHNGPGTNP
PSVVNGAQDY GDYGIFSTGG NNSGVTEPAA ETPFGPYYGI AMLSKLGGPG DTMVNSTSSN
ALVRVHAVRR AGGNLDLLID NEDPTTSYSV NLAYNGFTPA GSPTVFTFAN NGNSITSATQ
SSASSVTVAP YTLTVVQVPG SGGGGVTAPG APGQPVVSGL ASSTSGNTTG VATLTWPAAT
AGTYPVASYQ VYRQNSGGGT TLAGTTTTTT LNLSGLTIGA GYTYDVVAVD SHGNPSLPSP
PVTFTVPPPA TASCAVHYAV SSSWSGGFGA AITITNRSAT AISAWTLKFT WPDPGEAVQS
GWNGTWSQSG SAVTVVNAAW NGTIAANGGT VSLGFNGADT GQDPAPTVFS LNGTVCANN