Gene Caci_5002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5002 
Symbol 
ID8336356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5726282 
End bp5727586 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content71% 
IMG OID644958101 
ProductCarbohydrate binding family 6 
Protein accessionYP_003115703 
Protein GI256394139 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.173534 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.970629 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGAC CACTGCTGTC CCGTCTGATA CCTGCTGTGG TGCTCGTCGT CGCCACGACG 
GGTGCCGTGG CCGGTCAGAG CGCTGCGAAG GCCGCCGGCG TGGTACCGGG GTTGACCGGA
GCGGCGGCGC GACAGGCCGC CGGTCCCGCG GCCGATCGGT TCCGGCAGCT CACCGAGCAG
CGCGTCCAGG CGGCTGCGGC CGGGCACGGC GCGGCTTCGT CCGCGGCCGC CACGCTGCAC
ACCAGCTGGG GCATCACGAT GCCGGTCGGC GTCGGCACCG GTTTCAAAGC CCTGCAAAGC
GTGGTCGGCG GAGCCCAGCC GACGAACGGC GGCGACTTCG TCTACGCCCC GACGGCGCTG
CCGCCCGGCC GCGCGTGCAT GGAGATCACC ACCGCCTACA CGCCCAGCGG CCCGGATCTG
TGGGCCTGGG ACTGGTGCGG CGGACGCGAT CAGGTCGGCA AGCTCACGGC GATGGACTCG
ACCTTCCTGG CCACCTACAC CACCACGGTC AACGGTCACC CCGCCTACGA CCTCGACGAG
CACCAGACCT CGGCCTCGGG CAACGTCTGG ACCGCGTACC TGTACAACTA CCAGACCCAC
GCCTGGGACA CCTTCTACAC CAGCTCCGGT ACCTACGACC TGTCGCAGTA CCCCTTCGGC
TGGGACATGT TCGAGGTCTA CACCACCCCC GATCCGGGCA CCGGCGCCGG CTACTACTGC
CACGACCTGC TCGGCAAGCC CTTCGAGAGC AGCAGCGTCC AGCTCCTGAC CGGCAGCACC
TGGACTCCCG CGGCGCCCGG CAACAGTTCC CCCGACAGCA CGCCGCCCGC ACCCGGCAGC
AGCCTGGACT GTCCCGCCCT GACCATCACC CTGGCCCACC CGAACGACGA CTGGACCGCC
CTGATCGGCG GCACCAGTGG CAGCTCGCAG TCCTACGAAG CCGAAGCCGC CGGCAACACG
CTGGCCGGTC AGGCCGCGGT CCGCAGCTCC TCCGGCGCTT CCGGCGGCGC CCTGGTCGGC
TACATCGGGA ACGGCACCGC GAACTACCTC CAGGTCAACA ACGTCTCGGC CACCACGGCC
GGCAGCCACC GCCTGACGAT CTACTACGCC GCCGGCGAGA ACCGCTCGCT CACCGTCAGC
ATCAACGGCG GCGCCGCGAC CAGCCTGACC ACCCCCGGCA CCGGCGGCTG GGACACCGTC
GGATCGGTCG CCACGACCGT GACCCTGACC GCGGGCACGA ACACCGTACG GATCGGCAAC
CCGACCGGCT GGGCGCCGGA CGTGGACCGC ATCGTCGTGT CCTGA
 
Protein sequence
MRRPLLSRLI PAVVLVVATT GAVAGQSAAK AAGVVPGLTG AAARQAAGPA ADRFRQLTEQ 
RVQAAAAGHG AASSAAATLH TSWGITMPVG VGTGFKALQS VVGGAQPTNG GDFVYAPTAL
PPGRACMEIT TAYTPSGPDL WAWDWCGGRD QVGKLTAMDS TFLATYTTTV NGHPAYDLDE
HQTSASGNVW TAYLYNYQTH AWDTFYTSSG TYDLSQYPFG WDMFEVYTTP DPGTGAGYYC
HDLLGKPFES SSVQLLTGST WTPAAPGNSS PDSTPPAPGS SLDCPALTIT LAHPNDDWTA
LIGGTSGSSQ SYEAEAAGNT LAGQAAVRSS SGASGGALVG YIGNGTANYL QVNNVSATTA
GSHRLTIYYA AGENRSLTVS INGGAATSLT TPGTGGWDTV GSVATTVTLT AGTNTVRIGN
PTGWAPDVDR IVVS