Gene Caci_4693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4693 
Symbol 
ID8336047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5349195 
End bp5350526 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content70% 
IMG OID644957793 
Productglycoside hydrolase family 4 
Protein accessionYP_003115395 
Protein GI256393831 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0209787 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.517895 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCGCA TCGCCTTCGT CGGCGCCGGC TCGGTGGTCT TCACCCGGCA GCTGCTGCAC 
GACATCCTGT CCTACCCCGA GCTGTCCGGC GCGGCCATCG CGCTGCACGA CATCGACCCC
GAGCGGCTGC AGGTCGCCGC CGCGCTGGCC GACCACGCCG CGCGCACCCT CGGCGCCGAG
GTGTCCGTCA CGGCCACCAC CGACCGCCGC GCCGCCCTGG CCGGCGCCGA CGCCGTCGTC
AACATGATCG CCGTCGGCGG CCACCAGGCC ACCGTCACCG ACTTCGAGAT CCCCGCCGCC
GCCGGCCTGC GCCAGACCAT CGGCGACACC CTCGGCGTCG GCGGCATCTT CCGCGCCCTG
CGCACCTTCC CGGTCCTGCG ATCCCTGGCG CAGGACATGG CCGAGGTCTG CCCCGACGCC
TGGCTGCTGA ACTACACCAA CCCCATGGCC ATGAACATCC AGTACCTGAG CACGATCGCG
CCGAAGCTGA AGGTCGCAGG CCTGTGCCAC TCGGTGTACT GGACCGTCCG CGGCCTGTGC
GACATCATCG GCATCCCCCA CGACGACGTC GACGTCCTGT CCGCCGGCGT GAACCACCAA
GCCTGGATCC TGCGCTGGCA GCACCAAGGC CGCGACCTCT ACCCGGCCCT GGACGCCGCC
ATAGCAGCCA GCCCGGACCT GGCCCGCCGC GTCCGCGTCG ACATGTACCA GCGCCTCGGC
TACTACCCGA CCGAGACCAG CGAGCACTCC TCCGAATACG TCCCCTGGTA CCTCGGCCAC
GACACCGAAA TCACCCGCCT CCGCATCCCC GTAGGCGACT ACATCGACAT CAGCGCCGAA
AACCTCGCCG AATACCGCGA ACTCCGCAAA GTCATCACCG ACGGCGGCGA CCCCGCCAAC
GGCTGGGAAA GCGACGCCGC CGAATACGCC CCCCAAGTCA TCCACAGCCT AGCCACCGGC
ACCCCCCGCA CCATCCAAGT CACCACCCCC AACACCGGCC TGATCAGCAA CCTCCCCGAA
GCAGCCGCCG TCGAAGTCCC AGCAACCCTC GACCGCCTCG GCATCCACCC CCACCACGTA
GGAGCACTCC CACCCCAACT AGCAGCCCCC AACCGCCACT TCCTCAACGT AGTCGACCTA
GTAGTAGCCG CCGCCGTAGA AGGCGACCCC CGCCACATCC GCCACGCCGC AATGGCAGAC
CCCGCCACAG CCGCAACCCT GACCGTCGAC CAGATCTGGA ACCTCTGCGA CGCCATGGTC
ACCGCCCACG GAGACGCACT GCCGGAGCCA TTGCGGCGGA GCCCGCTCGC GGCGCGGGGC
GGTGGGGGGT GA
 
Protein sequence
MIRIAFVGAG SVVFTRQLLH DILSYPELSG AAIALHDIDP ERLQVAAALA DHAARTLGAE 
VSVTATTDRR AALAGADAVV NMIAVGGHQA TVTDFEIPAA AGLRQTIGDT LGVGGIFRAL
RTFPVLRSLA QDMAEVCPDA WLLNYTNPMA MNIQYLSTIA PKLKVAGLCH SVYWTVRGLC
DIIGIPHDDV DVLSAGVNHQ AWILRWQHQG RDLYPALDAA IAASPDLARR VRVDMYQRLG
YYPTETSEHS SEYVPWYLGH DTEITRLRIP VGDYIDISAE NLAEYRELRK VITDGGDPAN
GWESDAAEYA PQVIHSLATG TPRTIQVTTP NTGLISNLPE AAAVEVPATL DRLGIHPHHV
GALPPQLAAP NRHFLNVVDL VVAAAVEGDP RHIRHAAMAD PATAATLTVD QIWNLCDAMV
TAHGDALPEP LRRSPLAARG GGG