Gene Caci_7349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7349 
Symbol 
ID8338719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8532511 
End bp8533779 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content70% 
IMG OID644960430 
Productglycoside hydrolase family 4 
Protein accessionYP_003118017 
Protein GI256396453 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.197133 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCG CCGTCGTAGG CGGAGGATCG ACGTATACGC CGGAGCTGGT GGACGGCTTC 
GCACGGCTGC GCGACACCCT GCCGCTCACC GAGCTCGCCT TGATCGACCC GGCCGCGGAC
CGGGTGGAGC TGATCGGCGG GCTGGCCCGC CGGATCTTCG CCAAGCAGGG GCATCCCGGC
ACCGTCACCA CGCACACCGA GCTGGAGTCC GGCATCGAGG GTGCGGACGC GGTGCTCATC
CAGCTGCGGG TCGGCGGTCA GACGATCCGG AACGTCGATG AGACGTTCCC GCTGGAGTTC
TGCTGCGTCG GCCAGGAGAC CACCGGCGCC GGCGGCTTCG CCAAGGCGCT GCGGACCGTG
CCGGTGGTGC TGGACATCGC CGAGCGCGTC CGGCGAATAG CCCCGCAGGC CTGGATCATC
GACTTCACCA ACCCGGTCGG TATCGTCACC CGCGCGCTGC TGGACGCCGG GCACCGCGCC
GTCGGGCTGT GCAACGTGGC CATCGGCTTC CAGCGCCGCG CCGCCGCGCA CCTGGGCGTG
CAGCCCTCGC GCATCAAGCT CGACCACGTC GGTCTCAACC ACCTGACCTG GGAGCGCGGC
TTCTACCTCG ACGGCGAGGA CTTCCTGCCG AAGTACCTCA GCGAGTCGCT CGAGGAGATC
TCGCACGACA TCGAGCTGCC GGCCGAGCTG ATCCAGCGGC TCGCCGCGAT CCCCTCCTAC
TACCTGCGCT ACTTCTACGC CCACGACATC GTGGTGAAGG AGCAGATCGA CCAGGTCGCC
AAGGGCGAGA ACCGCGCCAA GGCGGTCGCC GCGGTCGAGG CCGAACTCCT GGCGCAGTAC
GCGGACCCGA CGCTGGACAC CAAGCCGGAG GCGCTGAGCA AGCGCGGCGG GGCGTTCTAC
TCCGAGGCGG CAGTCGAGCT GCTGGCCTCG CTGCACGGCG ACCTCGGCGA AGAATTGGTC
GTCAACGTCC GCAACGCGGG CACCTTCCCC TTCCTGGCCG ACGACGCGGT GATCGAGGTC
CCGGCGATCG TCGACGCCTC CGGGGTGCGC CCGGCGCCGT TGCGCGCGCC GATCGAGCCG
CTGTACCGCG GGCTCATCGG ACACGTTTCC GCCTACGAGG AGCTGGCCGT GGAGGCCGCG
ATCAAGGGCG GCGTGGAGCG GGTCCGCACC GCCCTGCTCG CGCACCCCCT GATCGGTCAG
GCGGACCTGG CGGACAAGCT GGCCGACTCC CTGGTCGCCA AGAACCGCAG CTTCCTGCCG
TGGGCGTGA
 
Protein sequence
MKLAVVGGGS TYTPELVDGF ARLRDTLPLT ELALIDPAAD RVELIGGLAR RIFAKQGHPG 
TVTTHTELES GIEGADAVLI QLRVGGQTIR NVDETFPLEF CCVGQETTGA GGFAKALRTV
PVVLDIAERV RRIAPQAWII DFTNPVGIVT RALLDAGHRA VGLCNVAIGF QRRAAAHLGV
QPSRIKLDHV GLNHLTWERG FYLDGEDFLP KYLSESLEEI SHDIELPAEL IQRLAAIPSY
YLRYFYAHDI VVKEQIDQVA KGENRAKAVA AVEAELLAQY ADPTLDTKPE ALSKRGGAFY
SEAAVELLAS LHGDLGEELV VNVRNAGTFP FLADDAVIEV PAIVDASGVR PAPLRAPIEP
LYRGLIGHVS AYEELAVEAA IKGGVERVRT ALLAHPLIGQ ADLADKLADS LVAKNRSFLP
WA