Gene Caci_7918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7918 
Symbol 
ID8339295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp9189632 
End bp9191239 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content67% 
IMG OID644961002 
ProductAlpha-galactosidase 
Protein accessionYP_003118582 
Protein GI256397018 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.542076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCATC AGGCCCCGCG CAGTCGACAG TTGCGCGCCG CCTTCGCAGC GCTCGCCCTC 
GTGGTCGGGA CGATGTTCGG AGTAGCGGTA GCGGCACCGG CACAAGCCCT CGGCAACGGT
CTGGCACTCA CGCCGCCGAT GGGATGGAAC GACTGGAACT CCTTCGGGTG CAACGTCTCG
GAGAGCCTGG TCGAGCAGAC GGCTGATCTC ATCGTCTCGT CCGGGATGAA GGACGCCGGC
TATCAGTATG TGAACATCGA CGACTGCTGG ATGTCGTCGA ACCGGGACGC CGGCGGCAAC
CTGGTCCCGG ATCCGGCGAA GTTCCCGGAC GGCATCTCGG GGACCGCCGC GTACGTCCAC
AGCAAGGGTC TCAAGCTCGG GATCTACGAG AGCGCGGGCA CCGCGACCTG CGCCGGCTAT
CCGGGCAGCC TGAACCACGA GCAGGCTGAC GCGAACTCCT TCGCGTCCTG GGGCGTGGAC
TACCTCAAGT ACGACAACTG CAACAACCAG GGAATCCCGG CTCAGACCCG CTACACGGCG
ATGCGCGACG CGCTGGCGAA AACCGGCCGG CCGATCGTGT ACAGCCTGTG CAACTGGGGT
CAAGAGTCGG TCTGGACCTG GGGCGCCGGG GTCGGCAACC TGTGGCGCAC CACCGGTGAC
ATCAGCGCCA ACTTCGGCAG CATGCTGTCC AACTTCCACA ACACCGTCGG CCTGGCGTCC
TCCGCCGGAC CCGGCGGCTG GAACGACCCG GACATGCTCG AAGTGGGCAA CGGGATGTCG
TTCACCGAGG ATCGCGCGGA GATGTCGCTG TGGGCCGAGA TGGCCGCGCC GCTGATCTCC
GGGACGGACC TGCGCAAGGC GACGACGGCC ACGCTGTCGC TGTACACGAA CAAGGACGTC
ATCGCCGTCG ACCAGGACTC GCTGGGCAAG GCCGGCCGCG AGATCGCGTC CTCCGGCGGC
GCGGACGTGC TGGCCAAGCC GTTGGCGAAC GGCGACGTCG CAGTCGCGCT GTTCAACGAG
AACTCCTCGG CGCAGACGAT CTCGACGTCC GCGTCGGCGA TCGGGATCGG TTCAGCTTCC
TCGTACAAGC TGAACAACCT GTGGTCGCAT GTGCTCACCT CGACCTCCGG CTCGATCAGT
GCCTCGGTGC CGGGCCACGG CGTAGTGCTG TACCGGGTGT CCGCGGGCAG CGGGAGCAGT
GTCGGCAGTG CGCACCGGTT CCTCGGCGCG TCGTCCGGCC GCTGCCTGGA CGTGCCGAAC
GGCAGCACGA CGACCGGCAC GCAGCTGGAC ATCTGGGACT GCGGCACCGG GACCAACCAG
TCCTTCACCC CGACCTCGGA CAAGGAACTG CGCGTCTACA ACGGCAGCCT GTGCCTGGAC
GCCAGCGGAC AGGGCACGAC CGCCGGCACC AAGGTCATCA CCTGGACCTG CAACGGCCAG
ACCAACCAAC AGTGGAACCT GAACGCCGAC GGGACGGTCA CCGGCGTGCA GTCCGGGCTG
TGCCTGGACG TGACCGGCGG GAACGTGGCC TCCGGCAACG TCAACGGCAC GCTGATCGAA
CTGTGGGGCT GCAACGGAGG CGCGAACCAG CAGTGGAGCC TGGGGTAG
 
Protein sequence
MRHQAPRSRQ LRAAFAALAL VVGTMFGVAV AAPAQALGNG LALTPPMGWN DWNSFGCNVS 
ESLVEQTADL IVSSGMKDAG YQYVNIDDCW MSSNRDAGGN LVPDPAKFPD GISGTAAYVH
SKGLKLGIYE SAGTATCAGY PGSLNHEQAD ANSFASWGVD YLKYDNCNNQ GIPAQTRYTA
MRDALAKTGR PIVYSLCNWG QESVWTWGAG VGNLWRTTGD ISANFGSMLS NFHNTVGLAS
SAGPGGWNDP DMLEVGNGMS FTEDRAEMSL WAEMAAPLIS GTDLRKATTA TLSLYTNKDV
IAVDQDSLGK AGREIASSGG ADVLAKPLAN GDVAVALFNE NSSAQTISTS ASAIGIGSAS
SYKLNNLWSH VLTSTSGSIS ASVPGHGVVL YRVSAGSGSS VGSAHRFLGA SSGRCLDVPN
GSTTTGTQLD IWDCGTGTNQ SFTPTSDKEL RVYNGSLCLD ASGQGTTAGT KVITWTCNGQ
TNQQWNLNAD GTVTGVQSGL CLDVTGGNVA SGNVNGTLIE LWGCNGGANQ QWSLG