Gene Caci_7223 details


Gene Information

Locus tag: Caci_7223
Symbol:
ID: 8338591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 8395872
End bp: 8397623
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 68%
IMG OID: 644960304
Product: glycoside hydrolase clan GH-D
Protein accession: YP_003117893
Protein GI: 256396329
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
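
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that Protein Length excludes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(8395872, 8397623)
print(length)                     # 1752
print(protein_length_aa(length))  # 583
```

Under these assumptions the results match the Gene Length (1752 bp) and Protein Length (583 aa) fields.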


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCGAC GCCCGCCAGG GATCAGGACC AGGACGTGGA CCGCGGCTGT CGGGCTGGCG 
CTGGCGGCCG CGCTGCTGCC GCCGATCGGC GCGACCGCCC CGGCGCACGC CGAGGACAAC
GGCGTCGGGC TCACCCCGGC GCTGGGCTGG TCCAGCTGGA GCTCGGTCCG CACGCACCCG
ACCGCCGCGA AGATCGACGC CCAGGCCGAC GCCATGAAGT CCTCCGGGCT GGCCGCGGCC
GGCTTCCAGT ACGTCAACGT GGACGACTTC TGGTACCAGT GCCCGGGAAG CCAGGGCCCG
AACGTCGACG CCAACGGCCG CTGGGTCACC GACGCCACGA AGTTCCCGGC GTCCGGCAGC
ACCAACGGGA TCCAGGCCGC CGCGAACCAT GTGCACGCCG ACGGCTTGAA GTTCGGCCTC
TACGTCACCC CCGGCGTCTC CCAGCAGGCC GTCTCGCAGA ACAGCGCGAT CCTCGGCACG
TCGTACCACG TCAAGGACAT CGCGACCACG ACGGCGGAGA AGAACTACAA CTGCAAGGGC
ATGGTCGGCA TCGACTACTC CAAGCCCGGC GCGCAGGCGT TCATCAACTC CTGGGCTGAC
GAGTTCGCGT CCTGGGGCGT GGACTACGTG AAGATCGACG GTGTCGGAAC CTCCGACGTC
CCCGACCTCC AGGCGTGGTC CAAAGCGCTG GTGCAGACCG GGCGCCCGAT CCACCTGGAG
CTGTCCAACA ACCTCGCGAT CGGCAGCGCC TCAACCTGGC AGCAGTACTC CAACGGCTGG
CGCACCGGCG GCGACATCGA GTGCTACTCC AAGTGCACCA CGGCCGGGAC GCTCACCGAC
TGGAGCCACG TGCAGAGCCG GTTCGGCCAG GTCGCCTCGT GGCAGCCGTA CGGCGGGCCC
GGAGCGTTCA ACGACTACGA CTCGCTGGAG GTCGGCAACG GCTCCGCCAC CGGCCTCACC
GACGCCGAGC AGCAGTCGCA GATGAGCCTG TGGGCGCTGG CCGCGAGCCC GCTGATCCTG
GGCACCGATC TGACGAATCT GAACTCCACC GAGCTCGGCT ACCTGAAGAA CGGTCGCGTC
CTGGCTGTCG ACCAGGACGC GATCGATGCC TCGCGCATCG TGAACAACGG CAACCAGCAG
GTCTACATCA AGAAGGAGAA GGACGGCAGC GTCATCATCG GCCTGTTCAA CTACAGCGGC
ACGGGATCGA CGACGGTCAG CGTCCCGCTG TCGGCCGCTG GTATCTCCGG CAGCGCCACC
GCGACAGACC TGTGGAGCGG CGCCTCGGTC GGCACCATCA GCGGCACATA CAGCGTGACG
CTCGGCGCCG GCGCGGTGAA GCTGATCAAG GTGGCTCCCT CAGGATCCGG CGGGAGCACC
ACGATCGAAG CCGAGGCAGC CGGCAACACC ATCGCCGGCG CCGCCAAGAT CGCCACCTGC
GCGGCATGCT CCGGCGGCCA CAAGGTCGGC TATATCGGCA AGGGTGCGGC GAACGCCGTG
ACCATCAACG GAATCACCGA GTCCGCAGCC GGAACCCACA CGCTGACCAT CTCCTATCTG
GTCAGCGGTA CCCGGAGCTT CTCCATCAGC GTCAACGGCG GGCCGGACAT CGTCGAGCAG
CTGACCGGGA CCAGTTTCGC CACGCCGGCG ACCACGAGTG TGGCGGTGCA GCTCGCTGCC
GGAACCAACA CGATCAAGTT CGACAACGAC ACGGCGTACG CGCCGGATCT GGACGCGATC
ACGGTGAGCT GA
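
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction helper; the function names are hypothetical, and the example runs only on the last printed line of the sequence rather than on the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Last printed line of the gene sequence above.
tail = clean_sequence("ACGGTGAGCT GA")
print(tail)       # ACGGTGAGCTGA
print(len(tail))  # 12
print(tail[-3:])  # TGA, a stop codon
```

Applying `clean_sequence` to all printed lines and passing the result to `gc_fraction` gives a value that can be compared with the GC content field.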
 
Protein sequence
MRRRPPGIRT RTWTAAVGLA LAAALLPPIG ATAPAHAEDN GVGLTPALGW SSWSSVRTHP 
TAAKIDAQAD AMKSSGLAAA GFQYVNVDDF WYQCPGSQGP NVDANGRWVT DATKFPASGS
TNGIQAAANH VHADGLKFGL YVTPGVSQQA VSQNSAILGT SYHVKDIATT TAEKNYNCKG
MVGIDYSKPG AQAFINSWAD EFASWGVDYV KIDGVGTSDV PDLQAWSKAL VQTGRPIHLE
LSNNLAIGSA STWQQYSNGW RTGGDIECYS KCTTAGTLTD WSHVQSRFGQ VASWQPYGGP
GAFNDYDSLE VGNGSATGLT DAEQQSQMSL WALAASPLIL GTDLTNLNST ELGYLKNGRV
LAVDQDAIDA SRIVNNGNQQ VYIKKEKDGS VIIGLFNYSG TGSTTVSVPL SAAGISGSAT
ATDLWSGASV GTISGTYSVT LGAGAVKLIK VAPSGSGGST TIEAEAAGNT IAGAAKIATC
AACSGGHKVG YIGKGAANAV TINGITESAA GTHTLTISYL VSGTRSFSIS VNGGPDIVEQ
LTGTSFATPA TTSVAVQLAA GTNTIKFDND TAYAPDLDAI TVS
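
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last codons of the gene are translated here.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAGGCGAC GCCCGCCAGG GATCAGGACC"))  # MRRRPPGIRT
print(translate("ACGGTGAGCT GA"))                     # TVS
```

Both results agree with the start (MRRRPPGIRT) and end (TVS) of the protein sequence listed above.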