Gene Caci_2552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2552 
Symbol 
ID8333901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2887373 
End bp2889478 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content72% 
IMG OID644955705 
Productglycoside hydrolase clan GH-D 
Protein accessionYP_003113311 
Protein GI256391747 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTGTTC CCCCGCAACC CGTCGTCCAC GACCCCGCAC GCGGCCTGTG GCTGCTGAGC 
ACCCCCGGCT CGCGCTACCT GCTGCGTGAA GATCCCGACG GCAGTCCCCG GCACGTCGCG
TGGGGATCGC CGGAGGCGGT GCACGCCTGC GCGCCGACGG CGCCGGCCAG CAGCTTCGAC
GGCGACGGCG CGGCCGACGA GCTCGGCATC GAGACCGGTG CCCGGTTCGG CCCGGCGGGC
TTGCAGGTCC GGTTCGCCGA CGGCACGCGC GGCGCGCAGT GGACCGGCGC CGGGCACGAG
ATCGACGGCG GGCATCTGGT GATCCGCCTG CGGGACCGGC GTTATCCGCT GCGTGCCGAA
CTGCACTATC GGGTGCGCCC CGACACCGAC GTCATCGAAC GCTGGACGGT CCTGGCCAAC
GACGGCGAGG CGCCGATCAC CGTCGGCCGC CTGGACTCGG CGGCATGGAC GATCCCGCAC
CTCACCGACT ACCGGATGTC GCATCTGGTC GGCGGCTGGA ACCACGAGTT CCAGCTGCGC
CGCACGCAGG TCCCGGTCGC CGAGACGGTC TTCACCAGCC GCCGCGGACT CACCAGCCAC
CACGCGAACC CCTGGCTCGC CGTCGACGAC GGAACCGCCG AGGAGGACCG CGGATCGGTG
TGGAGCACCG CGCTCGCCTG GAGCGGAAGC TGGCGCGTCA CCGTGCACCG CGATCCGGCG
GACCGGGTCA CCTGGACCGG CGGCTTCGGC CACGAGGGCA TCACCTGGAC CTTGGGTCCG
GATCAGAGCC TTGAGACGCC GGTCTTCGCC GGGCTGCACA CCGTCGGCGG CTTCGGCGGC
GCCGCCCGGG CTTGGCACGA CTACCTGCGG CGCTATGTCA TTCCGGCGCC GGCCGAGGAC
CGGCCGGTGG TCTACAACTC CTGGGAGGCG ACCGGCTTCG CGGTGGACGA GGCCGGGCAA
CTGCGGCTGG CCGAGACCGC CGCGCAGCTC GGCGTCGAGC TGTTCGTCCT CGACGACGGC
TGGTTCGGCG GCCGCCGCGA CGACACCGCC GGGCTCGGCG ACTGGCGCCC CTACCCCGGC
GCCTTCCCGC ACGGGCTCGG GCCGCTGGTG CAGAAGGTGC ACCAGCTCGG CATGCGCTTC
GGGCTCTGGG TCGAGCCGGA GATGGTCAAC GCCGACAGCG ACTTGTTCCG CGAGTACCCC
GACTGGGTGG TGCACACGCC GCAGCGCGAC GCGACGGAGC TGCGGCAGCA GCTCATGCTC
AACTACGGCC GCGAGGACGT GGCGCAGTGG GCGCACCAGT GGCTCGACCA GCTGGTGCGC
GAGCATGGCA TCGACTTCTT GAAGTGGGAT GCGAACCGGG CGGTCACTGA CGCCGGCTGG
CCGGGGCACC CTGACCCTGA CCGGCTGTGG ATCGACCACA CTCGAGCCGT CTACCGGATC
ATGGACCGGC TGCGCGCCGA CCATCCGCAG CTGCGGATCG AGGCCTGCGC CGGGGGTGGC
GGGCGTGCCG ACATCGGCGT TCTGGCGCGC ACGGACCAGG TCTGGACGTC GGACAACACT
GATCCGGTGG ACCGGCTCGC GATTCAGAAC GGCTTCAGCA TGCTCTTCCC CGCCGAGGTC
ATGGCGGCGT GGGTCACTGA CAGCCCGAAC ATCGCGACGG GTCGTTCGAC GCCGCTGCGT
TTCCGGTTCC ACGTCTCCAT GGCCGGCGCG CTCGGCATCG GGGGGAAGCT GACCGAGTGG
ACGCGCGAGG AGCTGGCCGA GGCGGCGGAG CTCGTCGCGG TGTACAAGCG GGTTCGCGGG
GTCGTGCAGC ACGGTGTGCT GTATCGGCCG GCGACCGATG GGCACACCGC GGCGGTGCAC
TACGCGTCGG AGGGAGGCGA GGGAGGCGAG GGAGGCGATG AGCATGTGGT CATCGCTTGG
CGGGCTGCGA CAGCGGTCGG ACTACCGGGT CCGCTGGTAC GTCTGACGGC GCTGGATCCC
GACGCCGAGT ACTACGACGT CGATCGGCAG GTGCGGATCA CCGGCGCCGC GGCGCGGGCG
GGGCTCCGTC TGGATCTGCC GCGCGGGGAC TATGCCAGCG CTCTGTATCA CCTGCGGCGG
GTGTAG
 
Protein sequence
MRVPPQPVVH DPARGLWLLS TPGSRYLLRE DPDGSPRHVA WGSPEAVHAC APTAPASSFD 
GDGAADELGI ETGARFGPAG LQVRFADGTR GAQWTGAGHE IDGGHLVIRL RDRRYPLRAE
LHYRVRPDTD VIERWTVLAN DGEAPITVGR LDSAAWTIPH LTDYRMSHLV GGWNHEFQLR
RTQVPVAETV FTSRRGLTSH HANPWLAVDD GTAEEDRGSV WSTALAWSGS WRVTVHRDPA
DRVTWTGGFG HEGITWTLGP DQSLETPVFA GLHTVGGFGG AARAWHDYLR RYVIPAPAED
RPVVYNSWEA TGFAVDEAGQ LRLAETAAQL GVELFVLDDG WFGGRRDDTA GLGDWRPYPG
AFPHGLGPLV QKVHQLGMRF GLWVEPEMVN ADSDLFREYP DWVVHTPQRD ATELRQQLML
NYGREDVAQW AHQWLDQLVR EHGIDFLKWD ANRAVTDAGW PGHPDPDRLW IDHTRAVYRI
MDRLRADHPQ LRIEACAGGG GRADIGVLAR TDQVWTSDNT DPVDRLAIQN GFSMLFPAEV
MAAWVTDSPN IATGRSTPLR FRFHVSMAGA LGIGGKLTEW TREELAEAAE LVAVYKRVRG
VVQHGVLYRP ATDGHTAAVH YASEGGEGGE GGDEHVVIAW RAATAVGLPG PLVRLTALDP
DAEYYDVDRQ VRITGAAARA GLRLDLPRGD YASALYHLRR V