Gene Caci_6867 details

Gene Information

Locus tag: Caci_6867
Symbol:
ID: 8338233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 7931630
End bp: 7933069
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 67%
IMG OID: 644959956
Product: Ricin B lectin
Protein accession: YP_003117547
Protein GI: 256395983
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4833] Predicted glycosyl hydrolase
TIGRFAM ID:
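
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Both are assumptions about this record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(7931630, 7933069)
print(length)                  # 1440
print(protein_length(length))  # 479
```

Under these assumptions the computed values agree with the Gene Length (1440 bp) and Protein Length (479 aa) fields.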


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.269811
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCGC GAACAGGAAA GACCCGAATT CTCGGCGCGG TCGTGACAGC CGCCGCCCTC 
GCCGTCTCGG TGCTGATCGG AGCAGCCGGC ACCGCTTCGG CGGCCTCGCC CGCCGCCATT
GGCGCCGCCG CGCTGATGAA GTCCTATGAC TCCACGACCG GCCAGATAGG CACGGGCTGG
TGGAACTCCG CGGTGGCCCT GAGCACGATC GAGACCTACC AGCAGACGAC GGGTGACAGC
TCGTACGCCT ACGCGATGTC CGGGGCGTTC GCCAAACACC AGTCCTCGAA CTTCGAGAAC
GAGTACATGG ACGACACCGG CTGGTGGGCC CTGGCCTGGG TCCAGGCCTA CGACATCACC
GGCAACTCCG CCTACCTGCA GATGGCCCGC ACCGATGCCG ACTACATCCA CGGCTATTGG
GATTCGACCT GCGGCGGCGG GGTCTGGTGG AGCAAGGCCA AGGGATACAA GAACGCCATC
CCCAACGAAC TCTTCCTCGA ACTGACCGCC GACCTCCACA ACCGCATCCC CGGCGACACC
CAGTACCTGG GCTGGGCGAA GCAGGAGTGG AGCTGGTTCA GCGGCAGCGG CATGATCAAC
AGCTCGCACC TGGTCAACGA CGGCCTCAGC AGCTCCTGCA AGAACAACAA CGGCATCGCC
TGGTCCTACA ACCAGGGCGT CGTACTGGGC GGCCTGGCGG CGCTGTCCCA GGCCACCGGG
GACACCAGTC TCCTCACCAC GGCCCGCCAG ATCGCCGACG CGGCGACGTC CAGCCTGTCG
CAGAACGGCG TCTTCACCGA GTCTTGCGAG CCGACGAACT GCAACCAGGA CCAGGTCTCC
TTCAAGGGCA TCTTCGTGCG CGGCTTGCGG ACCCTGGCCT CAGCCGCCGG CACCAGCGCC
TACGACGCGT GGTTCACCGC CCAGGCCGGC TCGATCGAGG CGCACGACAC CTCCGCCACG
GGGTTCGGCG TGTCCTGGGC CGGGCCGATC CGACAGCTGT CCTCCAGCTC CACGGCGAGC
GCCGAGGACG CACTCGTCGC GGCCCTGCCG GGAGCCGGAA CGCCGGCCGG CGCGATGAAA
TCGGGGATCG CCGGCAAGTG TCTGGACGAC CCCAAGGGAT CGTCGACGCC GGGAACGAAG
GCCCAGCTGT GGGACTGCAA CGGTGGATCG ACCCAGCAGT GGACGGTCGT GGGTCAGACG
CTGCGTGTTC AGGGCCTCTG CCTTGACATC ACCGGCGCCC GCACCGCCAA CGGAACGCTC
GTGGAGCTGT GGAGCTGCAA CGGCGGCGCC AACCAGAATT GGACGTCCGC CAACGGCGCT
GTGGCCAATC CCGCGACCGG CAAGTGCCTC GACGTTCCGC ACTCCAGCAC CACGAACGGC
ACTCAGCTCC AGATCTGGGA CTGCAACGGC GGCGCCAATC AGAAGTGGAT TCTGCCTTGA
 
Protein sequence
MKARTGKTRI LGAVVTAAAL AVSVLIGAAG TASAASPAAI GAAALMKSYD STTGQIGTGW 
WNSAVALSTI ETYQQTTGDS SYAYAMSGAF AKHQSSNFEN EYMDDTGWWA LAWVQAYDIT
GNSAYLQMAR TDADYIHGYW DSTCGGGVWW SKAKGYKNAI PNELFLELTA DLHNRIPGDT
QYLGWAKQEW SWFSGSGMIN SSHLVNDGLS SSCKNNNGIA WSYNQGVVLG GLAALSQATG
DTSLLTTARQ IADAATSSLS QNGVFTESCE PTNCNQDQVS FKGIFVRGLR TLASAAGTSA
YDAWFTAQAG SIEAHDTSAT GFGVSWAGPI RQLSSSSTAS AEDALVAALP GAGTPAGAMK
SGIAGKCLDD PKGSSTPGTK AQLWDCNGGS TQQWTVVGQT LRVQGLCLDI TGARTANGTL
VELWSCNGGA NQNWTSANGA VANPATGKCL DVPHSSTTNG TQLQIWDCNG GANQKWILP
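
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, not necessarily how the database derived the protein. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical, and whitespace in the pasted sequence is assumed to be layout only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAGCGC GAACAGGAAA GACCCGAATT"))  # MKARTGKTRI
```

Passing the full gene sequence to `translate` gives a string that can be compared with the protein sequence once the spaces are removed from both.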