Gene Caci_4641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4641 
Symbol 
ID8335995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5277274 
End bp5278998 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content70% 
IMG OID644957741 
ProductRicin B lectin 
Protein accessionYP_003115343 
Protein GI256393779 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0200277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.347937 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACGC CCCGCCCCAC GCCCCGCCGG TTCGCTGTCG CCGCCCTGGC CTGCACTCTG 
GCAGCTCTAC CCCTGACATC CAGCACCGCT CACGCGACGC CAACGCCACC GCCCTCAACA
GCCCACACGG CCACCGCTGC CAGTACTGCC GCCACCCTGG CCGCCACACC CCCGATGGGC
TGGAACGACT GGGCCCACTA CCAGTGCTCC GTCGACGAAT CCACGGTCGT CGCGAACGCC
AACGCGCTCG TCAGCAGCGG CCTGGCCGCC AAGGGCTACA AGACCGTCAC GGTCGACGAC
TGCTGGATGG CCTCCTCCCG CGACTCCGGC GGCACCCTGG TCGCCAACTC CACGAAGTTC
CCCCACGGCA TGGCCTGGCT CGGCAGCTAC CTGCACAGCA AAGGCCTGAA CTTCGGCATC
TACGAAGACG CCGGCTCCTC CACCTGCGGC GGCTATCCCG GCAGCGGCCA ACCCCAAGGC
GGCGGCGCGG ACCACTTCGC GCATGACGCC GCGACCTTCG CCTCCTGGGG TGTGGACTAC
CTGAAGCTCG ACGGCTGCAA CGTCTACATC CCGAGCGGGG AAAGCACCGA GCAGGCGTAC
CACAACGCCT ACACCGCCGA GTCGACCGCG CTGGCGAACG CCGGCCGCCC GATCGTGTTC
TCCGAGTCCG CGCCCGCGTA CTTCCAGAGC GGCGAGTGGG GCAATCCCAC CTGGTTCGAC
GTCCTGGGCT GGGTCGGCCA GCTCGGGCAG CTGTGGCGCG AAGGGTACGA CATCGCCACG
TACAACAGCG GCAACCCCAC CGCCAGCCGC TGGTCCTCGG TGATGTCCAA CTACGGCTAC
AACCGCTGGA TCGCCCGCTA CGCCCACCCC GGCAACTGGA ACGACCCCGA CTTCCTCATC
GCCGGCGACC CTGGCCTGAC CGCTGAGGAG TCCCGCAGCC AGGTCGCCTT GTGGGCGATG
ATGAACGCCC CGATGATCCT GTCCTCGGAC GTCGCCAACC TCAGCGCCGA CGGCCTGGCC
GCCCTGGGCA ACACCGACCT GATCGCCCTG GACCAGGACA GCGCGGGCCG CCAGGCAGGC
GTGGTCTCCA CCAACGGCAC CACCGACGTC CTGGCCAAGC CGCTGGCCAA CGGCGACCGC
GCCGTAGCGG TCCTGAACCG CGGCAGCGCG TCGCAGAACG TGTCGACCAC CCTGGCCTCG
ATCGGCCTGC CCAACTGCAC GGCCAGCGCC AAGAACCTGT GGACCGGCAC CACCACGACC
AGCAGCACCC TGACCGCGAC CATCCCCGCC CACGGCACCG CCATCTGGCG CCTGGCCCCC
TCCGCCGGCT GCGCAGCCGC AGTGCCGACC GGCGAGATCG TCGGCAACGG CGCCAAGTGC
GTGGACGTCA CCGGCAGCGG CACCGCCAAC GGCACGGCGG CGATCCTGTA CACCTGCACC
GGCAACGCGA ACCAGTCCTG GACGCGCCCC GGCAACAGCA GCATCCAGAC CCTCGGCAAG
TGCCTCACTG CCAACGGCAC CACGGCGGGC AGCACGGTCG TGATCTCCGC CTGCATCGGC
GCCAGTGCAC AGCAGTGGAC GGCGCAGGCT GACGGGACGG TCGCGAACGG CGCGTCCGGC
CTGTGCCTGG ACGTCTACGG CGGCGGCAGC GCCGACGGCA CGAAGCTGGA CACCTGGACG
TGCGGCAGCC ACCAGGCGAA CCAGACCTGG GCGATGCCGA GCTGA
 
Protein sequence
MSTPRPTPRR FAVAALACTL AALPLTSSTA HATPTPPPST AHTATAASTA ATLAATPPMG 
WNDWAHYQCS VDESTVVANA NALVSSGLAA KGYKTVTVDD CWMASSRDSG GTLVANSTKF
PHGMAWLGSY LHSKGLNFGI YEDAGSSTCG GYPGSGQPQG GGADHFAHDA ATFASWGVDY
LKLDGCNVYI PSGESTEQAY HNAYTAESTA LANAGRPIVF SESAPAYFQS GEWGNPTWFD
VLGWVGQLGQ LWREGYDIAT YNSGNPTASR WSSVMSNYGY NRWIARYAHP GNWNDPDFLI
AGDPGLTAEE SRSQVALWAM MNAPMILSSD VANLSADGLA ALGNTDLIAL DQDSAGRQAG
VVSTNGTTDV LAKPLANGDR AVAVLNRGSA SQNVSTTLAS IGLPNCTASA KNLWTGTTTT
SSTLTATIPA HGTAIWRLAP SAGCAAAVPT GEIVGNGAKC VDVTGSGTAN GTAAILYTCT
GNANQSWTRP GNSSIQTLGK CLTANGTTAG STVVISACIG ASAQQWTAQA DGTVANGASG
LCLDVYGGGS ADGTKLDTWT CGSHQANQTW AMPS