Gene Caci_6873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6873 
Symbol 
ID8338239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7940082 
End bp7941851 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content68% 
IMG OID644959962 
ProductRicin B lectin 
Protein accessionYP_003117553 
Protein GI256395989 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.783664 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCGC CGATCACTTT GCGGCCACCC GGCACCGGAG GCATCCGGCG CCGCGCTTGG 
TGGTCCCTGC TCGCGGTCAC CGCCCTGCTG GCCTCCCTGC TGTACGGGGC CGCGTTCTCC
ACCGCACCGT CGGCTCACGC GGAGTCCAAC GGCGCCGCGG TGTCACCCCA GTCGGGGTGG
AGCAGTTGGA GCTTCATCCG CAAGAACCCG ACCGAGGCGG CCATCGAGGC GCAGGCCGCC
GCGATGGCCT CCACCGGCCT GGTCAGCCAC GGTTTCTCCC TCATCAACAT CGACGACTTC
TACTACCTGA GCCCCGGGTC GCACGTCGAC TCCTACGGGC GCTGGGTGGT GGACACCGCC
AAGTTCCCGC ACGGTATGAA GGCGGTCGCG GACTACGTGC ACGGCAAGGG TGAGAAGTTC
GGCATGTACC TGACGCCGGG CATCCCGGTG GCGGCGTACA ACCAGAACAC CCCGATCCAG
GGCACGTCGT ACCACGCCCG CGACATCGTC TCCGACACCA CGGACTACGA GAAGAACTAC
AACTTCGCCA ACGGCGCGAT GTACTTCATC GACTACGCCA AGAACCCCGC CGCGGCGCAG
GCGTTCGTCA ACTCGTGGGC GAATCTGCTC GCCTCCTACG GCGTGGACTA CCTCAAGATG
GACGGCGTCG GCACGTCCGA CAAGGCCGAC GTGCAGCACT GGTCCCAGGC GCTCGCCCAG
TCCGGGCGCA CCATCCAGTT CGAGCTGTCC AACTCCCTGG ACATCAACCA GGGCCCGTTC
TGGAAGCAGT ACGCCAACGG CTGGCGGGTC GGCGGCGACG TGGAGTGCTA CTGCAGCACC
CTGACCAACT GGAGCAACGT CTCCAAGCGG TTCAACACCC TGCCGGCGTG GGTGCAGTAC
GACAGCCCCG GCGGCTGGGG CGACCCGGAC TCGGTCGAGG TCGGCAACGG CTCGCAGGAC
GGTCTGACCC CCGACGAGCG GCGCACCCAG CTCAGTCTGT GGGCCATCTC CGCGGCACCG
CTGACCCTGG GGACCGACCT GACCCACATG GACTCCGGCG ATCTGGCTCT GCTGACCGAC
GACGAGGTGC TCGGCGTCGA CCGTGCCGGA CACCCCGCGC ACCCGGTCGC GCAGGGCGCA
ACGCAGCAGA CCTGGTGGGC GTACAACGGC GACGGCACCT ACACCGTCGG CCTGTTCAAC
CTCGCCGGCT CGGCCGCCAA CGTCACCGCG AACTGGTCCG ACCTCGGCTT CACCGGCGGC
GCCGCGGTCC GCGACCTGTG GAGCCGCACC AACCTGGGCT CCTCGTCCGG GAAGTTCACC
GCCTCCGTGC CCGCCCACGG CTCCCGGCTG CTGCGCGTCA CCCCGGCCTC CGGGCACGGA
ACGATCGGGG CGCCCATGGT CAGCAAGCTC TCCGGGCGCT GCGCGGACTC CCCGCAGGGC
ACCACGTACA ACGCCGTCCA GGTGCAGGTC TGGACCTGCA ACGGCGGCCC CAACCAGAAC
GTCACCTACA ACTCCTCGTC GAAGACGCTG GTGCTGGAAG GCAAGTGCTT CGACGCGCAC
AACAACCAGA AGACCGCCGG GACGCACGTC GAGATCTACG ACTGCAACGG CGGCGCCAAC
CAGCAATGGA CCAAGAACAG CAACGGCACC ATCACCGGCG TCCAGTCCGG CCTGTGCCTC
GACGTCACCG GCGCCACCAA CCCCAACGGG TCGGGCCTGG AGCTGTGGAC GTGCAACGGC
GGCCAGAACC AGCAGTGGAC GCTCGGGTGA
 
Protein sequence
MNPPITLRPP GTGGIRRRAW WSLLAVTALL ASLLYGAAFS TAPSAHAESN GAAVSPQSGW 
SSWSFIRKNP TEAAIEAQAA AMASTGLVSH GFSLINIDDF YYLSPGSHVD SYGRWVVDTA
KFPHGMKAVA DYVHGKGEKF GMYLTPGIPV AAYNQNTPIQ GTSYHARDIV SDTTDYEKNY
NFANGAMYFI DYAKNPAAAQ AFVNSWANLL ASYGVDYLKM DGVGTSDKAD VQHWSQALAQ
SGRTIQFELS NSLDINQGPF WKQYANGWRV GGDVECYCST LTNWSNVSKR FNTLPAWVQY
DSPGGWGDPD SVEVGNGSQD GLTPDERRTQ LSLWAISAAP LTLGTDLTHM DSGDLALLTD
DEVLGVDRAG HPAHPVAQGA TQQTWWAYNG DGTYTVGLFN LAGSAANVTA NWSDLGFTGG
AAVRDLWSRT NLGSSSGKFT ASVPAHGSRL LRVTPASGHG TIGAPMVSKL SGRCADSPQG
TTYNAVQVQV WTCNGGPNQN VTYNSSSKTL VLEGKCFDAH NNQKTAGTHV EIYDCNGGAN
QQWTKNSNGT ITGVQSGLCL DVTGATNPNG SGLELWTCNG GQNQQWTLG