Gene Caci_7221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7221 
Symbol 
ID8338589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8390902 
End bp8392806 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content67% 
IMG OID644960302 
ProductRicin B lectin 
Protein accessionYP_003117891 
Protein GI256396327 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.243667 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAAT CGTTCCAAGG CTCCACCGCG AGACGCCGGA TCCGACTCGT GATCCGGGCG 
GTCTCGGCGG CGGCGCTCGC CATGTCCGGC TTCGCGCTCG CCGACGTCCC CGCGCACGCC
GCGAACGAGT CGGTCAACGT CTGGCTGACC AGCACCAACG ACTCCGCCGG CCGGAACGTC
ACCCGCGGTC TGCAACAGCA GGCGGCGGTC TCCTTCGCCT CCGGCTCCGG CAGCGGCGGC
CAGGTGGTCA CCGTCAACGA GAACACGCAC TACCAGCAGT TCACCGGGGC CGGCGCGTCG
TTCACGGACA CCGCGGCGTA TCTGATGAAC AGCAGCGGCG CGCTCAGCGC GTCGACTCGC
AATACCCTGA TGACCAACCT GTTCAGCCCC ACGTCCGGGA TCGGTCTGGA CTTCCTGCGC
AACCCGCTGG GCGCCTCGGA CCTCGCGCGC TACAGCTACA CCTTCGACGA CATGCCCGCC
GGGCAGACCG ACCCGAGCCT GGCGAAGTTC TCGATCGCCC ACGACCTGGT CGACGTGCTG
CCGCTGACCA AGCAGGCGCA GCAGCTCAAC CCGGGTCTGA AGGTCATGGC CTCGCCGTGG
ACCGCCCCAC CGTGGATGAA GGACAGCGGC GCGTACAGCC AGGGCTACCT CCAGTCGCAG
TACTACGCCG CCTACGCGCA GTACTTCGTG AAGTACATCC AGGCCTACCA AGCGCAGGGC
GTGCCGATCA ACTACGTGTC GGTCCAGAAC GAGCCCACCT GCTGCTCGGG GTATCCCTCG
ATGCAGTGGA ACGGATCGGG CCTGGACTAC TTCACGGCGA ACGACCTGCT TCCGGCGTTC
CACTCCGCGG GTCTGTCGAC GAAGGTCCTG GCGCTTGACT GGAACCCGGA CAGCTACGCC
TCGTACGGCG CCCCCACCGT CGACGACGCG ACCGTCCGCA ACGACCCGAA CTTCGGCGGC
ATCGCCTGGC ACGGGTACGA GGGCAGCGTC ACCACCCAGA CGGACATCCA CAACCAGTAC
CCGAACGTGG ACGCCTACGA CACCGAGCAC TCCGGCGGCA CCTGGATCGG CAACCAGCAG
CAGGAGGACA TGAACAACAT CATCGACTAC ACCCGCAACT GGGGTAAGTC GGTGGTGAAG
TGGTCCCTGG CGGTGGACCA GAACATGGGC CCGCACAACG GCGGCTGCGG CACTTGCACC
GGCCTGGTCA CGGTCCACAA CGGCGACTCG CGCTCCGGCC AGGTCGACTA CAACATCGAG
TACTACGACA TGGGCCAGCT CACCAAGTTC GTGAAGCCCG GCGCCTACCG CATCGACTCC
ACGGCGAACT CGAGCGTCCC GAACGTCGCC TGGCAGAACC CGGACGGGTC CAAGGCGCTG
GTCGCGTACA ACGAGTCCGG CAGCACCCAG ACGCTGACGG TGAACTGGGG CAACGAGCAC
TTCAGCTACT CCCTGCCGGC GCAGACCTCC GCGACGTTCA CCTGGAACGG CACGCAGGGC
ACCGGCGGCG GCACCGGCAC CCCGACCGGC CAGATCAGCG GCTACGGCGG CAAGTGCGTC
GACGTCGCGG GCGCCAACCC GGCGAACGGC ACCGCGGTCC AGCTCTACGA CTGCAACGGC
ACCGGTGCGC AGCAGTGGAC GGTCGCCTCG AACGGCTCGC TGCAGTCCCT CGGGAAGTGC
ATGGACGTGA CAAGCGCGGG GACGACGAAC GGAACGAAGG TGCAGCTCTA CGACTGCAAC
GGAACCGCGG CGCAGCACTG GACGCACCAA GCCAACGGAG AGTTGGTGAA CGCCGGCTCC
GGACGCTGCC TGGACGCCAC GGGCCCGAGC TCGGCGAACG GCACCCGGCT GCAGATCTGG
GACTGCACGG ACGCCGCGAA CCAACAATGG AACCTACCGT CGTGA
 
Protein sequence
MPKSFQGSTA RRRIRLVIRA VSAAALAMSG FALADVPAHA ANESVNVWLT STNDSAGRNV 
TRGLQQQAAV SFASGSGSGG QVVTVNENTH YQQFTGAGAS FTDTAAYLMN SSGALSASTR
NTLMTNLFSP TSGIGLDFLR NPLGASDLAR YSYTFDDMPA GQTDPSLAKF SIAHDLVDVL
PLTKQAQQLN PGLKVMASPW TAPPWMKDSG AYSQGYLQSQ YYAAYAQYFV KYIQAYQAQG
VPINYVSVQN EPTCCSGYPS MQWNGSGLDY FTANDLLPAF HSAGLSTKVL ALDWNPDSYA
SYGAPTVDDA TVRNDPNFGG IAWHGYEGSV TTQTDIHNQY PNVDAYDTEH SGGTWIGNQQ
QEDMNNIIDY TRNWGKSVVK WSLAVDQNMG PHNGGCGTCT GLVTVHNGDS RSGQVDYNIE
YYDMGQLTKF VKPGAYRIDS TANSSVPNVA WQNPDGSKAL VAYNESGSTQ TLTVNWGNEH
FSYSLPAQTS ATFTWNGTQG TGGGTGTPTG QISGYGGKCV DVAGANPANG TAVQLYDCNG
TGAQQWTVAS NGSLQSLGKC MDVTSAGTTN GTKVQLYDCN GTAAQHWTHQ ANGELVNAGS
GRCLDATGPS SANGTRLQIW DCTDAANQQW NLPS