Gene Caci_2637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2637 
Symbol 
ID8333986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3020694 
End bp3022091 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content69% 
IMG OID644955788 
ProductRicin B lectin 
Protein accessionYP_003113394 
Protein GI256391830 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.352697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTCC CCATCCGCGG CAAGCGCTTC GGAGTCTGGC TCGCCGTGAT TCCGACCGCG 
GCCGTGACCG TGGCGGCGGC CGGCGGCATA GCCATGGCCA CCGGGAGCGC GTCCGCGCAG
GCTGCGGCGT CGTACCCGGC CCACTACTCC GCGCCGTATC TCCAGCTCGA CGGTTCGGAC
TCCGGTGACA TGGTCGCCGA CATGAACGCC AGCGGCGACA AGTTCTACAC GCTGGCCTTC
CTGACGCCGA AGTCCGGCTG CACGGCGCAG TGGGAGGGCG GCGGCGAGGC GATGAACGCC
TTCACCTCGC AGGTCACCAC CCTCAAGAAC GACGGCGGCA ACGTCATCCT CTCCTTCGGC
GGGGAGCCGA ACGGCAACAC GCCGAACGAG ATCGCGCAGA CCTGTACCAG CGTCAGCTCG
CTGACGGCCG CGTACCTGAA CATCGTCAAC ACCTACGGCG TCAACCGGCT CGACTTCGAC
ATCGAGGGCA GCGTGCTGGC GGACACCGCG GCGACGAGCC GCCGGGACCA GGCGCTGGCC
GCGCTCCAGG CCGAGGACCC GGCCGTGCAG ATCGACTTCA CGCTCGCCGT CGATCCCGGC
GGTCTGCCCA CCGGCAACGC CTCGGAGTAC GCGCTGCTCC AGGACGCGAA GAACGCGAAG
GTCAAGGTCA GCGTCGTGAA CATCATGACG ATGGACTTCT ACGACGGGAA GTCCGTGCTC
TCCGACGCCG AGTCCGCGGC GAAGGCGACC GCGGGCCAGC TCGCCGGGCT CTACGGCGTC
TCGACCTCGG CCGCCTACGG CATGATGGGC CTGACCCCGA TCGCCGGCAC CAACGACGAC
GGCGCCCCCT TCAGCCAGGC CAACGCCTCC AGCCTGGAGT CCTTCGCGGC TTCCAACGGT
GTGCAGGAGC TGGCCTTCTG GGAGGTCGAC GGCTACGACA AGGGCACCGG CTACGCCTAC
TCCAAGATCT TCCAGAAGAT CGCGAGCGGC GGCACGACCC CGCCGCCCCC GACCGGTCAC
ACCGTCGTCA ACAACAACTC CGGGACCTGC CTGAGCGTGT CCGGCGCGTC GACCTCGCCC
GGCGCCACCG CTGACATCTA CACCTGCAAC AGCAGCCCGG GGCAGAGCTG GACGGTGAAC
AGCAACGGCA CGATCACCGG CAACGGCTCG GGCCTGTGCC TGAGCACCTC CGGGAACAAC
CCCGCCCTGA AGACCACCGC GGACATCAAC ACCTGCGACG GCGACGCCTA CGAGAAGTGG
ACCGTCTCCG GCGGCACGAT CGTCAACGGC GCCTCGGGCC TGTGCCTGAG CATCACCGGC
GGTGCCACCG CGAACTACTC CCTCGCCGAC CTGTACACCT GCAACGGCAG CGTCAGCGAG
AACTGGACCG TCGGCTGA
 
Protein sequence
MKVPIRGKRF GVWLAVIPTA AVTVAAAGGI AMATGSASAQ AAASYPAHYS APYLQLDGSD 
SGDMVADMNA SGDKFYTLAF LTPKSGCTAQ WEGGGEAMNA FTSQVTTLKN DGGNVILSFG
GEPNGNTPNE IAQTCTSVSS LTAAYLNIVN TYGVNRLDFD IEGSVLADTA ATSRRDQALA
ALQAEDPAVQ IDFTLAVDPG GLPTGNASEY ALLQDAKNAK VKVSVVNIMT MDFYDGKSVL
SDAESAAKAT AGQLAGLYGV STSAAYGMMG LTPIAGTNDD GAPFSQANAS SLESFAASNG
VQELAFWEVD GYDKGTGYAY SKIFQKIASG GTTPPPPTGH TVVNNNSGTC LSVSGASTSP
GATADIYTCN SSPGQSWTVN SNGTITGNGS GLCLSTSGNN PALKTTADIN TCDGDAYEKW
TVSGGTIVNG ASGLCLSITG GATANYSLAD LYTCNGSVSE NWTVG