Gene Caci_2737 details


Gene Information

Locus tag: Caci_2737
Symbol:
ID: 8334086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3135667
End bp: 3139113
Gene Length: 3447 bp
Protein Length: 1148 aa
Translation table: 11
GC content: 68%
IMG OID: 644955886
Product: Ricin B lectin
Protein accession: YP_003113492
Protein GI: 256391928
COG category:
COG ID:
TIGRFAM ID:
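
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3135667
END_BP = 3139113
GENE_LENGTH_BP = 3447
PROTEIN_LENGTH_AA = 1148


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span from Start bp to End bp covers 3447 bp, which is 1149 codons, or 1148 amino acids plus a stop codon.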


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.285354
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.294912
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGCTC GCCATAGGTT GTCGCGGTTT ATCGCCGCGG CGGCCGTATT TTCTCTCGCC 
CCGCCTCTGG TGGTGGCGAT CGGTTCTGGG ACGCCCGCGC TCGCGGCGAC CACGACCACC
GCTTGGCAGA ATGGTTCCTT CGCACAGAAC GTGAGCGGGA TCGTCTCACG GTCCAATGTG
GTGATCGGTA AGGCGAACAC CGCGGCGACG CAGTTCCTGC CGCTGGGCAA CGGCTCGCTG
GGTGTGGCCG AGTGGGCGGC GAACGGATTC ACCGCGCAGC TCAACCGCAG CGACACGATG
CCGAACCGGC TCTCGCCGGG GCAGGTCGAC ATCCCCGGCC TGTCGGCGAT GACCTCGGCG
TCGAACTTCG TGGGCTACCT CGACGTGTAC AACGGGGTGC TGCACGAGTC TGGCGGCGGG
ATGAGCCTGA CCGCGTGGGT GCCGGCGGGC AAGGACGAAC TGGTCGTCGA CGTGACCGGT
GCGAATCCGG GGACGCGGCA GACAGCGAGC GTGAACCTGT GGAGCGGTCG CGGGCCGACC
GCGTCGGCGT CCGGGTCCAT CGCCAGCCTG GCGCAGACCT GGGTGGACAA TTCCCAGACC
GGATACTCGG GCAAGACCTT CGGCGCGATG GCCGCGATCA CCGCCGGCGG CTCCAACGTG
ACCGCCTCGA CGTCCGGCTC GACGCAGGCG CTGGTCTCGT TCAACCCGAA CAGCGACGGC
ACGTACCGCG TGATCGTCGC CTCGCCGTCA TGGGCCGGCG GCAACGCGAA CAGCACCGCG
TCCTCCCTGA TCGGCAGCGA CACCGGCGCG AGCGAGGCCT CGCTGCTGGC GACGCAGTCC
GCGTGGTGGA ACACCTACTG GGCCAACAGC GGCCTGATCG AGGCGAACTC CTCGGACGGC
ACCGCGCAGT ACATGGAGAA TCTGCACACG CTGTACCTGT ACTTCGAGGC CGGGACCATG
CACTCGGGCC AGTATCCGGG CAGCCAGGCG GGCCTGGCCG ACCTGTTCAA CTTCAACCAG
GACCACCAGG CCTGGTATCC GGCCGGGTAC TGGTTGTGGA ACCTGCGCGG CCAGATCCAG
GCGAACCTGG ACTCCGGCGA GTTCGCGCAG AACATTCCGA TCTTCGACAT GTATCTCAAC
GACCTGCCGG CGATCCAGTC GTGGACCGGC GCGCAGATGA ACGGCAAGCC GGGCGCGTGT
GTGCCGGAGA CGATGCGGTT CAACGGCAAC GGCTATTACT GGGGCGGCAG CATCACCAAC
GACGCCTCGT GCGCGGTGGC CTCCAGCCCT GGCTTCAACG CCGAGACGAT CACCAGCGGC
GCGGAGATCG CGCTGTGGGT GTGGCAGCAG TACCAGGACA CGGGGGATGT CAACTTCCTG
CAGAAGTACT ATCCGCTGCT GCAGCAGACC TCGACGTTCC TGCTCGCCTG GCAGTCGGTC
GGCTCGGACG GCTACCTGCA CGCGGTCGCG AACGCGCACG AGACGCAGTG GCAGGTGCAG
GACCCGACCA CCGACATCGC CGCCGACCAG GCGCTGTTCA CCGCCACGGT GAACGCCGCG
ACGCGGCTGA ACACGGACTC CTCGCTGGTC TCCCAGCTGC GTGGCGCGCT GACGCACATT
CAGCCCTACG CGCGGACTGA CGAGAACAGC CACAGCCAGT TGCTCGGCCC GTCCGCGGAC
TCCTCGGGCA CGGACGTGAT CGGCACCTCC TACCAGCCGA CCGCGGCGAC GCACAACGTG
GAGAACCTCG GCCTGGAACC GGTGTGGCCC TTCGGCGTGA TCAGCGACAA CACGGTCGTC
AACGGCGACA ACCTGACCGC ACTGGCGGAC CGCACCTACC AGCACCGGCA GAACGTCAAC
AACCCGGACT GGACCTACGA CTCGATCCAG GCCGCGCGCC TGGACATGTC CTCCGAGGTG
GCCAACGACC TGGTGGCCAG CACCAAGAGC TACCAGGTCT ACCCCTCCGG TCTCGCCGCG
TGGAACCCGG GTTCGGTGGA CGAGCCCTAC ATCGAGCAGA TCTCGAACGT CGCCGCGACG
CTGGACGAGG CGTTCGCGAC CGACTACGAC GGCACGGTGC GCTTCGCTCC GGCGTGGCCT
TCGGGCTGGG ACGGTTCGGG CCGCGTGTAC ATCCAGGGCG GCTCGAAGGT CGACGTCCAG
GTCGAGGGCG GCGTGCTGGC CACCGCCGCG ATCGAGGCCG GGTCGAGCGG GACCATGAGT
GTGCGCAATC CCTGGTCCGG CCAGCAGGCG CAGGTGGTCA ACGGCTCGAC CGGCGCCGTC
GTGGTCGCGG CCACCAATGC CGCGACGCTG AGCGTCCCGG TCACCGCCGG CCAGAGCTAC
CTGGTCGAGC AGCCGGCCAC CCCGACGACC TCGCTGCCGT TCGCCCAGGT GACCGGAACC
GCGGCGCGCG GGTTCCGGCA GTTGGGCAGC GTGAGCATCG GCCTGGGCGG CAACACCCTG
CCCGCCGGCA ACACGGTGAC CGTCACCAGC CCCGGCAGCC AGTCCGGCAC CGTGGGCACG
GCGATCAGTG CCCTGCAGAT CCACGCGACT GACTCCGCCT CGGGCCAGAC GCTGTCCTAC
AGCGCCGCCG GCTTGCCGCC TGGGCTGTCG ATCAGCTCTT CGGGTCTGGT CAGCGGGACG
CCGAGCGCGT CCGGGACCTT CACCGTCACC GTCACGGCGA CCGACTCCAC CGGCGCGTCC
GGAGCGGCGT CGTTCACCTG GACGGTCGGC GGCGGCAGCG GGAACGTGGT GTCGGTGACG
AATCCGGGCA GCCAGTCCGG GACAGTCGGT ACGGCGATCA GCGGTCTACA GATTCAGGGC
ACTGATTCGG CGGGCCAGAC GCTGACGTAC ACGGCCGGTG GTCTTCCCAC CGGGCTGTCG
ATCTCCTCGT CCGGTCTGAT CAGCGGGACG CCGAGCGCGT CCGGGACCTT CACCGTCACC
GTCACGGCGA CCGACTCCAC CGGCGCGTCC GGGGCGGCGT CGTTCACCTG GACGATCAGT
GGTGGCACCA CCGGATTCCC GGGTGGTTAT CACAGCCTGG TCGTGGCGAA GAGCAGCCTG
TGCCTGGACG TGTTCGGCAA CACCAGCACC GCCGGTGCGG CCATCGACCA GTACACCTGC
AACAGCCAGA GCAACCAGCA GTTCCAGTTC CTCCCGATCG CCAACGGCTA CGGTGAACTC
CAAGCCCAGA ACTCCGGGCA AGACGTGACC GTCGCCAACA GCTCCACCGC CCAAGGCACC
CCAGACATCG TCCAGCAGCC GGTCAACGGC GCTGCGGCAA GCCTGTGGCT GCCCCAGCAG
CAGTCCGACG GCTCCTGGCA GTTCAAGAAC CAGAACAGCG GACTGTGCCT GGACGTCTAC
GGCAACGGAA GCACCACCGG CCAGCAACTC GACCAATGGC CGTGCAAGAA CGCACCCGGA
ACCAACCAGG ACTTCAACCC CCGCTGA
 
Protein sequence
MSARHRLSRF IAAAAVFSLA PPLVVAIGSG TPALAATTTT AWQNGSFAQN VSGIVSRSNV 
VIGKANTAAT QFLPLGNGSL GVAEWAANGF TAQLNRSDTM PNRLSPGQVD IPGLSAMTSA
SNFVGYLDVY NGVLHESGGG MSLTAWVPAG KDELVVDVTG ANPGTRQTAS VNLWSGRGPT
ASASGSIASL AQTWVDNSQT GYSGKTFGAM AAITAGGSNV TASTSGSTQA LVSFNPNSDG
TYRVIVASPS WAGGNANSTA SSLIGSDTGA SEASLLATQS AWWNTYWANS GLIEANSSDG
TAQYMENLHT LYLYFEAGTM HSGQYPGSQA GLADLFNFNQ DHQAWYPAGY WLWNLRGQIQ
ANLDSGEFAQ NIPIFDMYLN DLPAIQSWTG AQMNGKPGAC VPETMRFNGN GYYWGGSITN
DASCAVASSP GFNAETITSG AEIALWVWQQ YQDTGDVNFL QKYYPLLQQT STFLLAWQSV
GSDGYLHAVA NAHETQWQVQ DPTTDIAADQ ALFTATVNAA TRLNTDSSLV SQLRGALTHI
QPYARTDENS HSQLLGPSAD SSGTDVIGTS YQPTAATHNV ENLGLEPVWP FGVISDNTVV
NGDNLTALAD RTYQHRQNVN NPDWTYDSIQ AARLDMSSEV ANDLVASTKS YQVYPSGLAA
WNPGSVDEPY IEQISNVAAT LDEAFATDYD GTVRFAPAWP SGWDGSGRVY IQGGSKVDVQ
VEGGVLATAA IEAGSSGTMS VRNPWSGQQA QVVNGSTGAV VVAATNAATL SVPVTAGQSY
LVEQPATPTT SLPFAQVTGT AARGFRQLGS VSIGLGGNTL PAGNTVTVTS PGSQSGTVGT
AISALQIHAT DSASGQTLSY SAAGLPPGLS ISSSGLVSGT PSASGTFTVT VTATDSTGAS
GAASFTWTVG GGSGNVVSVT NPGSQSGTVG TAISGLQIQG TDSAGQTLTY TAGGLPTGLS
ISSSGLISGT PSASGTFTVT VTATDSTGAS GAASFTWTIS GGTTGFPGGY HSLVVAKSSL
CLDVFGNTST AGAAIDQYTC NSQSNQQFQF LPIANGYGEL QAQNSGQDVT VANSSTAQGT
PDIVQQPVNG AAASLWLPQQ QSDGSWQFKN QNSGLCLDVY GNGSTTGQQL DQWPCKNAPG
TNQDFNPR
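
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon read as methionine. The following is a minimal sketch of such a translation, written without external libraries. The helper names `clean` and `translate_table11` are hypothetical, and the set of alternative start codons is an assumption based on the NCBI description of table 11.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, with bases in T, C, A, G order.
# Table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(sequence: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(sequence.split()).upper()


def translate_table11(dna: str) -> str:
    """Translate a CDS, stopping at the first in-frame stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative starts still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in this record.
    print(translate_table11("GTGTCCGCTC GCCATAGGTT GTCGCGGTTT"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields MSARHRLSRF, matching the first ten residues of the protein sequence listed above.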