Gene Caci_4926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4926 
Symbol 
ID8336280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5616868 
End bp5619114 
Gene Length2247 bp 
Protein Length748 aa 
Translation table11 
GC content69% 
IMG OID644958025 
ProductRicin B lectin 
Protein accessionYP_003115627 
Protein GI256394063 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.164927 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.377281 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGGC TCGCCGCGGC GGCCGCGGTG ATGCTGGCGT TGGCCGTCCC GTCGTTCTCG 
CCGGCCCGCG GGCAGGCCGC CGCGGGTGAG GCGCTGAGCG TGAATTTGGC GAGCCCGACG
CACTCGGCCA CCGGGGTGGG GGAGGGCTTC CTGTACGGCC TGACCGTCGA CGGCACACAG
CCCGCCGACC AGTACCTGGA ACCGCTCGGC GTCACGGCCC ACCGGGGCGG CGGATGGTAT
TCCGGCGGCT GGATCAAGGA CGGCTACCAG TACGGCTCGG CCTCGCAGGC CGACGTGGCC
TCGATCATCG CCCAGGCGCG CCGGCTGACC CAGCCGCCGT ATCACGCGCA GTACCAGGTG
CTCATGACCG ACATCTACGG TCTGAACGGC GGCCAGCCCT CGAACACGCG CTACCCCTGC
GACAACGGCG ACTGCTCGAA CTGGGCCTCG TTCATCGACT CCACGGTCGG CGCGCTGCAG
GCCTCGGGAG TGAAGTTCGC CTTCGACATC GACAACGAGC CGGACATCTC GGTCTTCTGG
ACCCGGGGTG TGAACAGCAC GCAGTACTTC CAGATGTGGG ACACCGCCTA TCGGGAGATC
CGGCGGGTCG CCCCGAACGC GCAGATCGTC GGACCGTCCT TCGCGTACAC CCCGCAGAGC
CGGGCGAGCC AGTGGCAGAC GTGGCTGGCC CACGTGAAGG CGGCCGGAAC CGTCCCGGAC
ATGATCACCA ACCACGACGA GGGCGACGTC GACGACCCGG TCGCGGTGTC CCAGGCGCTC
AACAGCGTGG TATCGGCGGC GGGGCTGAGC CCGATCCCGC TGTCGGCTAA CGAGTACCAG
CCGGCCGACC GGCAGACCGC CGGCGTGACC GCGTGGTACC TGGCGCGCTT CGCCCAATCC
GGGTACACCA ACGCGATGCG CGGCAACTGG CAGTGCTGTA TCTCGCCGAA CCTGACCGGG
GTCATCGATC CGAACGCCAG CGGCGGTGCG GCCTACACCG GCAACTGGTG GGCGATGCGG
GCCTACGCGG ACCTGACGGG TTCGCTGGTC TCCACCTCCG GCCAGGTGGG CTCGACGGCG
ATCTCCGCGG CGGAGGACAG CAGCCGTCGG CGCGCGGTGG CCGTGGTCGG TGATGCCAAC
GGCTATACCG GCGCGGCGTC AGTGGCCTTC ACCGGCTTCT CATCGGTGCC CTGGCTGGCG
AACAACGGCA CCGCCGAAGC GGTCGTCTAT CGGATCCCGG ACCAGTCCGG GCTCACCGCA
CCGCAGGTGG TCTCCGATCA GATCGTGAAC GTCTCCGGCG GATCGGTGAC GCTGCCGCTC
AATTTCCAGG CCGCGCACGA CGCCTTCGCG GTCTACCTGC TGCCCGGCTA CGCGCAGGGC
TTCACCAGCA GCGTGGTCAA CCAGGGCGAC AACCTGTGCA TGGAGAACCC CGCCTTCACC
ACCGTGGCTT CGGCACAGTT CGACCAGGGA GCGTGCGGGG CGGGCGCCGA CCAGCAGTTC
CAGTTCGTCC CGACCTCGGC CGGCAGCAGC ACGTACTTCG TCCGTCCGAT GACGCCCGGT
GATTGTGTGG GAGTCGCGGG CAGTTCGACG TCCGCGGGCG CGGCGGTGAC GCAGAATCCG
TGCGGCTATG GCACCGACCA ACAGTTCACC CTCCGCTCCG TGGCCACCGG CGTCTACCAG
GTGGTCAACC AGAACTCAGG GCTCTGCGTC GTCCCCAACG GCGGCGGCAC GGCGTCGGGG
ACCGGGCTGG TCCAAGCCGC TTGCAGCACC GCGGGCTCGG GGGAGTGGAA GATCCAGCAG
GGCCAGACGA CCGGCTTTCC CAGCGGCTAC CACCAGTTCG TCATCGGGAG CAACAGCCTG
TGCCTGGACG TCTACGGTGC CAGCGGCGCC GGCGGCGCTG CGATCGATCA GTGGACCTGC
AACAGCCAGA CCAACCAGCA GTTCCAGTTC GTGCCCGTCT CCGGCGGGTA CGGCGAGCTT
CAGGCTCGGA ACTCCGGTGA CGACGTGACG GTGTCCGGCG GCTCCACCAC CGCGGGACAG
CCGGACATCG TGCAGCAGAG CCCGAACGGC GCGGCGAGCA GTCTGTGGCT TCCGGTCCGG
CAGTCCGACG GCGGCTATGC GTTCCAGAAC CAGGGCAGCG GCCTGTGCCT GGACGTCTAC
GGCGCCAGCA GCAGCCTGGG CCAGCAACTC GACCAGTGGC AGTGCAAGAA CGCCTCGGGA
AGCAACCAGG ACTTCACCGT CCGCTGA
 
Protein sequence
MRRLAAAAAV MLALAVPSFS PARGQAAAGE ALSVNLASPT HSATGVGEGF LYGLTVDGTQ 
PADQYLEPLG VTAHRGGGWY SGGWIKDGYQ YGSASQADVA SIIAQARRLT QPPYHAQYQV
LMTDIYGLNG GQPSNTRYPC DNGDCSNWAS FIDSTVGALQ ASGVKFAFDI DNEPDISVFW
TRGVNSTQYF QMWDTAYREI RRVAPNAQIV GPSFAYTPQS RASQWQTWLA HVKAAGTVPD
MITNHDEGDV DDPVAVSQAL NSVVSAAGLS PIPLSANEYQ PADRQTAGVT AWYLARFAQS
GYTNAMRGNW QCCISPNLTG VIDPNASGGA AYTGNWWAMR AYADLTGSLV STSGQVGSTA
ISAAEDSSRR RAVAVVGDAN GYTGAASVAF TGFSSVPWLA NNGTAEAVVY RIPDQSGLTA
PQVVSDQIVN VSGGSVTLPL NFQAAHDAFA VYLLPGYAQG FTSSVVNQGD NLCMENPAFT
TVASAQFDQG ACGAGADQQF QFVPTSAGSS TYFVRPMTPG DCVGVAGSST SAGAAVTQNP
CGYGTDQQFT LRSVATGVYQ VVNQNSGLCV VPNGGGTASG TGLVQAACST AGSGEWKIQQ
GQTTGFPSGY HQFVIGSNSL CLDVYGASGA GGAAIDQWTC NSQTNQQFQF VPVSGGYGEL
QARNSGDDVT VSGGSTTAGQ PDIVQQSPNG AASSLWLPVR QSDGGYAFQN QGSGLCLDVY
GASSSLGQQL DQWQCKNASG SNQDFTVR