Gene Caci_4995 details

Gene Information

Locus tag: Caci_4995
Symbol:
ID: 8336349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5714971
End bp: 5718021
Gene Length: 3051 bp
Protein Length: 1016 aa
Translation table: 11
GC content: 69%
IMG OID: 644958094
Product: Ricin B lectin
Protein accession: YP_003115696
Protein GI: 256394132
COG category:
COG ID:
TIGRFAM ID:
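
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are hypothetical.

```python
# Values copied from the Gene Information table of this record.
START_BP = 5714971
END_BP = 5718021
GENE_LENGTH_BP = 3051
PROTEIN_LENGTH_AA = 1016


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```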


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.214567
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0224239
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCACGC CCCGTGCTCC GTTCCGCCGG CTCGCCGCGC TCGCCGCGGT GGTCCTGGCC 
GCCGGAACCG TCAGTCTTAC CACCAGCTCG CCACCGGCCG CCGCGGCGGT GTCGTTCACC
ACGACGGTGG TGAACCAGGC GTCCGGCGAC TGCGCCGACG ATCCGAACTC GGCCACCACC
ACCGGAACGC AGCTGATCCA GTACTCCTGC AACTCCGGCA CGAACCAGAA CTGGACGTTC
ACCCCGGTCT CCGGCACCGG CGCCACCTAC ACCGTCGCCA ACGGCGCCTC GGGACTGTGC
GTCGACGTCT CCGGACGCTC CACCGCCGAC AACGCCCAGA TCATCCAGTG GACCTGCAAC
GGCCAGACGA ACCAGCAGTT CACCCTCCAG CCGGTCGGCA ACGGCTTCAC CCTGGCCGCC
GTCCACTCCG GCAAGTGCGT CGCCCCCACC GGGGACACCA CCGCCAACGA CACCGTCCTG
GTCCAGTTGC CCTGCACCAC CGCCGGCACC CGGGTCTGGA ACCTGCCCGG CTACACCAGC
GGCGGAAGCG GCAGCAACAC CGTGACCGTC GCCAATCCCG GGACGCAGAC CTCGAAGGTC
GGCGCGGTGG TCTCCCTGCA GATGGCCGGG AGCGACTCGG CATCGGGCCA GACGCTGACC
TACTCCGCGA CCGGTCTGCC GGCCGGGCTG TCGATCAGCG CGTCGGGTCT GATCTCCGGC
ACCGCGAGCA CTGCCGGCAG CTCGACCGTC GCGGTCACCG CGAAGGACTC CACCGGCGCG
TCCGGCAGCA CCTCCTTCGG CTGGACGGTG AACACCGCCG GCGCGGCGTA TCAGGGCGTT
CCCGCGGTCG CGCCCAACGC GTGCGGGAAC TCCTCGCTGC CCAACGCCTA CGGCACCAAC
TTCCCAACGC CCAGCGACCC CTACGGCCAG GGCTACTCCA ACCAGACCGC GCTGGGCTGG
GACGGCAACT ACTGGCCGGT GTTCCAGTAC CTCTCAGGGT CCTTCTTCGC ACGCGGCGTG
CCGACGACGT ACAACGCCAA CGGGACCACG GTCTGCGGCG CCATGTACTC CTTCTCGATC
TACAACCACA ACGGCAACCG TCCGGCGCAG TCGGTGACCT GGACCCAGGA GTCCGGATAC
CTGCCGGCGA TGACGACCTC GTTCAGCACC GGCTCGATGG CCGTGGCCAT CAAGGAGTTC
GAGGACAAGG CCACCATCGG CGGCCACCCG GTCGGCGTGG TGTACGCGCG GGTGTCGGTG
ACGAACAACG GGTCCGCAGC GCTCACCCAG GACCCCGGCG CGTCCGGGCC CAACCTGGTG
CGGCTGACGA GCACCTCGCT GAACACGGTC CAGCCCGGGG CGACCAGCAA CCACGACTAC
ATCGTCGCCG TCGACAACTT CGGCTCCGGC GCGGCGCTTC CCACCGGGAG CACGCTGTCG
TCCGGCGCCC CGGACTTCAC CACCGCCGAC AGCCAGATGG CGGCCTACTG GAACGGCCGG
ATCGGCGAGA CCGCGTCCTT CTCGCTGCCC GACGTCGCGC TCCCGAACAC CGGCGGTCTG
GCCAACCCCG GGACCGCGAT GACCAACGCC TACAAGGCCG GCACCGTCTA CATCCTGATG
ATGCAGATCG GCGAGGCGCA GTTCTCCGCG GCGAACAACT ACTTCTGGCT GCTGAACCAC
GACGTCCCCG GCGAGCTCAA CGCCCGGCTG GAGACCGGTG ACTTCCACGA CGCGCAGAAC
CTGCTGCTGA CCGCGCGGAT ATCGCAGGCC ACGAACTTCG ACGAGCACGG CGCGAACTGG
TACTTCGACG GCGACTGGAA GACGCCCTCG ACCTGGGCCT ACTACCTGGC CAAGACCAAC
GACACGGCGT TCGTCTCGCA GTACTTCCAC GACGACGCCG GCGGCTCCAG CCCGTGGGGT
CCGAGCCTGT ACACGATCAT GCACGGCATC TACCAGGGCC AGCTCGCCTC CGACGGGGCG
CTGGCGACCA GCTTCGACAA CGATTCCTCG GGCCGCTGGC TGTTCGACGA CTACTCGGCG
CTGGAGGGCC TGGCGGCGTA CAAGTACATT GCGACGCGCA TCGGCAACAC GGCAGAGGCG
CAGTACGCCG ACGGCGCGTA CACCAAGCTG CTGAACGCGA CGAACACCCT GCTGTCCCGC
AACGAGTCGG CCAACGGGTT CTCCTACCTG CCGTGCACGG TCGACCAGCC GAATTCGGCC
AACCGCTGCA ACACCTACAA CGACGCCAAC TGGGCCTCGC CGGTGTGGGT CGGACAGAAT
CAGTGGTCGA CGATGCTCAT GGGCGGCACG CTGTCCGGTA TCGCGGGCGA CCCGGCGCAG
GGCGACGCGA TGTACAAGTG GGGCTTCGCG CGTCTGTCCG CGAACGGTCT GCCGTATCCG
ACCTTCGGCG CGTTCAACGG CTACTCGACC GCGTACAACA CCGCCTACGC CAGCGACGGC
TTGTACGGCA CGTCCTACCG CGACCTGCCG ATCACCAGCT ACGCCTGGCA GATCGCGACC
ACCACCGGCG GCCCGAACGC CTGGTGGGAG GCCAACGGTA CGGGCCCGGA CAGCGGCAAC
CCCTGGATCG GCAACCATGC CGGCCCGGAG TTCGGCGCCT GCCCGTACGC CTGGCCGATC
TCCGCTCAGC AGGAGGGACT GCTGCAGTCG ATCGCGGCCG AGGGCTTGTC CTCGACCGGT
TCTGGCCCGT ACACCTACAC GAGGCCGCTG ATCATCGGCC GCGGCGTCCC GAACGCCTGG
ATCGCCAACG GCCAGAAGGT TTCGGCGTCG AACCTGACCA CCTCCTATGA CACGAACAGC
GGTGCCCGGA GCACGTACGG CGTGTCGATC TCGACCTCGA CGTCCAGCGG CACGCGCGTG
GTCACGGTGA CGCTGTCCGG AACGCTGCCG GCCGGCGCGG TGTCGGTGCA GCTGCCGGTC
TTCACCTCGG TGGGGGTCAC GGCGGTCAGC GGCGGGTCGT ACAACTCCTC GACGCACTCG
GTGAGTGTGA ACTCCGGAAC GACCACAGTG GCCATCACCC TGGCCGGCTG A
 
Protein sequence
MRTPRAPFRR LAALAAVVLA AGTVSLTTSS PPAAAAVSFT TTVVNQASGD CADDPNSATT 
TGTQLIQYSC NSGTNQNWTF TPVSGTGATY TVANGASGLC VDVSGRSTAD NAQIIQWTCN
GQTNQQFTLQ PVGNGFTLAA VHSGKCVAPT GDTTANDTVL VQLPCTTAGT RVWNLPGYTS
GGSGSNTVTV ANPGTQTSKV GAVVSLQMAG SDSASGQTLT YSATGLPAGL SISASGLISG
TASTAGSSTV AVTAKDSTGA SGSTSFGWTV NTAGAAYQGV PAVAPNACGN SSLPNAYGTN
FPTPSDPYGQ GYSNQTALGW DGNYWPVFQY LSGSFFARGV PTTYNANGTT VCGAMYSFSI
YNHNGNRPAQ SVTWTQESGY LPAMTTSFST GSMAVAIKEF EDKATIGGHP VGVVYARVSV
TNNGSAALTQ DPGASGPNLV RLTSTSLNTV QPGATSNHDY IVAVDNFGSG AALPTGSTLS
SGAPDFTTAD SQMAAYWNGR IGETASFSLP DVALPNTGGL ANPGTAMTNA YKAGTVYILM
MQIGEAQFSA ANNYFWLLNH DVPGELNARL ETGDFHDAQN LLLTARISQA TNFDEHGANW
YFDGDWKTPS TWAYYLAKTN DTAFVSQYFH DDAGGSSPWG PSLYTIMHGI YQGQLASDGA
LATSFDNDSS GRWLFDDYSA LEGLAAYKYI ATRIGNTAEA QYADGAYTKL LNATNTLLSR
NESANGFSYL PCTVDQPNSA NRCNTYNDAN WASPVWVGQN QWSTMLMGGT LSGIAGDPAQ
GDAMYKWGFA RLSANGLPYP TFGAFNGYST AYNTAYASDG LYGTSYRDLP ITSYAWQIAT
TTGGPNAWWE ANGTGPDSGN PWIGNHAGPE FGACPYAWPI SAQQEGLLQS IAAEGLSSTG
SGPYTYTRPL IIGRGVPNAW IANGQKVSAS NLTTSYDTNS GARSTYGVSI STSTSSGTRV
VTVTLSGTLP AGAVSVQLPV FTSVGVTAVS GGSYNSSTHS VSVNSGTTTV AITLAG
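
The sequence blocks above are printed in groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not a reference implementation, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It builds the codon table by hand and assumes the amino acid assignments of translation table 11 match those of the standard genetic code (the two tables differ in their permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence, copied from this record.
first_row = "ATGCGCACGC CCCGTGCTCC GTTCCGCCGG CTCGCCGCGC TCGCCGCGGT GGTCCTGGCC"
print(translate(first_row))  # MRTPRAPFRRLAALAAVVLA
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the Gene Information table.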