Gene Caci_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3803 
Symbol 
ID8335156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4300030 
End bp4302861 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content70% 
IMG OID644956942 
ProductRicin B lectin 
Protein accessionYP_003114545 
Protein GI256392981 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.307116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTG CGTCACTCCC CGGTGATTCG TCCGGGCCCG GGTCGGGGGT GGATCGCCGC 
TCGCTGCTGA AGCTCGGCGG GGTGCTCGGC GCCTCCCTCG CGATCAGCGG GCTGCCGGTC
TTCACCTCGA AGGCGCGGGC CGTGACCCGG CCCGGCGCGC TGAGGCTGGT CCCGCCGGCC
ACCGCCACCC GGCTCTGGTA CACCTCGCCG GGCAGCGCGG CGAACATCAT GCAGGAGGGA
CTGGCGGTCG GGAACGGGCG GCTTGGGGCG ATGGTGACCG GCGACCCGGC GTCGGACGCG
CTTTACCTGA CCGACATCAC GCTGTGGACC GGGGGCGCCA ACGCCAGCCT CGGCAGCGAC
GGCCAGTTTC CGTACGGCGT TGACAACTTC GGTTCGTACC AGTCGCTGGC CCAGGCGCAC
GTCTCCGTCC CGGCGCACGC GCTGTCGGCC GTCTCCGGCT ATCAGCGGGT GCTGGACCTC
AGCAACGGGT ATGTGTCAGC GTCCTACCAG TACAACGGCG TCACGTACAC GCGTGAGGTC
TACTCCAGCA ACCCGGACGA CGTGCTCATC GTGCGGCTCA AGCAGAGCGG CGGCGGCAGC
TACACCGGCA GCGTCTCGCT CAGCGGCACG CACGGCGAGA CCACCAGCGC CGATCCGGCA
GGTCTCACCG CCTCGTTCAG CGGCACGCTC GCCAACGGCC TGAAGTACGC GTGCGCGGTC
GCGGTGACCG CCACCGGCGG CCGGGTCGCG GTCTCCGGCA GCAGTGTCAC CTTCACGTCG
TGCAGCGAGG TGATCCTCGT GTTCTCCGGC GGGACGAACT ACAAGCCGGA CCGCACTGTC
GGCTACAAGG ACAGCACCCT GATCCCGCTG TCCGTCGCCG TCGCCAAGGC CCACGCCGTC
GCCGGCGTCA GCGGCGACTC GCTGCTGGCC ACCCATGTCG CGGACTACCA GGCGCTGTAC
AACGCCACGA CCGTGAACCT CGGTACCTCC AGCACCGCGC AGCGCGCCAT GGACACCCCC
TCTCGGCTCA CGGCGCGCGC TGCGTCCGGT GCGGCGCCGG ACCCGGAGCT GGAGGCGTCC
TACCTGCAGT TCGGCCGGTA CCTGGCCATC ACCGGCTCAC GGGGGTCGCT GCCGACGAAC
CTGCAGGGAC TGTGGCTGGA CAACAACAAT CCGGCGTGGA TGAGCGACTA CCACACCGAC
ATCAATGTCC AGATGAACTA TTGGCTGCCC GACCGGGCCG GGCTGGGATC CTGTTTCGAC
GCGTTCGCGA ACTACTGCGT CGCGCAGCTG CCGGGGTGGA CGGCCACGAC CCAGAGCCTG
TTCCAGAGCT CGACCAACGG GTTCCGGAAC AGCAGCGGCA AGGTGGCCGG CTGGACCGTC
GCGATCAGCA CCAACACCTG GGGCGGCGGC GGATGGTGGT GGCACCCGGC CGGCAACGCC
TGGCTGGCCA ACGCCCTGTA CAGCCACTAC GAGTTCACGC TGGACGCCGG CTACCTGCAG
CGGATCTACC CGCTGCTCAA GGGCGCCTGC CAGTTCTGGC AGGCCCGGCT CGTCACGGAT
CCCGGCACCG GCAAGCTGAT CGACGACGCC GACTGGTCGC CCGAGCACGG GCCGACGAAC
GCCAAGGGCA TCACCTACGC CCAAGAACTG GTGTGGCAGC TCTTCCAGAA CTACACCGCG
GCGGCGGCGA AGCTGAACCA GGACGCCGCC TACGCCGCCA CCATCGCCGG CCTCCAGGCG
AACCTCTACC TGCCGCAGGT CAGCCCCACC ACCGGCTGGC TCGAGGAGTG GATGACGCCG
GACAACCTCG ACACCTCCGA CCTCACCCAC CGTCACCTGT CGCCGTTGGT CGGCCTGTTC
CCCGGCGACC GCGTCACCGC CGACCAGAGC CCGGCGGCGC TGCTCACCGG CGTCACCAAC
CTGCTGACCG CGCGGGGCAT GAACTCGTTC GGCTGGGGCA TGGCCTGGCG CGCGCTGTGC
TGGGCACGGC TGAAGAACGC GGGCATGGCC TATCAGGCGG TGACGACCGT GCTGCGGCCC
TCGGTGAACT TCAGCAACGG GGCGGCGATC AACCTCTTCG ACATGTACAG CTTCGGCAGC
AGCTCGGTGT TCCAGATCGA CGCCAACTTC GGCACCCCCA GCGCGATGAT CGAGATGCTG
GTCTACCACC GCCCGGGCCT GGTGGAGCTG CTCCCCGCGC TCCCGGACGC CTGGTCGGTC
GCCGGCAGCG TCACCGGCGT CCCGGTCCGC GGCGCGATGG CGCTGGACAT GGCATGGTCC
GGCGGGCAGG TGACCACGGC GACCCTGCAC GGCACGCCCG GCGCCGGCAC CACCGTGAAG
TTCGGCGCCT GGTCGCAGGC GGTGACCATC GGCAGCGGCG GCACCGTCAC GGTCGTCCCG
CCTCCGCGCG CGACGGTCTT CAACCTGGTC AATCGCCGCA GCGGCAAGGC GATCGACGTT
CCCGGATCGT CGACGACCGC CGGCACCGCC CTGATCCAGT ACACGCTGCA CAATTCGCCG
AACCAGCAGT GGAAGTTCGC CCCGGCCGCG ACCGGCTACA CGGTCACGAA CATCAACTCC
GGCATGGTCG CCGACGTCAA CGGCGGAAGC ACCGCCGATG GCACCGCCAT CGTCCAGTGG
CCCGCCAACT CGGGCACGAA CCAGGAGTGG ACGCTCGCCG ACGCTGGCAA CGGCTACGTC
AAGCTGGTGT GCGTCCGCAG CGGCAAGGTG CTGGGGGTCA GCCAGGACTC GACGTCGGAC
CTGGCCGGCA TCACCCAGCA GACCGACACC GGCGACATCA GCCAGCACTG GCAGCGCATC
GCAGTGCGCT GA
 
Protein sequence
MSTASLPGDS SGPGSGVDRR SLLKLGGVLG ASLAISGLPV FTSKARAVTR PGALRLVPPA 
TATRLWYTSP GSAANIMQEG LAVGNGRLGA MVTGDPASDA LYLTDITLWT GGANASLGSD
GQFPYGVDNF GSYQSLAQAH VSVPAHALSA VSGYQRVLDL SNGYVSASYQ YNGVTYTREV
YSSNPDDVLI VRLKQSGGGS YTGSVSLSGT HGETTSADPA GLTASFSGTL ANGLKYACAV
AVTATGGRVA VSGSSVTFTS CSEVILVFSG GTNYKPDRTV GYKDSTLIPL SVAVAKAHAV
AGVSGDSLLA THVADYQALY NATTVNLGTS STAQRAMDTP SRLTARAASG AAPDPELEAS
YLQFGRYLAI TGSRGSLPTN LQGLWLDNNN PAWMSDYHTD INVQMNYWLP DRAGLGSCFD
AFANYCVAQL PGWTATTQSL FQSSTNGFRN SSGKVAGWTV AISTNTWGGG GWWWHPAGNA
WLANALYSHY EFTLDAGYLQ RIYPLLKGAC QFWQARLVTD PGTGKLIDDA DWSPEHGPTN
AKGITYAQEL VWQLFQNYTA AAAKLNQDAA YAATIAGLQA NLYLPQVSPT TGWLEEWMTP
DNLDTSDLTH RHLSPLVGLF PGDRVTADQS PAALLTGVTN LLTARGMNSF GWGMAWRALC
WARLKNAGMA YQAVTTVLRP SVNFSNGAAI NLFDMYSFGS SSVFQIDANF GTPSAMIEML
VYHRPGLVEL LPALPDAWSV AGSVTGVPVR GAMALDMAWS GGQVTTATLH GTPGAGTTVK
FGAWSQAVTI GSGGTVTVVP PPRATVFNLV NRRSGKAIDV PGSSTTAGTA LIQYTLHNSP
NQQWKFAPAA TGYTVTNINS GMVADVNGGS TADGTAIVQW PANSGTNQEW TLADAGNGYV
KLVCVRSGKV LGVSQDSTSD LAGITQQTDT GDISQHWQRI AVR