Gene Caci_3804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3804 
Symbol 
ID8335157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4303239 
End bp4304732 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content67% 
IMG OID644956943 
ProductRicin B lectin 
Protein accessionYP_003114546 
Protein GI256392982 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.404288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.817018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATCGT CGTCCGAGCA CAACCCAGAT CGCAAAAGCC TGGCCGCCAA GCTCCGAATT 
CTCGCGGCGG GCGCGCTCGC CGCCACGAGT CTGGTGGCGG CCGGGCAGAT CCCGGCGCAC
GCGGCCACCG CCGCGGCCGC TTCCACCTCC CAGTTCAAAG GCGTGAACTG GGCCGACCAG
CGCGACAACT TCGTCAACGG CGTCCTGTAC CCCTCCGGCC TCAACGCCTC CGACACCCAC
GCCTCCGCCT CGACGGTCGC GGCCCAGGTC GTGGGCCAGC TCGACACGAT CACCGGCGCG
AACACCGTCC GGATGCCGAT CAACGAGCCG ACCGTCTCGA CCTACTGGAG CACCTACACC
GGCGCGATCG ACGCGGCGCT CACCAAGGGC AAGGTGATCC TCGCCTACTG GGCCTACAGC
GGCGGGAAGC CCACCAGCAC GGCGGCGTTC AACCAGATGT GGGACACCGT CGTCGCCTCG
TACGGGAGCA ACCCGAACGT GTACTTCGAG GTCATCAACG AGCCTTACGG CTACAGCTCC
ACGGACTTGA ACAACTTCTA CAACACCTGG CTGACCAGGT ATCCCGCCGT CCCGCGCGGT
CAGGTCATCC TCGACGGTAC GGGCGACGCC ACGAACATCG CGGGAGTCGC CGGCGACAGC
CGGCTGGCGA ACACGCTGCT CGCAGTGCAC TACTACACGT TCTTTGCCGG AACATCCACG
AACGAGTCCG ACTGGGCGAA CGGCATCGCG AACGAGATCG GCAGCTATGC GAGCCGGACT
GTCGCCACCG AGTGGGGCGC GCCGATGAGT CCCGGCAGCA AGAACGGCGT CCACTACGAC
ACGATCAACT ATGACGTGCC GGGCGGGAAC TTCTTCGACG CCTACGTCCG GGGCGTCAGC
AGCGAGCTGC GCAAGCTCGG CGTCGGCAGC GTGTACTGGC CGGGGCTGCG TGACGGCGAC
TGGTACAGCC TGACCAGTAA GACCGGTACC GGTGCGTCGA TCGCGCTGAC GCTGGTGAAC
GCCTCCGGGC TGGACCGGCT GCAGTACGCG TGGGGAATCG GCAACGGCGG TGGCGGCGGC
GGGACGTACG ACCAGATCCG TGACGTGGCC ACCGGCCTGT GCGTCGACGG TCTGGGCAGT
ACCACAGTCG GTACCAATGC CAGCCAGTCC AGCTGCGTCA CAGGCGACAC CAACCAGGAG
TGGACCATCG TGAGCAGCGG GGGTTACGTC CGTATCCAGA ACCGCGCCAC CGGCCTGTTC
CTCGACGGCA TGGGCCGCAC GACCAACGGT TCAGCAGCCG GTCAGTACAG CAGCTCCACC
AGCAACAACC AGCAGTGGAC CGAGGTGAGC ACCGCCGGCA GCGCCCGCTT CCAGAACCGC
GCGACGGGCT TGTACCTCGA CGGCATGGGC CGCACCTCCA ACGGCTCCGA CCTCGGCCAA
TACGCCGGCA GTACCAGCAC CAACCAGCAG TGGACTCTGG TATCCGCGAG CTGA
 
Protein sequence
MRSSSEHNPD RKSLAAKLRI LAAGALAATS LVAAGQIPAH AATAAAASTS QFKGVNWADQ 
RDNFVNGVLY PSGLNASDTH ASASTVAAQV VGQLDTITGA NTVRMPINEP TVSTYWSTYT
GAIDAALTKG KVILAYWAYS GGKPTSTAAF NQMWDTVVAS YGSNPNVYFE VINEPYGYSS
TDLNNFYNTW LTRYPAVPRG QVILDGTGDA TNIAGVAGDS RLANTLLAVH YYTFFAGTST
NESDWANGIA NEIGSYASRT VATEWGAPMS PGSKNGVHYD TINYDVPGGN FFDAYVRGVS
SELRKLGVGS VYWPGLRDGD WYSLTSKTGT GASIALTLVN ASGLDRLQYA WGIGNGGGGG
GTYDQIRDVA TGLCVDGLGS TTVGTNASQS SCVTGDTNQE WTIVSSGGYV RIQNRATGLF
LDGMGRTTNG SAAGQYSSST SNNQQWTEVS TAGSARFQNR ATGLYLDGMG RTSNGSDLGQ
YAGSTSTNQQ WTLVSAS