Gene Caci_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2054 
Symbol 
ID8333398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2324754 
End bp2326133 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content74% 
IMG OID644955204 
ProductRicin B lectin 
Protein accessionYP_003112815 
Protein GI256391251 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCGA GTGGGTACCG GCCCGGTAAC CAGAGCACCG GGCGCCAGGT GCGCGGACGC 
TCGCTGCACG ACGTGCTGCA CGATCCGGGC ACCGGTATCG CGCAGCAGGC GGTCATCCCG
CTGGACCCGA GCCGCGTGGC GCTGATCGCC CGCGCGATCC TCACCGCGCT GGCCGCCGGC
ACCGTCCACG GCTACATCAC CCCGCACACC ATCGTGCTCA CCGACGACGG CACCGCGCTG
CTGGTCGGCG ACCGCGTCGC GCCGGGCGCC ACCCCGGCGA CCGACGTCTT CGCGCTGGGG
ATGGCGCTGT TCGAGGCGGT GGAGGGATAC GCGCCGCATC CGGCCGAGCC GCTGCCGCCG
ATGACGCGCG CCGGGGCGCT CGGGCCGCTG ATCGAGGCGC TGACCGCGCA GGATCCGGGG
CGGCGGCCGA CTGCGGCGGC GGCGCTGGGG CTGCTGGAGG CGGGGATGGC GGCGCCGGCT
TCGCCGGACG CCACCGAGAC GCGGACGCTG CCGCAGGTTC CAGCGGCAGG CGCGACAGGA
GCACCGGCAG CCGGGAACAG CGCGGATTCG ACAGCTACGT CCCTGCTGCC TCCGGTTCCG
GCCGGACCGG CGATGGGGAT GGCTGCGCCG CCCGGCACTT CCGGCTCTTC CGGGCCACCC
GGACCACCCG ACTACAGCGC TTCGGGCTAT GCGACATATG ACGACTATCC AGAGGCGAAG
CGCCGCGCGT TGCTCATAGG CGGCGGGTTG GCGGCGGTCG TGATCATCGC CGGAGTCGCG
CTCGGGGTCA CGAGACCGTG GCACCAGACG GACACCAGCA GTTCGACCGT CCCCGGCGTG
CCGACCGCGA GCTCCGTGCC GACCACTCCG GTCGCCGCGA CGCCGTCCGC CTCGGTACCG
GCCACGCCGT CCACGCCGTC CACGACCAGC AGCACTCCCA CCCCGACGCC GACGCCGACC
CCCACCCCGA CGGTCTCGAA CCCGACCGCT CTGGTGTCGA TGTCCCTGTG CCTGGACGCG
GCGGCCGACG GCGGCATCCA CTCCGGCGCG AAGGTCACTG CCGGGACCTG CCAGGGCAGC
GCGAACCAGG GGTGGCAGAT CCACACCGAC GGCACGATCC GCTCCCTGGC CGACGCGAAC
CTGTGTCTGG ACGCGGCCGC CCCGAACGGC AAGGTCGCGA ACGCCGCGCA GGTCGCGGTG
TGGAACTGCC ACGGCGGGTC GAACCAGGCT TGGGTCTGGA ACGCCAACGG CACGGTGTCG
CCGGTGGCGA ACACCGCGCT GTGCCTGGAC GCCGCGGGAC CGGTGCCGAT CCACGCGGGC
GCGAACGTCA CGGCGTGGCC GTGCAACGCC GGACCGAACG AGATCTGGGC GAAGCAGTAG
 
Protein sequence
MDASGYRPGN QSTGRQVRGR SLHDVLHDPG TGIAQQAVIP LDPSRVALIA RAILTALAAG 
TVHGYITPHT IVLTDDGTAL LVGDRVAPGA TPATDVFALG MALFEAVEGY APHPAEPLPP
MTRAGALGPL IEALTAQDPG RRPTAAAALG LLEAGMAAPA SPDATETRTL PQVPAAGATG
APAAGNSADS TATSLLPPVP AGPAMGMAAP PGTSGSSGPP GPPDYSASGY ATYDDYPEAK
RRALLIGGGL AAVVIIAGVA LGVTRPWHQT DTSSSTVPGV PTASSVPTTP VAATPSASVP
ATPSTPSTTS STPTPTPTPT PTPTVSNPTA LVSMSLCLDA AADGGIHSGA KVTAGTCQGS
ANQGWQIHTD GTIRSLADAN LCLDAAAPNG KVANAAQVAV WNCHGGSNQA WVWNANGTVS
PVANTALCLD AAGPVPIHAG ANVTAWPCNA GPNEIWAKQ