Gene Caci_4874 details

Gene Information

Locus tag: Caci_4874
Symbol:
ID: 8336228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5543842
End bp: 5546991
Gene Length: 3150 bp
Protein Length: 1049 aa
Translation table: 11
GC content: 67%
IMG OID: 644957973
Product: Ricin B lectin
Protein accession: YP_003115575
Protein GI: 256394011
COG category:
COG ID:
TIGRFAM ID:
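
The coordinates and lengths in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 5543842
END_BP = 5546991
GENE_LENGTH_BP = 3150
PROTEIN_LENGTH_AA = 1049


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP))            # 3150
print(expected_protein_length(GENE_LENGTH_BP))  # 1049
```

Under these assumptions, 5546991 - 5543842 + 1 gives the listed 3150 bp, and 3150 / 3 - 1 gives the listed 1049 aa.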


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.524665
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.633916
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCGC GCGGCATCAG GTCGCGGACG GCACTGTGGC TCTCCGCTCT GGTCGCCGTC 
TCAGCGGCCG CATACGGGAC GCAGGCCGCT GCGACGCCGA GTGTCGCGCG TGCGGCCGTG
CAGACCACCT TGTACGCCTC CCCCACTGGG TCGGGATCTT CCTGCTCCCA GACCTCGCCG
TGCGCGCTGG CCGAGGCGCG CAGCGTCGTC GAGGGCATGA ACGGCTCGAT GACCGGCGAC
ATCGTGGTGT ACCTGACGGG CGGCACCTAC AGGCTCACCA GCACCTTCGC TCTGGGACCG
CAGGACTCCG GGACCAACGG GCACACCGTG TACTGGGAGG CTCTTCCGGG CCAGACACCG
GTCTTCGACG GAGCGCAGCA GGTGACCGGG TGGTCGCAGT ACAACTCGGG CCTGAACATC
TGGCGCGCGC CGGTGGCGGC GGGTACGCGG GCCGGGGACC TGTGGGTGGA CGGCGCGCGG
GCCTCGGTGA CCAAGAGCGC GATCAACCCC GGCGGGTTCT CCCAGTCGGG CGCGTCGTTC
ACGACCTCCG ACGGCTCGTA TCGCTCGTGG ATCGATCCCA CCGAGGCGGA GATCGTCGAT
GACAACCCGT GGCGGCAGCT GCACTGCCCG CTGGCCGGCA TCACCGCCAA CGGCGGCGGC
TCCAGCCTGA ACGTGAACCA GGCCTGCTAC AACGCCACGT TGAGCAACAC CGGCTTCCCG
TTCAACGGCG CGGGCTACCC GACGCTGAAC CACATCACCT GGATCGAGAA CGCCTACGCG
TTGCTGAACC AGCCGGGACA GTGGTTCCTC GACAGCGCCG GCGGCAGCCT GTACTACATC
CCGCTGCCGG GACAAAACAT GTCCACCGCC GACGTCGAAC TGCCCGTGGT CCAGGATCTG
GTCGACGTGC AGGGCACCCC GGGACACCTG ACCCCGATCA ACGACACGGC GTCAGGGATC
GCCTATTCCG GATCCGGCTG GGGCTCCTCG ACCGGCCGCG GGCTCGGCGA CTTCCAGAAC
GACGTGCACT CCACCCAGAC CAACGGCGAC TCGGTCAGCT ACACCTTCAC CGGCAGCGGC
ATCACCGCGC TGACGGAGCT GAACAGCGAC GAAGGCAGCA TCGGGGTCTA CATCGACGGC
ACGCTCAACC AGACCGTCAG CGCCGCCACC TCGGGCCAGC GAACCGCCGA GGACGCGGTG
GTGGCCGTCA GCGGACTGAC GCCCGGCAGC CACACGATCA AGCTGGTCAA ACAGAGCGGG
ACCTGGATGC TGCTCGACGG CTTCGTGGTG ATCCCGACCG CCGTCCAGCC GGCCCACGAC
ATCACCTTCT CCGGGATCAC CTTCCAGCAC AACACCTGGC TCACCGAGCT CTCCCAGGGC
TACCCGGAGA ACCAGACCGG CGTCATGTGG AGCGAGACCA ACCCGTGGAA TCAGGTCAAG
GACCCCGGGA TGATCAACGT CGAGCGCGGC AACCACATCA CCTTCGCCGG CGACACCATC
GCGCACACCG GCGACGCCGG CGTCGACTTC GGCAACGGGA CCCAGAACTC CACGCTCAGC
ACCAGCAGGA TCCTGGACAC CGCCGTCAAC GCCGTGCAGG TCGGCGAGGT CGACGACTAC
TACCTGACCG ACACGGCGCT GATGACCTCC GGGGACACCG TCACCGGCAA CTTCATCGAA
CACAGCGGTG TGGTCTTCGA AAGCACCAGC GGCATCCTGG TCGGCTACAC CCGCAATGTG
ACCGTCTCCC ACAACGACGT CGGCTACTCG GCCGCGCAGG GCATCTCCGT GGGCTGGGGC
TGGGGCTACG CCTCGCCCTA CTGCTCCGGT TGCGCCCACG GCTATGACTA CGCCGGGGGG
AACCAGGTCT CGTACAACTA CGTCTACGAC AACGGCCTGG GCGACGGAGC CGGCAACACG
GTGATGGCCG AATGCATCTA CACCCTGGGC GGCCAGGGCG ACGGCAACGG CTCGGTGTGG
TCGACCCTGA CCGGCAACGT GTGCCAGGAC CAGTACATCT ACAGCAACTG GGGCACCATC
GCCCACGACG AAGGCAGCTC CTACTGGCAG GACCGCGACA ACGTCGTGCG CTGGTCGGGC
CAGGACTGGA TGTACTACGA CCAGCCCACC GTCAACAACA TCACGGTCGG TCCCACGAAC
TACTCCGACA ACGCCGGCTA CCGGGCCGTC GTCCCGAACA ACACCAGCTT CACCCAGGCG
GCCATCGTTC CCGACGGCCA GTGGCCCGCC GGCGCGCAGG CCGTCATCAC CGCCGCCGGA
CCGCCCGCGC AGGTCGCGCC CCTCACCGGA ACCCTGGACG ACACCTGCCT GTGCCTGAAC
TACACCGGGT CCTCCTGGAC GTGGAGCGGC GACCGCAGAC TCGGCGACTT GGACAACGGA
ATACACCAAG CCACCGGCAA CGGTGACAGC TTCTCCGTGC AGTTCACCGG CACCGGCGTC
TCCTGGATCG GGGAGAAGAG CGGCAGCGAG GGCACGGCGG AGATCTACGT CGACGGTGCC
GACAAGGGTT CGGTCAACGC CAACAGTTCC CCCACGCAAG CCCAGCAGAC GCTCTACAGT
GTCGCCGGGC TGGCCGGCGG CACCCACACC CTCAAGGTCG TCAAGACCGG CGGCACCTAT
CTCCAGGTCG ACGCCGTCAA CATCACCGGG ACGGCCGTCG TCACCGGCGG CTCGGGCGGT
ACCGGCGGGG GCACGGGCAC CTATCCGACC GGCTACCACG CCCTGACCAT CGCCAGCAAT
AACCTGTGCC TGGACAACTA TGGCGCTGGT TCCACCGCCG GCGCGATCAT CGACCAGTGG
TCGTGCAACA GCGGGACCAA CCAGCAGTTC CAGTTCGTCC CCACCTCCGG CGGATACGGC
CAGCTCCAGA TCGAGAACTC CGGCCAGGAC GTCACAGTGT CCGGCGGCTC GGCCTCCCAA
GGGGTGGCGG GCATCGTGCA GCAACCGGTC AGCACGTCGA CCGCCGCCCA ATGGCTGCCC
CAGCAGCAGT CCGACGGCTC CTGGCAGTTC AAGAACCTGA ACAGCGGCCT TTGCCTGGAC
GTGTACGGAG CGAGCAGCAG CCAAGGCCAG CAACTCGACC AGTGGCCGTG CAAGAACGCA
CCGGGGACCA ATCAGGACTT CAAGCCCTGA
 
Protein sequence
MKARGIRSRT ALWLSALVAV SAAAYGTQAA ATPSVARAAV QTTLYASPTG SGSSCSQTSP 
CALAEARSVV EGMNGSMTGD IVVYLTGGTY RLTSTFALGP QDSGTNGHTV YWEALPGQTP
VFDGAQQVTG WSQYNSGLNI WRAPVAAGTR AGDLWVDGAR ASVTKSAINP GGFSQSGASF
TTSDGSYRSW IDPTEAEIVD DNPWRQLHCP LAGITANGGG SSLNVNQACY NATLSNTGFP
FNGAGYPTLN HITWIENAYA LLNQPGQWFL DSAGGSLYYI PLPGQNMSTA DVELPVVQDL
VDVQGTPGHL TPINDTASGI AYSGSGWGSS TGRGLGDFQN DVHSTQTNGD SVSYTFTGSG
ITALTELNSD EGSIGVYIDG TLNQTVSAAT SGQRTAEDAV VAVSGLTPGS HTIKLVKQSG
TWMLLDGFVV IPTAVQPAHD ITFSGITFQH NTWLTELSQG YPENQTGVMW SETNPWNQVK
DPGMINVERG NHITFAGDTI AHTGDAGVDF GNGTQNSTLS TSRILDTAVN AVQVGEVDDY
YLTDTALMTS GDTVTGNFIE HSGVVFESTS GILVGYTRNV TVSHNDVGYS AAQGISVGWG
WGYASPYCSG CAHGYDYAGG NQVSYNYVYD NGLGDGAGNT VMAECIYTLG GQGDGNGSVW
STLTGNVCQD QYIYSNWGTI AHDEGSSYWQ DRDNVVRWSG QDWMYYDQPT VNNITVGPTN
YSDNAGYRAV VPNNTSFTQA AIVPDGQWPA GAQAVITAAG PPAQVAPLTG TLDDTCLCLN
YTGSSWTWSG DRRLGDLDNG IHQATGNGDS FSVQFTGTGV SWIGEKSGSE GTAEIYVDGA
DKGSVNANSS PTQAQQTLYS VAGLAGGTHT LKVVKTGGTY LQVDAVNITG TAVVTGGSGG
TGGGTGTYPT GYHALTIASN NLCLDNYGAG STAGAIIDQW SCNSGTNQQF QFVPTSGGYG
QLQIENSGQD VTVSGGSASQ GVAGIVQQPV STSTAAQWLP QQQSDGSWQF KNLNSGLCLD
VYGASSSQGQ QLDQWPCKNA PGTNQDFKP
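
Both sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the block formatting, computes a GC fraction, and translates DNA using the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in the start codons it permits, which this sketch does not model). All function names here are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAGGCGC GCGGCATCAG GTCGCGGACG"
print(translate(first_blocks))  # MKARGIRSRT
```

The ten residues printed match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein and GC content to be checked in the same way.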