Gene Caci_3582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3582 
Symbol 
ID8334935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3993472 
End bp3994950 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content66% 
IMG OID644956725 
ProductRicin B lectin 
Protein accessionYP_003114328 
Protein GI256392764 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4833] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.67342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGGCA CAAGAACAAG AAGCCTGGTC ACAGCCATCG TCGCGCTGGT CTCGGGCCTG 
GCGTTCGGTA TCGGTCTGCT CGGAGGTGGT GCGGCGGCTG CCGCGCCTCG GACCGCGGTG
CCCGCCGTCT CGAGCCCGGC TGCGGCATCC GGAGCCAAAG TCCTGATGGC CGGCTACAAC
TCCGGCAACG GCCTCATCGG TGGCGACGGC TGGTGGACCT CGGCGGTCGC GCTGAGCACG
ATCATGACGT ATCAGCAGAC CACCGGCGAC ACGTCCTACA GCTACGCCAT CGCCGGAGCC
TTCAACGCCA ACAAGGGCTC GAACTTCGAG AACGACTACA TGGACGACAC CGGCTGGTGG
GGGCTTGCGT GGGTGCAGGC TTACGACATC ACCGGAAACA CCGCGTACCT CCAGATGGCG
CAGACCGACG CGAACTACAT CCACGGCTAC TGGGATTCGG TCTGTGGTGG CGGCGTCTAT
TGGAGCACCG CGAAGTCCTA CAAGAACGCG ATCCCCAACG AGCTCTTCCT GGACCTCACC
GCCGCGCTGC ACAACCGCAT CGCCGGCGAC TCCACCTATC TCGGGTGGGC GAACGCGGAA
TGGAACTGGT TCAACGGCAG CGGCATGATC AACGGCTCGC ACCTGATCAA CGACGGACTC
ACCAGCGGCT GCCAGAACAA CGGCCAGACC GTCTGGACCT ACAATCAGGG CGTGATCCTG
GCCGGTCTTT CTGAGCTGTC CCGGGCCACC GGCAACACCG GCCTGCTCAC CACCGCGGAG
ACTCTCGCCA ACGCCTCGAC GGCACACTTC AACCAGAACG GCATCGTCGT CGAGCCCTGC
GAACCCAACT GCGGCGCGGA CGGCCCGTCG TTCAAGGGCG TCTACGTCCG CGGGCTGCGC
TCGCTGGCCA CGGCAGCAGG CACGACCGCC TATAACAGCT ACCTGCAAGC CCAGGCGAAC
TCGATCATCG CCCACGACAC CAACAGCGCC GGACAGCTCG GCCTGAGCTG GGCCGGACCC
ATCCAGTCGA TAACCTCCGG ATCCCAAGCC AGCGCCGAAG CCGCCCTCGT CGCCGCCCTC
GTCGGCACAG CGCCGCCGAT CGGCCCGATC ACCTCCGGTA TCGCCGGCAA GTGCGTGGAC
GACAACCACC AGGCCACGGC GAACGGCACC GCGATCCAGC TCTGGACCTG CAACGGATCC
TCGGCGCAGC AGTTCACGGT GAACGCCAAC GGCAGCCTCA GCGTCCTGGG CAAGTGCATG
GACATCATCA GCGGCGGCAC AGCCAACGGC GTGAAGGTGC AGCTGTACGA CTGCAACGGC
ACCGGAGCCC AGGTCTGGAA CGCGCAAAGC AACGGCACGC TGCTGAACCC GCAATCCGGC
CGCTGCCTCG ACGACCCCGC CAGCAGCACC ACCGACGGAA CCCAGCTGCA GATCTGGGAC
TGCAACGGCG GCGCCAACCA GAAGTGGACG CTGCCGTAG
 
Protein sequence
MFGTRTRSLV TAIVALVSGL AFGIGLLGGG AAAAAPRTAV PAVSSPAAAS GAKVLMAGYN 
SGNGLIGGDG WWTSAVALST IMTYQQTTGD TSYSYAIAGA FNANKGSNFE NDYMDDTGWW
GLAWVQAYDI TGNTAYLQMA QTDANYIHGY WDSVCGGGVY WSTAKSYKNA IPNELFLDLT
AALHNRIAGD STYLGWANAE WNWFNGSGMI NGSHLINDGL TSGCQNNGQT VWTYNQGVIL
AGLSELSRAT GNTGLLTTAE TLANASTAHF NQNGIVVEPC EPNCGADGPS FKGVYVRGLR
SLATAAGTTA YNSYLQAQAN SIIAHDTNSA GQLGLSWAGP IQSITSGSQA SAEAALVAAL
VGTAPPIGPI TSGIAGKCVD DNHQATANGT AIQLWTCNGS SAQQFTVNAN GSLSVLGKCM
DIISGGTANG VKVQLYDCNG TGAQVWNAQS NGTLLNPQSG RCLDDPASST TDGTQLQIWD
CNGGANQKWT LP